For my personal usage, I do use Grafana, and it does make life easier for solving stuff.
External reviews
External reviews are not included in the AWS star rating for the product.
Community support and visualization features impress but integration guides could simplify
What is our primary use case?
What is most valuable?
The main benefits I have seen from using Grafana in my day-to-day activities is the visualization of the metrics, specifically Dora Metrics. Instead of saying 'I feel the team is developing bad or slow,' you can actually have metrics to support that. You can actually show that this team could work better here, in this case, or needs to focus more on peer reviews or on deployments. The visualization of the data is key.
What needs improvement?
For my personal usage of Grafana, I think there could be improvements, particularly that I use the Grafana Cloud SAS solution. The integration guides could be simpler and easier.
Regarding the clarity of the official documentation for installation, I think the official documentation, which has something called Alloy, the Alloy integration, is not that clear. It was mostly for personal usage that I'm talking about right now.
For how long have I used the solution?
I have been using Grafana for around a year, maybe more.
What do I think about the stability of the solution?
Grafana is stable; however, in the previous company, we also had Grafana for a long period of time, and its stability depends on the load. Once you get to a higher load, you need to re-evaluate your architecture and put that into account.
What do I think about the scalability of the solution?
In assessing Grafana's scalability, we started noticing logs missing or metrics not syncing in time, which indicated a bottleneck that we had to address by updating the infrastructure.
How are customer service and support?
I'm not familiar with Grafana's feature in terms of community and documentation since I've been here only a few months. The community provides extensive knowledge about Grafana. You will get faster answers about Grafana than about Datadog from the community.
My advice for people who are new to Grafana or considering it is to reach out to the community mainly, as that's the primary benefit of Grafana.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
The reasons for moving from Grafana to Datadog include the fact that Datadog is a SAS solution, and we don't need to manage it. I was told, and as I mentioned I'm kind of new, there was already Datadog running, so it's not creating a new one but just moving the data. The main benefit of Datadog versus Grafana is the ease of creating a dashboard, which is just simple for the user.
What about the implementation team?
In the previous company, about three or four people worked on Grafana implementation. It was an ongoing effort that took several months to complete.
Which other solutions did I evaluate?
Grafana is easier to sell to management compared to Datadog and other solutions because it has an open-source tier, which makes it easier.
If I'm coming as a personal user, then Grafana is the best solution out there right now. But if we're looking at it as an enterprise, then when money is less of an issue, Datadog is much easier to use and implement than Grafana.
What other advice do I have?
Managing and maintaining Grafana is kind of managing another application; as far as Grafana itself, you don't have much to do. It's mostly about the infrastructure, resources, and if you're deploying it in high availability, maybe more instances. So it's mostly infrastructure work and less about Grafana itself.
I rate Grafana an eight out of ten.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Efficient real-time log analysis and resilient adaptation to open source dynamics improve operational workflows
What is our primary use case?
We manage log processing with Grafana because we found that it is much easier for us to manage it on our infrastructure on AWS. We can maintain all the things we do not need. DataDog told us we have to wait and that they need to provide features we need to develop, so it does not suit our needs.
We are still using DataDog, but for important assets that we need to analyze the logs, we send it to Grafana.
The challenges we face with DataDog compared to Grafana include the need to analyze very important brands, network trafficking, and maintaining many websites, most of which are very important domains that cost a lot of money, so we are getting attacked each day and we need to analyze all of the logs. Sometimes we have false positives and things similar to that, so we have to make sure that we are doing the correct decision of blocking or trying to mitigate attacks. Using the logs with Grafana it is much easier for us to analyze rather than DataDog. DataDog has their own language and they want you to plot things with their own vocabulary. We do not have time to memorize things. We especially wanted to use something that was open source at the beginning, and then other people started using it, took that product and modified it for extra cost, but it is a better solution for us.
We switched from DataDog to Grafana because we wanted to reduce the logs costs, as we are streaming approximately five million logs or even less.
What is most valuable?
We can find information with Grafana much more quickly compared to DataDog because it was open source and there was extensive documentation about it, enabling us to fetch data or information much more quickly using AI tools. With DataDog, they always wanted us to have a meeting or talk with us on a call. It was redundant. We just wanted to get to the solution without making a big deal out of it.
Grafana saves us hours compared to DataDog. It takes about two weeks to figure out what is going on with DataDog, but with Grafana, we just started to initialize the service, had a few issues, fixed them, and that was it. I did not have any major problems that forced me to halt everything in my work. It cost me hundreds of hours with DataDog because I needed to see all the documentation and all the special caveats they have there.
What needs improvement?
I would rate Grafana overall as an eight out of ten. It is pretty good, and I would recommend it. I would give it a ten if it were much simpler for users who just want to get a simple objective in Grafana and are not experienced with technical configuration. It would be better if users could simply state they want to see the amount of requests on a graph through an AI implementation. For example, when getting attacked, users should be able to easily filter all requests to a specific site or resource, or identify IPs that were recently attempting access.
I would describe Grafana's ease of use as a necessity to just get things fixed. We are working in a marketing company, and if something does not work great, or people are looking for the solution, I am looking to get a temporary solution until I can fix it properly, and then I can reach support if I cannot fix it for the long-term.
For how long have I used the solution?
I have been using Grafana for quite a long time, but we have only recently started using all of its features.
What do I think about the stability of the solution?
I would rate Grafana's overall stability as much more stable because when it is not working, users are given a much broader oversight of what is not working, rather than talking to DataDog and asking them to check it out. When something in their dashboard does not work, because it is open source, I am able to find all the relative combinations that people are having, making it much easier for me to fix.
What do I think about the scalability of the solution?
The size of our infrastructure affects using Grafana as it really depends on that. We are considered quite small or medium at least, so it is quite easy for us. In terms of our company, the infrastructure is using two availability zones in AWS, one is US East, so we are looking to expand. Currently, we have only two availability zones to deploy, so it is quite easy for us right now.
How are customer service and support?
I do not use Grafana's support for technical issues because I have found solutions on Stack Overflow and ChatGPT helps me as well.
On a day-to-day basis with Grafana, I tend to refrain from using their support, not because of their level of professionalism.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
We switched from DataDog to Grafana because we wanted to reduce the logs costs, as we are streaming approximately five million logs or even less.
How was the initial setup?
I initially learned to use Grafana by working locally and seeing how it works and presenting a proof of concept. We started to make a small plan of how we could make the change, how to deploy the switch, making the changes and checking it in other environments that we have. We are working with Agile, so I have to work with development first and then we are staging it. It is a long process but eventually we made the switch quite quickly, it was just a simple day when we deployed it.
What about the implementation team?
Three people manage Grafana in our organization: one is a sysadmin, another one is a DevOps full-time, and I am managing the entire operation.
What's my experience with pricing, setup cost, and licensing?
The costs associated with using Grafana are somewhere in the ten thousands because we are able to control the logs in a more efficient way to reduce it. That is pretty much great for us.
What other advice do I have?
My recommendation for future users of Grafana is that it is all great. I am hoping that all the companies, at least the open-source projects, do not go closed source because then users will have to find something else. That is what we love to do - we love to use open source projects and improve them for our usage, not something that follows an agenda of another company's product.
My recommendation to other users of Grafana is to not be afraid and always look online. I started using it a long time ago for different projects. Users need to learn about the basics before they can really understand what is going on. There are people who start the self-hosted web server and see all the metrics going to their server, but they do not really understand what is happening, they just see a small count of the graph. Users need to understand that configuration is necessary. I experienced this when I started and was clueless, thinking of going to DataDog. Then I realized I just did not understand it correctly - there is much more once you see a bigger horizon of things that you could do.
Overall rating: 8/10
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Dashboard integration and data source setup simplify monitoring tasks
What is our primary use case?
My main use case is monitoring. We use a lot of different products for monitoring, but Grafana is specifically for monitoring Kubernetes. We use Grafana mainly for Prometheus.
What is most valuable?
The features I appreciate most are the dashboards and the integrations with multiple data sources. The feature that sets Grafana apart from its competitors is how easy it is to set up data sources. The integration helps our organization in centralizing and analyzing data from diverse sources.
What needs improvement?
Regarding joining between queries, merging between two queries that give the same information could be simple, and there are some ways to do that, but if there was something even easier, it would be great.
For how long have I used the solution?
The deployment process was completed before I joined the organization. We have been using it for a couple of years, but it was deployed previously.
What do I think about the stability of the solution?
We never had any issues with Grafana at all.
How are customer service and support?
Grafana's customer support is mainly for developers. We didn't need to reach out to them for troubleshooting or any issues.
How would you rate customer service and support?
Positive
What other advice do I have?
We centralize all the metrics from Prometheus and also from Graphite and all other data sources. We have dashboards to integrate Grafana's real-time metrics with visualization capabilities. We're not using Grafana's role-based access control and multi-tenancy features.
Seeing the metrics helps in finding issues, such as memory leaks or spikes and some optimization. We use Grafana on a day-to-day basis to get a better look at our environments and the usage of our resources.
We don't use Grafana for alerting, just for visualization. For alerting, we have different tools. On a scale of 1-10, I would rate Grafana as 8.5.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Open-source solutions work adequately but future AI enhancements and advanced tracing capabilities are desired
What is our primary use case?
What is most valuable?
What needs improvement?
For how long have I used the solution?
What do I think about the scalability of the solution?
How are customer service and support?
How would you rate customer service and support?
Neutral
What other advice do I have?
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Effective alerting and user management enhance monitoring but needs a better interface
What is our primary use case?
I use Grafana mostly for monitoring purposes, such as tracking errors or alerts. When certain thresholds of warnings or errors are breached, Grafana sends me an alert because I am supplying it with data. Another use case I have is monitoring CPU usage and bandwidth.
What is most valuable?
I find Grafana beneficial due to its numerous plugins. Its alerting feature is effective because it allows me to set thresholds to send an email if a certain threshold is met. User management is good, and the administration features enable me to control user access.
What needs improvement?
The interface is not well-liked by my customers. Grafana cannot be easily embedded into certain applications and offers limited customization options for graphs. I think predictive capabilities in alerting could also be improved.
What do I think about the scalability of the solution?
Scalability is uncertain for me since I installed Grafana on only one server and did not scale it further.
How was the initial setup?
The setup process was easy for me to follow. I just needed to adhere to a step-by-step guide which was quite straightforward.
What other advice do I have?
I end up using my own customized open-source visualization tool instead of Grafana. Most of the time, I deal with technical aspects.
I rate the overall product seven out of ten.
Which deployment model are you using for this solution?
Displays data visually from multiple sources while integrating seamlessly with existing systems
What is our primary use case?
The primary use case for Grafana is to display data from Prometheus in a graphical user interface for easy monitoring and analysis of metrics. It is used for end-to-end tracing of applications and integrates with AWS services.
How has it helped my organization?
Grafana improves the organization by showing data from Prometheus in a user interface.
What is most valuable?
Grafana provides a user-friendly interface for viewing infrastructure metrics through dashboards. It integrates with Prometheus to pull data and offers a straightforward setup process.
Users can monitor metrics with greater ease, and the tool aids in quickly identifying issues by providing a visual representation of data. Grafana's integration capability is straightforward, which facilitates building dashboards as needed.
What needs improvement?
Grafana could consider building its own metrics system to eliminate the reliance on other tools like Prometheus, providing a one-stop solution.
For how long have I used the solution?
I have used Grafana for the last four to five years.
What do I think about the stability of the solution?
There are some instances, about one to two percent of the time, where dashboards do not work, however, these are manageable and typically attributed to the underlying data source.
What do I think about the scalability of the solution?
Grafana is pretty scalable as it can source data from multiple providers without limitation.
How was the initial setup?
The initial setup was straightforward. Those familiar with deployment find it easy to manage and deploy.
What about the implementation team?
The implementation is managed by a different team, and issues with integration have been minimal.
What's my experience with pricing, setup cost, and licensing?
I used the open-source version for my learning, which was free. In an enterprise setting, pricing is reasonable, as many customers use it.
What other advice do I have?
I rate Grafana eight or nine out of ten. The deployment of multiple tools could be streamlined into a single solution for better efficiency.
Complex features hinder streamlined user interface and good stability
What is our primary use case?
I use Grafana to deploy OpenVideo and to write an article on Medium on how we can deploy some open source projects with CloudFormation.
How has it helped my organization?
I have installed Grafana to see custom metrics like the number of connected users, number of rooms, and CPU usage.
What is most valuable?
All the features are complicated. It's comparable to Terraform, and I prefer CloudWatch.
What needs improvement?
The features are complicated and not intuitive. It would be better if they made the technology easy to use without needing to read extensive documentation. The dashboard interface is too wide, and it's challenging to find buttons or features.
What do I think about the stability of the solution?
Grafana is stable and has great engineers.
What do I think about the scalability of the solution?
I would rate the scalability at six out of ten. It meets some needs, however, it could be improved.
Which solution did I use previously and why did I switch?
I prefer CloudWatch over Grafana due to its ease of use.
How was the initial setup?
The initial setup is too complicated. Even at a junior level, I found it challenging to navigate without guidance.
Which other solutions did I evaluate?
I prefer CloudWatch with custom metrics.
What other advice do I have?
Make the interface intuitive and user-friendly. Highlight features clearly on the interface so that users can easily find and access them.
I'd rate the solution four out of ten.
Which deployment model are you using for this solution?
Can monitor metrics on dashboards but needs to have own query language
What is our primary use case?
We use the tool to monitor metrics on dashboards.
What is most valuable?
It's easy to make changes. We can put many dashboards in one place. There's no delay in showing data - what you see on Grafana matches the server metrics.
What needs improvement?
Grafana could improve by having its own query language. Currently, it uses languages like Prometheus or InfluxDB, which not everyone knows. Their query language should be easier.
For how long have I used the solution?
I have been using the product for four years.
What do I think about the stability of the solution?
Stability-wise, our on-premises version was mixed, but the cloud version is stable.
What do I think about the scalability of the solution?
My company has 2000 users.
How was the initial setup?
Setting up Grafana is moderately difficult, especially writing query languages and custom metrics. Creating custom metrics requires writing new query language at the server level, which can be challenging.
What other advice do I have?
If you want to use Grafana for the first time, I'd say you can. It's cost-effective compared to other products like Datadog. But if cost isn't an issue, I'd recommend Datadog. For beginners, learning to use Grafana is moderately difficult. I rate it a seven out of ten.
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Provides good alerts and is useful for monitoring user traffic and visualizing metrics
What is our primary use case?
We use it to monitor user behavior. We have AI LLM models in NVIDIA Triton. NVIDIA Triton gives us some metrics. These metrics are integrated with Grafana and Prometheus. We monitor user traffic to visualize metrics like p99 latency, increase in user and CPU performance, utilization of the GPUs, RAM utilization, and CPU utilization.
What is most valuable?
We use the free templates of the solution. We have tested some alerts on metrics. We like the alert features. We plan to use it more. Slack and PagerDuty alerts in Grafana are helpful.
What needs improvement?
It was pretty complicated to integrate the product into our workflow. We relied on templates we found on the GitHub open-source repositories. It is pretty hard to integrate Grafana and set it up according to our use case. The solution must provide tutorials and guides.
We are a small team. We don't have the time to learn Grafana end-to-end, but we want some guides to help us use it quickly. Prebuilt templates will help a lot. The product must provide more alerting metrics. It must be integrated with iOS and Android mobile phones.
For how long have I used the solution?
I have been using the solution for four months.
What do I think about the stability of the solution?
The tool’s stability is quite good. I rate the stability a nine out of ten.
What do I think about the scalability of the solution?
I rate the tool’s scalability a six out of ten. Scaling is complex. We have four developers in the organization who use the product.
How was the initial setup?
We have good Docker experience, so the setup was pretty straightforward. We had to run a couple of Docker commands. We integrated everything with Docker Compose. As we scale up, the setup will get complicated. If the number of users increases, we might consider switching to the cloud.
Which other solutions did I evaluate?
Compared to Elasticsearch, ELK Stack, and Logstash, Grafana is pretty strong. Grafana is way ahead of its competitors. SigNoz is also an alternative to Grafana and Prometheus. SigNoz is pretty easy to set up.
What other advice do I have?
We haven't worked or dived deep into the documentation and tutorial. I will recommend the product to users willing to take the time and patience to set up and learn Grafana in-depth. There are a lot of tools that we can use to build excellent integrations and get the most out of them with little knowledge. However, to leverage the value of Grafana and Prometheus, we must learn the tools in depth. It will save us time, and we can get much value if we invest time and effort. Overall, I rate the solution an eight out of ten.
Which deployment model are you using for this solution?
A tool that users can understand and learn quickly to use
What is our primary use case?
I use Grafana since it is interconnected with Zabbix. I also use Grafana for the dashboard of machines from VMware and for some network devices connected to Zabbix.
What is most valuable?
The ability to use Grafana with Zabbix is a valuable feature for me. Zabbix provides me with alerts and information if something goes wrong, while Grafana offers a better dashboard that is more stable. If you know Grafana well, it can show you everything, like the uptime of different sources and other areas.
What needs improvement?
I have a problem with Grafana in the area of documentation. There is a problem in the area of some graphs where you connect elements. With Grafana, there is only one name that can be included for ten hours. In Grafana, only one big field with one text name is shown in the documentation. When it comes to production, it is only for a period of time that some fields on each side give the information they have. I researched on Google and other forums and saw that people face problems when they can't connect a field with the same value. The aforementioned area in the tool needs improvement.
For how long have I used the solution?
I have been using Grafana for nine months. I am a customer of the product.
What do I think about the stability of the solution?
I have always had a stable version of the product. In general, I have not had any issues with the product.
What do I think about the scalability of the solution?
The technical personnel in my company use the tool, and it is possible to add or delete devices and other statuses in the solution. Around five people in my company use the product.
How are customer service and support?
I have always used the online forums or the internet to find information related to the product. I have not contacted the tool's technical support.
How was the initial setup?
The product's initial setup phase was very easy.
The solution is deployed on an on-premises model.
What's my experience with pricing, setup cost, and licensing?
For me, Grafana is a cheap tool because I don't have to spend much time learning the product since it is a simple solution. I don't know if there are different paid versions of the product.
Which other solutions did I evaluate?
My company chose Grafana over other products in the market considering its integration capabilities with Zabbix, good documentation, and the many topics about the tool that are available on the internet.
What other advice do I have?
I would suggest Grafana to those who use Zabbix because it helps users get a lot of information on networks and resolve a lot of problems.
I could do everything in Grafana with the help of graphs and documentation and get information on different types of devices.
I rate the overall tool an eight to nine out of ten.