
Overview
Datadog is a SaaS-based unified observability and security platform providing full visibility into the health and performance of each layer of your environment at a glance. Datadog allows you to customize this insight to your stack by collecting and correlating data from more than 600 vendor-backed technologies and APM libraries, all in a single pane of glass. Monitor your underlying infrastructure, supporting services, applications alongside security data in a single observability platform.
Prices are based on committed use per month over total term of the agreement (the Total Expected Use).
Highlights
- Get started in minutes from AWS Marketplace with our enhanced integration for account creation and setup. Turn-key integrations and easy-to-install agent to start monitoring all of your servers and resources in minutes.
- Quickly deploy modern monitoring and security in one powerful observability platform.
- Create actionable context to speed up, reduce costs, mitigate security threats and avoid downtime at any scale.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Trust Center
Buyer guide

Financing for AWS Marketplace purchases
Quick Launch
Pricing
Custom pricing options
How can we make this page better?
Legal
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
Contact our knowledgable Support Engineers via email, live chat, or in-app messages
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.


Standard contract
Customer reviews
Centralized monitoring has reduced troubleshooting time and improves proactive incident response
What is our primary use case?
My main use case for Datadog is infrastructure and log monitoring in a cloud-based environment. From a network and security perspective, I mainly use it to monitor server health, track network-level metrics, and analyze logs for troubleshooting issues such as VPN instabilities, traffic spiking, or unexpected behavior.
One recent example where I used Datadog was during a VPN-related issue where users were reporting intermittent disconnections. I checked our Datadog dashboard and noticed spiking in network latencies and a sudden increase in connections dropped around the same time users reported the issues. I then correlated this with the logs and found that one of the back-end servers handling the connection was hitting high CPU utilization. Because everything was centralized, I did not have to jump between multiple tools. I was able to quickly identify the impacted servers and escalate it to the infrastructure team. Once the load was balanced, the issue got resolved.
With Datadog, I mainly focus on creating meaningful dashboards and tuning alerts properly. In the beginning, we saw a lot of alert noise, so we had to refine thresholds and conditions to make sure alerts are actually actionable. Once that was done, it became much more effective for proactive monitoring instead of just reactive troubleshooting.
What is most valuable?
One of the best features of Datadog, in my opinion, is its unified visibility across the metrics, logs, and traces in a single platform. The dashboards are very flexible and customizable, which helps a lot in creating meaningful monitoring views based on different use cases. I also find the log management quite useful because it allows quick correlation with metrics during troubleshooting. Another strong feature is its integration, especially with cloud platforms such as AWS or Azure , which makes onboarding and monitoring much easier without heavy manual work.
Integration with cloud platforms such as Amazon Web Services or Microsoft Azure has really made daily monitoring much easier. Once the integration is set up, Datadog automatically pulls metrics from services such as virtual machines, load balancers, and databases without needing manual configuration on each resource. In one case, I was monitoring a cloud-based application where we started seeing performance issues through Datadog's Azure integrations. I could quickly view metrics from the application server and the back-end database in the same dashboard. It helped me identify that the issue was not network-related but due to the increased load on the backend services. Instead of checking multiple portals, everything was available in one place, which saved time and made troubleshooting faster.
Datadog has had a positive impact mainly by improving visibility and reducing troubleshooting times. Earlier, we had to rely on multiple tools to check metrics and logs, which delayed root cause analysis. With Datadog, everything is centralized, so it is much faster to identify issues and take actions. It has also helped in proactive monitoring with properly tuned alerts. We are able to detect unusual behaviors such as spiking in traffic or resource usage before it turns into a major incident. Overall, it has improved operational efficiency and reduced downtime by enabling quicker responses during incidents.
What needs improvement?
If you are asking for improvements, I feel some small areas where Datadog can improve. One area is alert management. In a dynamic environment, it can generate a lot of alert noise if not tuned properly. More intelligent alerting or built-in recommendations would help. Another aspect is cost visibility. As log ingestion increases, pricing can scale quickly. Having more transparent and granular cost control features would make it easier to manage usage. Also, the initial setup and configuration can feel a bit complex for new users.
For how long have I used the solution?
I have been using Datadog for ten months.
What do I think about the stability of the solution?
In my experience, it has been quite stable; we have not faced any major outages or reliability issues from the platform side. Data collection and dashboards have been consistent, and alerts are delivered on time as long as they are properly configured. Most of the issues we have seen were related to configuration or alert tuning rather than the platform itself.
What do I think about the scalability of the solution?
It has scaled well for our needs. As we added more servers and services, Datadog was able to handle the increased load without any major issues. Since it is a SaaS platform, we did not have to worry about backend scaling. New hosts and services get onboarded easily with the agents, and metric collection continues smoothly even as the environment grows. The only thing we monitor closely is log volume because as scale increases, ingestion and costs also go up, but from a performance and handling perspective, it has been quite good.
How are customer service and support?
In my experience, the customer support from Datadog has been quite reliable. For standard issues and queries, the response time is generally good, and the documentation is also very helpful for resolving common problems. For more complex cases, support may take some time for investigations, but they usually provide proper guidance and follow-up. Overall, I would say support is responsive and helpful, especially when combined with their strong documentation.
Which solution did I use previously and why did I switch?
This is the first time I am using Datadog. Before that, there was not any solution in place.
How was the initial setup?
The initial setup cost is relatively low since it is a SaaS model and getting started is straightforward with agent-based deployments. However, the main challenge is the ongoing cost, which depends on data ingestion such as logs, metrics, and traces. As usage grows, especially with log collection, the costs can increase quickly, which requires proper planning around what data to collect, retention policies, and filtering to keep control. Overall, I think it is flexible, but cost optimization needs continuous monitoring.
What was our ROI?
We have seen a return on investment with Datadog, mainly in terms of saving operational efficiency. For example, earlier our troubleshooting process involved checking multiple tools, which used to take around forty to forty-five minutes just to identify the root cause. With Datadog, since metrics and logs are centralized, we are usually able to reduce the time to around ten to twenty minutes in many cases. This has improved our response time and reduced the duration of incidents. While it may not directly reduce headcount, it definitely improves team productivity and helps handle more issues efficiently with the same team.
While we do not track exact numbers in all cases, with Datadog we have definitely seen a noticeable improvement in incident response time. For example, earlier it could take around thirty to forty-five minutes to identify the root cause analysis because we had to check multiple tools. With Datadog's centralized dashboards and logs, we are usually able to narrow it down within ten to fifteen minutes in most cases. We have also seen fewer escalations for minor issues because alerts help us catch problems earlier, which indirectly reduces downtime and improves overall efficiency.
Which other solutions did I evaluate?
We did consider a few alternatives, but they each have their own standards. We considered solutions such as Splunk, New Relic , and Prometheus. Everything is more costly, but I prefer Datadog. I have just heard about Datadog and other monitoring tools from some colleagues. As per their comparisons, I feel Datadog is much better.
What other advice do I have?
If anyone is looking to use Datadog, I would advise planning your monitoring strategy from the beginning. Focus on what metrics and logs are actually important because collecting everything can increase noise and costs. It is also important to spend some time on proper alert tuning; otherwise, you may end up with too many non-actionable alerts. I would also recommend starting with key integrations, especially with cloud platforms, and then gradually expanding use instead of enabling everything at once. I would rate this product an eight out of ten.
Unified monitoring has improved incident detection and reduced resolution time across our stack
What is our primary use case?
Datadog 's main use case is end-to-end monitoring that helps check problems across infrastructure, application, database, security, and logs.
For example, when checking a problem with a mobile application such as an error from a user hitting a transaction, we check from the client-side mobile device and also from the back end for the API to see if there is latency or an error that triggers the problem. There may be an issue on the database, such as a locking query or high latency on query performance. For infrastructure, if the application is slow, it may be impacted on infrastructure monitoring by CPU and memory consumption.
Datadog is a powerful observability tool that allows us to correlate and see problems on the infrastructure or application side. In an incident war room, we can see the correlation and the detailed root cause of the problem across real user monitoring, application, database, and infrastructure.
How has it helped my organization?
Datadog has positively impacted our organization because our customers are very happy using it. With silo monitoring, where infrastructure has separate monitoring, application has another, and cloud has another, it becomes tricky and complex. We cannot correlate the silo monitoring, and pricing is complicated. With Datadog, we can centralize and use one observability tool for monitoring all components across all features or modules, unifying the monitoring process.
Regarding specific outcomes, I observe that tools with Datadog's capabilities enable us to quickly achieve mean time to detect problems. We can specifically check the root cause analysis of issues from the infrastructure, application, database, or security sides. Mean time to resolve is improved with Datadog since it provides many suggestions and actions to resolve problems, which heavily impacts the business for our application customers when issues arise.
What is most valuable?
Datadog's best feature is real user monitoring.
I prefer Datadog's real user monitoring most because of its analytics capabilities. First, Datadog is recognized in the Gartner Digital Experience for real user monitoring. Second, the analytics capability is very powerful, enabling us to check the experience of customers first. We can also correlate with the back-end side of the performance for real user monitoring and application monitoring. Finally, the capability of metrics within real user monitoring provides many helpful insights for mobile developers to improve their mobile application performance.
What needs improvement?
Datadog could improve its pricing because it is very tricky, and most of our customers notice many hidden costs. Additionally, if possible, Datadog should offer deployment options not only for SaaS but also for on-premises solutions, which would benefit banking transactions.
Regarding pricing, it remains tricky with many hidden costs. For technological enhancement, there could be an on-premises option alongside the SaaS version. I also find setting up and configuring Datadog to be very complex.
For how long have I used the solution?
What do I think about the stability of the solution?
What do I think about the scalability of the solution?
How are customer service and support?
Which solution did I use previously and why did I switch?
What was our ROI?
What's my experience with pricing, setup cost, and licensing?
My experience with pricing, setup costs, and licensing is that it is very tricky due to many hidden costs, so we need to check repeatedly for allotments and commitments regarding what we receive from the license.
Which other solutions did I evaluate?
What other advice do I have?
My advice for others looking into using Datadog is to initially simplify the technical setup and configuration. Secondly, regarding pricing mechanisms, it would be wise to commit to clear allotments to avoid hidden costs for customers, as it significantly impacts pricing.
I believe Datadog is the largest single observability platform, with correlation as a differentiation factor, enterprise readiness as a measure, and cost management now being a key topic with a very clear roadmap and direction. I would rate this product nine out of ten.
Unified observability has improved incident response and now reduces downtime across environments
What is our primary use case?
My main use case for Datadog is unified observability, as I use it to correlate metrics, traces, and logs in a single pane of glass to ensure the health and security of our cloud infrastructure and application.
I correlate those metrics, traces, and logs using the Service Map to visualize dependencies between our microservices, and for example, during a latency spike, I can instantly see if there is a bottleneck in a specific database query or a downstream API, which allows me to route the issues to the right team immediately.
What is most valuable?
Datadog is an incredibly powerful daily driver for any engineer, and the recent addition of LLM observability for AI apps and Cloud Security Management makes it feel like a platform that is truly keeping up with modern tech trends. The dashboarding and alert integrations are great features offered by Datadog, giving us all the required information on a single screen, and the alert integration performs its job in a very good manner.
Datadog has positively impacted our organization, as it has eliminated many negative issues, which I call tool sprawl, by replacing four or five separate monitoring tools with one unified platform. This has improved our MTTR and broken down silos between Dev and Ops teams.
Since Datadog has been introduced, the response time when seeing an alert has increased, so alerts have been taken care of within less time and routed to the other teams who have been taking the required actions. This has given us a very positive approach towards the entire working culture.
What needs improvement?
Datadog is a platform that can be improved by making its pricing more predictable, as sometimes it is difficult to forecast exactly how much a new project will cost until after we have started ingesting the data.
When it comes to the documentation, we do not have much available right now, so if Datadog can improve the documentation part, it would really help the engineers to work on this.
Datadog is the most comprehensive observability tool on the market, and it only loses two points because the pricing for log ingestion can grow quickly if we do not carefully manage our filters.
For how long have I used the solution?
I have been using Datadog for about three years to monitor our cloud-native application and infrastructure across multiple environments.
What do I think about the stability of the solution?
Datadog is extremely stable, as it is built for high scalable environments and consistently maintains high availability, which is why I trust it as our primary monitoring tool.
What do I think about the scalability of the solution?
Datadog is built for hyperscale, as it automatically scales when we add new hosts or containers, and its Monitoring as Code approach via Terraform allows us to scale our monitoring setup instantly as our infrastructure grows.
How are customer service and support?
Their technical documentation is some of the best in the industry, and their support engineers are very proactive, helping us optimize the ingestion cost.
Which solution did I use previously and why did I switch?
I previously used a mix of open-source tools like Prometheus and Grafana , and I switched because manual upkeep was too high and I needed a platform that could handle logs and traces alongside metrics without having to manage the backend storage.
How was the initial setup?
Buying Datadog through the AWS Marketplace was seamless and helped me meet AWS spending commitments, and while Datadog's custom metric pricing can be complex, the setup cost is very low because the agent is easy to deploy.
What was our ROI?
I have seen a strong ROI through a thirty percent reduction in downtime and significant cost savings by identifying under-utilized cloud resources, for example, the ideal EC2 instances through their cloud cost management.
Which other solutions did I evaluate?
I evaluated New Relic , Dynatrace , and Amazon CloudWatch before choosing Datadog, and I chose Datadog because of its massive library of over seven hundred integrations and its superior user interface, which is easier for our developers to use daily.
What other advice do I have?
My biggest advice is to set up ingestion rules and filters early, as you should not send all your logs and metrics at once, and being selective about what you need to store can maximize your ROI from day one. I would rate this review as an eight.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Monitoring has improved digital experiences and speeds root cause analysis for incident tickets
What is our primary use case?
I intend to use Datadog for application performance monitoring, digital user experiences, and troubleshooting to find the root cause analysis of tickets that will be generated in my managed environment. Digital user experience happens to be the priority for me, as I am evaluating this feature across some competing products.
What is most valuable?
The best features Datadog offers are digital user experience, troubleshooting, and remediation capabilities, which help identify what is going wrong and where. I focused on the root cause analysis of incidents and tickets, as examining the RCAs makes it easier to find remediations and helps with shifting incidents left. Datadog will positively impact my organization by allowing me to handle ticket resolutions at a much faster pace and bring productivity by reducing the number of support engineers required at the monitoring level. If I integrate Datadog with my managed environment or cloud environment, the RCAs and all the left shift will be automated, and with automation, I will be able to reduce the number of support engineers.
What needs improvement?
Datadog could be improved with a simpler graphical user interface that can be extended to non-technical users, such as a CXO, if they want to review the dashboard overall for current tickets and the ticketing dashboard. It would be beneficial to have documentation auto-generated while examining remediations or integration with existing systems.
For how long have I used the solution?
I have been working for more than fifteen years in data center, disaster recovery solutions, and cloud computing, which includes private, public, and hybrid environments.
What do I think about the stability of the solution?
Datadog seems to be more stable, and I really want to have a complete demo before making a call to decide on this.
What do I think about the scalability of the solution?
I hope that Datadog will be able to extend to digital users, even if they are on a scale of thousands for an organization and connect to corporate bandwidth, and the server should be pretty much scalable on the server side.
How are customer service and support?
I find the customer support impressive from what I have heard about Datadog, and I really want to onboard this solution for my customers.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
As of now, we are using cloud-native monitoring with CloudWatch and Azure Monitor for our multi-cloud environment, and we really want to extend it to greater detail that will cover deliberations at greater depth. We have looked at ManageEngine and SolarWinds before choosing Datadog, but they were not very impressive, as the amount of Datadog functionality is not available in these two platforms.
How was the initial setup?
I am looking to deploy Datadog on AWS and Azure for multi-cloud management support and really want to extend it at the server side and at the end-user side for digital user experience. I will start with AWS and extend it to Azure six months down the line. I plan to purchase Datadog through the AWS Marketplace once I have the demo.
What was our ROI?
I am looking at metrics that will help me decide whether I need to really deploy Datadog, and the metrics will primarily be centered around reducing the number of employees and cost optimization.
What's my experience with pricing, setup cost, and licensing?
I did not get the complete information regarding the licenses and commercials associated with Datadog, and I would like to have some idea about the license.
What other advice do I have?
I hope to have some literature on how I can leverage my managed support for cloud environments, plus how I can integrate this with my managed support at the end-user devices. Finding the root cause analysis at greater depth, reducing the number of employees to manage or monitor infrastructure incidents, and increasing satisfaction on the application performance monitoring part are the advice I would give to others looking into using Datadog. I give this review a rating of eight.
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Unified monitoring has streamlined global reporting and standardized alerts across teams
What is our primary use case?
My main use case for Datadog is that we offer the application performance management service within PwC as a global team.
A specific example of how my team uses Datadog for performance management is that my team does not directly use Datadog for performance management; however, we work with approximately 300 teams that use it daily for monitoring their apps. One of the most used cases is to observe when services are up and down and if services are not degraded.
We use most of every product within Datadog across the 300 customers that we have internally.
How has it helped my organization?
Datadog has positively impacted my organization because before Datadog, we had multiple APM tools and monitoring tools, which fragmented the service. The reason was that some tools offered benefits to certain teams, while other tools offered different benefits to other teams. With Datadog, we managed to get everyone on board into a single place and a single tool, providing teams with one spot where they can check everything related to monitoring, and enabling management and leadership to have an overview of all tools working together.
I measured the impact of bringing everything into one place through observation, and I can confirm that efficiency in reporting improved dramatically and it became much easier to observe changes. Standardization was a tremendous win for us. Having a set of standard alerts and monitoring in place allowed us to speed up onboarding for every app. Once the resources are in Datadog, the system provides alerting out of the box. Additionally, cost has decreased dramatically.
What is most valuable?
Datadog's best features include very high demand for logs management for alerting on indexed logs and a shift towards Flex Logs for storage and long-term storage. Most recently, BitsAI and the LLM part within Datadog has been in focus for us.
Flex Logs has helped my teams because we are migrating from other services to have a unique place to store all the logs, the non-security logs, and the app logs. This has benefited those teams because they also benefit from other services within Datadog such as APM or other monitoring solutions. By bringing the logs into Datadog, they now have a single place where they can correlate everything.
The LLM integration within Datadog has helped my teams because LLM usage is at the beginning stage right now, and people are very excited. We have all these AI and LLM-based tools, and having the option of monitoring them is a great benefit for us. However, we are in the exploratory phase of this process and have just begun.
BitsAI is very interesting; we have done some testing and we are going to promote it and use it in our production environment. This is a very exciting new tool for us.
What needs improvement?
Datadog can be improved because sometimes it seems it has not been developed for enterprises. We work with over 300 customers, with each customer having multiple instances or apps within Datadog. We are facing difficulties in controlling access, in privacy settings, and splitting usage and costs for these customers.
We want to be able to customize the cost part, and we would appreciate more granular access control.
For how long have I used the solution?
I have been using Datadog for four or five years.
What do I think about the stability of the solution?
Datadog is stable.
What do I think about the scalability of the solution?
We have never had an issue with Datadog's scalability.
How are customer service and support?
Datadog's customer support is good; it could be improved in terms of communication, but it is adequate.
Which solution did I use previously and why did I switch?
How was the initial setup?
My experience with pricing, setup cost, and licensing is good; nothing out of the ordinary.
Which other solutions did I evaluate?
Before choosing Datadog, the biggest contender we evaluated was AppDynamics.
What other advice do I have?
My advice for others looking into using Datadog is to test it out and see if it works for you. Try to become accustomed to the tagging part of things, and go through each product to understand what each product within Datadog is offering. I would rate this product an eight out of ten.