Datadog Pro (Pay-As-You-Go) for GovCloud

Datadog is a SaaS-based unified observability and security platform providing full visibility into the health and performance of each layer of your environment at a glance.

4.3

Request private offer

Request demo

Overview

Try agent mode

Create proposal

Ask question

Datadog is a SaaS-based unified observability and security platform providing full visibility into the health and performance of each layer of your environment at a glance. Datadog allows you to customize this insight to your stack by collecting and correlating data from more than 600 vendor-backed technologies and APM libraries, all in a single pane of glass. Monitor your underlying infrastructure, supporting services, applications alongside security data in a single observability platform.

Prices are based on committed use per month over total term of the agreement (the Total Expected Use).

Highlights

Get started in minutes from AWS Marketplace with our enhanced integration for account creation and setup. Turn-key integrations and easy-to-install agent to start monitoring all of your servers and resources in minutes.
Quickly deploy modern monitoring and security in one powerful observability platform.
Create actionable context to speed up, reduce costs, mitigate security threats and avoid downtime at any scale.

Details

Sold by

Datadog

Introducing multi-product solutions

You can now purchase comprehensive solutions tailored to use cases and industries.

Learn more

Explore multi-product solutions

Features and programs

Trust Center

Access real-time vendor security and compliance information through their Trust Center powered by Drata or Vanta. Review certifications and security standards before purchase.

View Trust Center

Buyer guide

Gain valuable insights from real users who purchased this product, powered by PeerSpot.

Get the buyer guide

Financing for AWS Marketplace purchases

AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

View financing details

Quick Launch

Leverage AWS CloudFormation templates to reduce the time and resources required to configure, deploy, and launch your software.

Learn more

Pricing

Custom pricing options

Request private offer

Pricing is based on your specific requirements and eligibility. Request a private offer to receive a custom quote. Sign in to view any offers that have been extended to you.

How can we make this page better?

Tell us how we can improve this page, or report an issue with this product.

Legal

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Usage information

Info

Request demo

Delivery details

Software as a Service (SaaS)

SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

Resources

Vendor resources

Product Features

API Documentation

Datadog AWS Integration IAM Policy

Support

Vendor support

Contact our knowledgable Support Engineers via email, live chat, or in-app messages

Get support

AWS infrastructure support

AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

Get support

Product comparison

Info

Updated weekly

Datadog Pro (Pay-As-You-Go) for GovCloud

By Datadog

Infrastructure Monitoring and Observability Platform

By LogicMonitor

The Observability Cloud

By Observe, Inc.

Accolades

Info

Top

100

In Log Analysis

Top

In Monitoring and Observability, Migration

Top

In Application Performance and UX Monitoring

Customer reviews

Info

Sentiment is AI generated from actual customer reviews on AWS and G2

Reviews

Functionality

Ease of use

Customer service

Cost effectiveness

22 reviews

500 reviews

2 reviews

Insufficient data

Positive reviews

Mixed reviews

Negative reviews

Overview

Info

AI generated from product descriptions

Multi-Source Data Integration

Collects and correlates data from more than 600 vendor-backed technologies and APM libraries in a single platform

Unified Observability Dashboard

Provides full visibility into health and performance across all environment layers through a single pane of glass

Infrastructure and Application Monitoring

Monitors underlying infrastructure, supporting services, applications, and security data simultaneously within one observability platform

Rapid Deployment and Setup

Enables quick deployment through turn-key integrations and easy-to-install agent for monitoring servers and resources

Security and Performance Analytics

Delivers actionable insights to reduce costs, mitigate security threats, and prevent downtime across any scale

Automated Device Discovery and Configuration

Automatic recognition and configuration of 2,000+ technologies with preconfigured alert thresholds and best practices-based setup without manual intervention

Agentless Monitoring Architecture

Agentless Collector deployment enabling hybrid and multi-cloud visibility with reduced operational overhead and no requirement for continuous agent upgrades

Unified Multi-Environment Visibility

Single-pane-of-glass monitoring across on-premises, hybrid, and multi-cloud infrastructures including AWS services with panoramic performance visibility

Flexible Data Collection Mechanism

Capability to pull metrics from virtually any device or API with support for custom graphs, dashboards, and alerts for application status analysis and trend identification

Performance Forecasting and Customizable Dashboards

Built-in performance forecasting capabilities combined with rich, customizable dashboards and full reporting functionality for actionable infrastructure insights

Data Ingestion and Query Performance

Ingests petabytes of telemetry per day with capability to process hundreds of terabytes and execute tens of millions of queries daily without performance degradation

Knowledge Graph Architecture

Utilizes O11y Knowledge Graph to structure and correlate data across logs, metrics, and traces for fast search and correlation capabilities

Natural Language Processing for Incident Analysis

Enables troubleshooting of complex incidents using natural language queries through O11y AI for accelerated root cause analysis

Open Data Lake Foundation

Built on Snowflake data lake architecture providing open data storage without vendor lock-in

Multi-Signal Correlation

Correlates and correlates telemetry signals across logs, metrics, and traces with context-aware analysis for incident resolution

Contract

Info

Standard contract

Customer reviews

Leave a review

Ratings and reviews

Info

4.3

79 ratings

5 star

4 star

3 star

2 star

1 star

51%

48%

22 AWS reviews

57 external reviews

External reviews are from PeerSpot .

Kavya S

Centralized monitoring has improved cloud observability and reduces manual debugging efforts

Reviewed on Jun 01, 2026

Review from a verified AWS customer

What is our primary use case?

My main use case for Datadog is to monitor the logs and capture metrics like CPU metrics, memory, and traces across different services in a cloud-based monitoring system where I initially worked, specifically to debug failing systems and systems which are slow, mainly for monitoring my servers in AWS .

What is most valuable?

The best features of Datadog for me are the user-friendly real-time dashboard and its ability to easily integrate with AWS , Azure , Kubernetes , Kafka, and provide a centralized log management system, which gives me excellent visibility into the microservice architecture.

Datadog has impacted my organization by providing a centralized monitoring system so that each person can trace what is happening in the VM servers, and it has given us a centralized dashboard view.

Since adopting Datadog, it has reduced the manual effort by around seven to eight hours per week, making the process completely automated.

Datadog has improved the collaboration across the teams and cross-functional teams, making it very fast and allowing us to easily track what is wrong.

What needs improvement?

If I could change one thing about Datadog, it would be the pricing, as it has extraordinary functionality, but the pricing is somewhat expensive, and as we increase the number of servers and monitoring services, the cost increases. A more predictable and flexible pricing structure would be beneficial, along with additional customization options and reporting features.

For how long have I used the solution?

I have been familiar with Datadog for more than two years.

What do I think about the stability of the solution?

I have not yet faced any frustration with Datadog.

Which solution did I use previously and why did I switch?

Before I landed on Datadog, I used to review the CloudWatch logs in AWS, and we initially had the tool Checkmk for monitoring.

How was the initial setup?

When I first implemented Datadog, it took me around thirty to forty minutes for the basic setup because we had a very large application to monitor metrics. After the configuration, the data actually appeared within three to four minutes.

What about the implementation team?

We did not have any formal training on Datadog. Instead, we referred to Google documentation regarding what Datadog is, how to set it up, and what the use cases are, and based on that, we initially set up Datadog.

Which other solutions did I evaluate?

When evaluating options before choosing Datadog, I compared it with tools such as New Relic and Grafana Labs with Prometheus. The main reason I chose Datadog is that it is a single platform where I can see metrics, logs, traces, and alerts, and it easily integrates with Kubernetes and other services such as Kafka.

What other advice do I have?

Our workflow is both team-wide and individual, as we check the end-to-end observability and the monitoring of our end-to-end application, infrastructure, and cloud services individually as well as in a team.

When I open Datadog, the first thing I do is see the home dashboard, which will have the active alerts and the system health status, as well as listing out all the monitored resources, including the servers, virtual machines, Kubernetes pods, and nodes. I will also see the CPU usage and memory usage, including the disk utilization.

Datadog is used by the cloud infrastructure monitoring team and the application team within the company, and everyone uses it on the same level as I do.

I have not experienced any features during implementation of Datadog that I am not really using in practice.

As of now, for my use case, I am satisfied with what Datadog offers, and I do not wish for any specific features that it currently lacks.

My advice to someone considering Datadog who has a similar workflow to mine is to read the entire documentation and work on it. I would rate my overall experience with Datadog as an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Kallamuddin Ansari

Unified monitoring has improved incident response and now reduces root cause analysis time

Reviewed on May 04, 2026

Review provided by PeerSpot

What is our primary use case?

Datadog serves as my primary tool for infrastructure monitoring and log analysis in a cloud environment. From a network and security perspective, I use it to monitor server health, track network metrics like latencies and traffic patterns, and analyze logs for troubleshooting issues such as VPN instability and unexpected spikes. The ability to correlate metrics and logs in one place makes it much faster to identify the root cause instead of checking multiple tools.

One example where Datadog proved invaluable was during a sudden spike in application response time. We received alerts on increased latencies, and instead of checking multiple tools, I used Datadog's dashboard to quickly correlate metrics. I noticed that while the application CPU was normal, there was a spike in database response times. Using the logs and metrics together, I was able to confirm that the issue was coming from the database, not the application. This helped us quickly involve the right team and resolve the issue faster.

What is most valuable?

The best features of Datadog are the correlation capabilities and unified visibility. The most useful aspect is that I can see metrics, logs, and service-level data in one place. During troubleshooting, I do not have to switch tools; I can directly correlate spikes in latencies with log error patterns, which saves considerable time. Another feature I find very useful is the dashboards, which are flexible, and I can create views based on what I actually need to monitor daily instead of relying on default setups. The integration with cloud services makes onboarding very easy, and once integrated, most of the data starts flowing automatically without much manual effort.

Datadog has had a positive impact, mainly by improving how quickly we detect and understand issues. Earlier, when something went wrong, considerable time went into figuring out where the problem actually was. Now, with better visibility across services and logs, we can quickly narrow down the source, whether it is application, infrastructure, or dependency-related. It has also helped in reducing the back and forth between teams because we can validate issues with the data before escalating, which has made incident handling smoother and more efficient overall.

What needs improvement?

One area where Datadog can be improved is around alert quality. In the beginning, it tends to generate many alerts, and without proper tuning, many of them are not actionable. It would help if there were more built-in guidance or smarter defaults to reduce noise. Another improvement area is cost visibility and control. As log and metric ingestion increases, it has not always been straightforward to track which data is driving the cost. More granular and real-time cost insights would make it easier to manage. Additionally, while the dashboards are flexible, navigating and organizing them at scale can become slightly difficult. Better structuring or management options would help in larger environments.

For how long have I used the solution?

I have been using Datadog for nearly two years.

What do I think about the stability of the solution?

Datadog has been stable overall in my experience. We have not seen any major platform outages. Metrics collection and alerting have been consistent in day-to-day use. Most issues we have faced were related to configurations or alert tuning rather than the platform itself. The platform is stable with no major platform issues, only configuration-related challenges.

What do I think about the scalability of the solution?

Datadog scales well as environments grow in my experience. As we add more servers and services, onboarding is straightforward with agents and integrations. We have not faced any major performance issues from the platform side; it handles increased metrics and monitoring loads smoothly. The primary consideration is managing log volume carefully because as the scale increases, data ingestion and costs also go up. Datadog is scalable technically, but the ingestion costs need to be managed as the environment grows.

How are customer service and support?

We do not rely on Datadog support for day-to-day issues. Most of the time, we are able to resolve things using the dashboards, logs, and their documentation. We have only reached out in a few cases, mainly for configuration-related queries, and in those situations, support was helpful, though sometimes it required a few back and forth interactions to get to the exact solution. Overall, support is decent, but we mostly depend on self-troubleshooting.

Which solution did I use previously and why did I switch?

Before Datadog, we were mainly using native cloud monitoring like Azure Monitor , along with a few basic tools. The main issue was that monitoring was fragmented. Metrics, logs, and alerts were spread across different places, and so during an incident, we had to switch between multiple tools to understand what was happening. We moved to Datadog to have everything in one place. The ability to correlate metrics and logs in a single platform made troubleshooting much faster and more efficient.

How was the initial setup?

Setting up dashboards and integrations in Datadog is relatively straightforward in my experience, especially for standard cloud services. For integrations, once we connect our cloud account, most of the metrics start coming in automatically, so the initial setup is not very complex. The documentation also helps considerably during this phase. For dashboards, basic ones are easy to create using existing templates, but to make them truly useful, we have to spend time customizing them based on our actual use cases, like adding specific metrics and refining the layout. Overall, the initial setup is easy, but making it truly effective takes practical tuning.

What was our ROI?

We have seen a clear return on investment with Datadog, mainly in terms of time saved and faster incident handling. For example, earlier when an issue occurred, it would take around thirty-five to forty-five minutes just to identify the root cause because we had to check multiple tools. With Datadog, we are usually able to narrow it down within ten to fifteen minutes using the centralized dashboard and logs. We have also reduced repeated troubleshooting efforts because we can identify patterns and fix the root cause instead of dealing with the same issues repeatedly. It has not reduced headcount, but it has definitely improved team efficiency and allowed us to handle more incidents with the same team.

What's my experience with pricing, setup cost, and licensing?

My experience with pricing for Datadog has been mixed. The initial setup cost is relatively low since it is a SaaS model and does not require a heavy upfront investment. Getting started is quite quick with agent-based deployments. However, the ongoing cost is something that needs to be managed. Pricing is mainly based on data ingestion, such as logs, metrics, and traces, and it can increase quickly if everything is enabled by default. Licensing is flexible, but it requires continuous monitoring and optimization to keep costs under control.

What other advice do I have?

One additional point I can add is that with Datadog, I focused considerably on making alerts actionable and reducing noise. In the initial phases, we had too many alerts that were not very useful, so we spent time tuning thresholds, adding conditions, and correlating alerts with real impact. After that, alerts became much more meaningful and helpful in faster response. I also use it regularly for trend analysis, checking for recurring spikes or patterns over time, which helps in identifying potential issues before they become incidents.

The features of Datadog become truly useful when you start combining them, not just using them separately. For example, just looking at the metrics alone does not always give the full picture, but when you combine metrics with logs and service-level data, it becomes much easier to understand what is actually happening during an incident. Features like tagging help considerably in filtering data across environments and services, especially when the setup grows. Without proper tagging, it can get difficult to navigate. Overall, the strength of Datadog is not just the individual features, but how well they work together in real scenarios.

We have seen noticeable improvements after using Datadog, mainly in terms of time saved and faster incident handling. Earlier when an issue occurred, it could take around twenty to forty minutes just to understand where the problem was. Now, with the centralized visibility and correlation of metrics and logs, we are often able to narrow it down within fifteen to twenty-five minutes. We have also seen fewer repeated incidents because we can identify patterns and fix the root cause instead of just resolving symptoms. Incidents are getting resolved faster, and the time spent on troubleshooting has reduced significantly.

My advice for anyone considering Datadog is to be selective about what you monitor from day one. It is tempting to enable everything, but that usually leads to too much data and noisy alerts. Instead, start with critical services and key metrics, and then expand gradually. Invest time in tagging and structuring your data properly because it makes a considerable difference later when you need to filter, troubleshoot, or build dashboards. Finally, review your setup regularly because what works in the beginning may not stay relevant as the environment grows. Start small, avoid collecting all data, use proper tagging, and keep refining your setup over time. This review reflects an overall rating of eight.

SurajYadav

Centralized monitoring has reduced troubleshooting time and improves proactive incident response

Reviewed on May 03, 2026

Review provided by PeerSpot

What is our primary use case?

My main use case for Datadog is infrastructure and log monitoring in a cloud-based environment. From a network and security perspective, I mainly use it to monitor server health, track network-level metrics, and analyze logs for troubleshooting issues such as VPN instabilities, traffic spiking, or unexpected behavior.

One recent example where I used Datadog was during a VPN-related issue where users were reporting intermittent disconnections. I checked our Datadog dashboard and noticed spiking in network latencies and a sudden increase in connections dropped around the same time users reported the issues. I then correlated this with the logs and found that one of the back-end servers handling the connection was hitting high CPU utilization. Because everything was centralized, I did not have to jump between multiple tools. I was able to quickly identify the impacted servers and escalate it to the infrastructure team. Once the load was balanced, the issue got resolved.

With Datadog, I mainly focus on creating meaningful dashboards and tuning alerts properly. In the beginning, we saw a lot of alert noise, so we had to refine thresholds and conditions to make sure alerts are actually actionable. Once that was done, it became much more effective for proactive monitoring instead of just reactive troubleshooting.

What is most valuable?

One of the best features of Datadog, in my opinion, is its unified visibility across the metrics, logs, and traces in a single platform. The dashboards are very flexible and customizable, which helps a lot in creating meaningful monitoring views based on different use cases. I also find the log management quite useful because it allows quick correlation with metrics during troubleshooting. Another strong feature is its integration, especially with cloud platforms such as AWS or Azure , which makes onboarding and monitoring much easier without heavy manual work.

Integration with cloud platforms such as Amazon Web Services or Microsoft Azure has really made daily monitoring much easier. Once the integration is set up, Datadog automatically pulls metrics from services such as virtual machines, load balancers, and databases without needing manual configuration on each resource. In one case, I was monitoring a cloud-based application where we started seeing performance issues through Datadog's Azure integrations. I could quickly view metrics from the application server and the back-end database in the same dashboard. It helped me identify that the issue was not network-related but due to the increased load on the backend services. Instead of checking multiple portals, everything was available in one place, which saved time and made troubleshooting faster.

Datadog has had a positive impact mainly by improving visibility and reducing troubleshooting times. Earlier, we had to rely on multiple tools to check metrics and logs, which delayed root cause analysis. With Datadog, everything is centralized, so it is much faster to identify issues and take actions. It has also helped in proactive monitoring with properly tuned alerts. We are able to detect unusual behaviors such as spiking in traffic or resource usage before it turns into a major incident. Overall, it has improved operational efficiency and reduced downtime by enabling quicker responses during incidents.

What needs improvement?

If you are asking for improvements, I feel some small areas where Datadog can improve. One area is alert management. In a dynamic environment, it can generate a lot of alert noise if not tuned properly. More intelligent alerting or built-in recommendations would help. Another aspect is cost visibility. As log ingestion increases, pricing can scale quickly. Having more transparent and granular cost control features would make it easier to manage usage. Also, the initial setup and configuration can feel a bit complex for new users.

For how long have I used the solution?

I have been using Datadog for ten months.

What do I think about the stability of the solution?

In my experience, it has been quite stable; we have not faced any major outages or reliability issues from the platform side. Data collection and dashboards have been consistent, and alerts are delivered on time as long as they are properly configured. Most of the issues we have seen were related to configuration or alert tuning rather than the platform itself.

What do I think about the scalability of the solution?

It has scaled well for our needs. As we added more servers and services, Datadog was able to handle the increased load without any major issues. Since it is a SaaS platform, we did not have to worry about backend scaling. New hosts and services get onboarded easily with the agents, and metric collection continues smoothly even as the environment grows. The only thing we monitor closely is log volume because as scale increases, ingestion and costs also go up, but from a performance and handling perspective, it has been quite good.

How are customer service and support?

In my experience, the customer support from Datadog has been quite reliable. For standard issues and queries, the response time is generally good, and the documentation is also very helpful for resolving common problems. For more complex cases, support may take some time for investigations, but they usually provide proper guidance and follow-up. Overall, I would say support is responsive and helpful, especially when combined with their strong documentation.

Which solution did I use previously and why did I switch?

This is the first time I am using Datadog. Before that, there was not any solution in place.

How was the initial setup?

The initial setup cost is relatively low since it is a SaaS model and getting started is straightforward with agent-based deployments. However, the main challenge is the ongoing cost, which depends on data ingestion such as logs, metrics, and traces. As usage grows, especially with log collection, the costs can increase quickly, which requires proper planning around what data to collect, retention policies, and filtering to keep control. Overall, I think it is flexible, but cost optimization needs continuous monitoring.

What was our ROI?

We have seen a return on investment with Datadog, mainly in terms of saving operational efficiency. For example, earlier our troubleshooting process involved checking multiple tools, which used to take around forty to forty-five minutes just to identify the root cause. With Datadog, since metrics and logs are centralized, we are usually able to reduce the time to around ten to twenty minutes in many cases. This has improved our response time and reduced the duration of incidents. While it may not directly reduce headcount, it definitely improves team productivity and helps handle more issues efficiently with the same team.

While we do not track exact numbers in all cases, with Datadog we have definitely seen a noticeable improvement in incident response time. For example, earlier it could take around thirty to forty-five minutes to identify the root cause analysis because we had to check multiple tools. With Datadog's centralized dashboards and logs, we are usually able to narrow it down within ten to fifteen minutes in most cases. We have also seen fewer escalations for minor issues because alerts help us catch problems earlier, which indirectly reduces downtime and improves overall efficiency.

Which other solutions did I evaluate?

We did consider a few alternatives, but they each have their own standards. We considered solutions such as Splunk, New Relic , and Prometheus. Everything is more costly, but I prefer Datadog. I have just heard about Datadog and other monitoring tools from some colleagues. As per their comparisons, I feel Datadog is much better.

What other advice do I have?

If anyone is looking to use Datadog, I would advise planning your monitoring strategy from the beginning. Focus on what metrics and logs are actually important because collecting everything can increase noise and costs. It is also important to spend some time on proper alert tuning; otherwise, you may end up with too many non-actionable alerts. I would also recommend starting with key integrations, especially with cloud platforms, and then gradually expanding use instead of enabling everything at once. I would rate this product an eight out of ten.

Abednego Petrus

Unified monitoring has improved incident detection and reduced resolution time across our stack

Reviewed on Jan 22, 2026

Review from a verified AWS customer

What is our primary use case?

Datadog 's main use case is end-to-end monitoring that helps check problems across infrastructure, application, database, security, and logs.

For example, when checking a problem with a mobile application such as an error from a user hitting a transaction, we check from the client-side mobile device and also from the back end for the API to see if there is latency or an error that triggers the problem. There may be an issue on the database, such as a locking query or high latency on query performance. For infrastructure, if the application is slow, it may be impacted on infrastructure monitoring by CPU and memory consumption.

Datadog is a powerful observability tool that allows us to correlate and see problems on the infrastructure or application side. In an incident war room, we can see the correlation and the detailed root cause of the problem across real user monitoring, application, database, and infrastructure.

How has it helped my organization?

Datadog has positively impacted our organization because our customers are very happy using it. With silo monitoring, where infrastructure has separate monitoring, application has another, and cloud has another, it becomes tricky and complex. We cannot correlate the silo monitoring, and pricing is complicated. With Datadog, we can centralize and use one observability tool for monitoring all components across all features or modules, unifying the monitoring process.

Regarding specific outcomes, I observe that tools with Datadog's capabilities enable us to quickly achieve mean time to detect problems. We can specifically check the root cause analysis of issues from the infrastructure, application, database, or security sides. Mean time to resolve is improved with Datadog since it provides many suggestions and actions to resolve problems, which heavily impacts the business for our application customers when issues arise.

What is most valuable?

Datadog's best feature is real user monitoring.

I prefer Datadog's real user monitoring most because of its analytics capabilities. First, Datadog is recognized in the Gartner Digital Experience for real user monitoring. Second, the analytics capability is very powerful, enabling us to check the experience of customers first. We can also correlate with the back-end side of the performance for real user monitoring and application monitoring. Finally, the capability of metrics within real user monitoring provides many helpful insights for mobile developers to improve their mobile application performance.

What needs improvement?

Datadog could improve its pricing because it is very tricky, and most of our customers notice many hidden costs. Additionally, if possible, Datadog should offer deployment options not only for SaaS but also for on-premises solutions, which would benefit banking transactions.

Regarding pricing, it remains tricky with many hidden costs. For technological enhancement, there could be an on-premises option alongside the SaaS version. I also find setting up and configuring Datadog to be very complex.

For how long have I used the solution?

I have been using Datadog for two years.

What do I think about the stability of the solution?

Datadog is very stable, and the features are quickly updated because the research and development process moves swiftly, making it reliable for fixes and updates.

What do I think about the scalability of the solution?

Datadog's scalability is very strong due to its cloud-native distributed architecture, massive data capability, extensive integration ecosystem, seamless expansion, and real-world scalability evidence.

How are customer service and support?

Customer support is very good because there is extensive support from Datadog, including live chat, ticketing, and a very high SLA of 99.98%.

Which solution did I use previously and why did I switch?

I was using Instana and Dynatrace as different solutions before Datadog.

What was our ROI?

I have seen a return on investment because Datadog helps save money and reduces the need for fewer employees while also saving time, which is very beneficial.

What's my experience with pricing, setup cost, and licensing?

My experience with pricing, setup costs, and licensing is that it is very tricky due to many hidden costs, so we need to check repeatedly for allotments and commitments regarding what we receive from the license.

Which other solutions did I evaluate?

I evaluated other options before choosing Datadog, specifically Dynatrace .

What other advice do I have?

My advice for others looking into using Datadog is to initially simplify the technical setup and configuration. Secondly, regarding pricing mechanisms, it would be wise to commit to clear allotments to avoid hidden costs for customers, as it significantly impacts pricing.

I believe Datadog is the largest single observability platform, with correlation as a differentiation factor, enterprise readiness as a measure, and cost management now being a key topic with a very clear roadmap and direction. I would rate this product nine out of ten.

BasilJiji

Unified observability has improved incident response and now reduces downtime across environments

Reviewed on Dec 29, 2025

Review from a verified AWS customer

What is our primary use case?

My main use case for Datadog is unified observability, as I use it to correlate metrics, traces, and logs in a single pane of glass to ensure the health and security of our cloud infrastructure and application.

I correlate those metrics, traces, and logs using the Service Map to visualize dependencies between our microservices, and for example, during a latency spike, I can instantly see if there is a bottleneck in a specific database query or a downstream API, which allows me to route the issues to the right team immediately.

What is most valuable?

Datadog is an incredibly powerful daily driver for any engineer, and the recent addition of LLM observability for AI apps and Cloud Security Management makes it feel like a platform that is truly keeping up with modern tech trends. The dashboarding and alert integrations are great features offered by Datadog, giving us all the required information on a single screen, and the alert integration performs its job in a very good manner.

Datadog has positively impacted our organization, as it has eliminated many negative issues, which I call tool sprawl, by replacing four or five separate monitoring tools with one unified platform. This has improved our MTTR and broken down silos between Dev and Ops teams.

Since Datadog has been introduced, the response time when seeing an alert has increased, so alerts have been taken care of within less time and routed to the other teams who have been taking the required actions. This has given us a very positive approach towards the entire working culture.

What needs improvement?

Datadog is a platform that can be improved by making its pricing more predictable, as sometimes it is difficult to forecast exactly how much a new project will cost until after we have started ingesting the data.

When it comes to the documentation, we do not have much available right now, so if Datadog can improve the documentation part, it would really help the engineers to work on this.

Datadog is the most comprehensive observability tool on the market, and it only loses two points because the pricing for log ingestion can grow quickly if we do not carefully manage our filters.

For how long have I used the solution?

I have been using Datadog for about three years to monitor our cloud-native application and infrastructure across multiple environments.

What do I think about the stability of the solution?

Datadog is extremely stable, as it is built for high scalable environments and consistently maintains high availability, which is why I trust it as our primary monitoring tool.

What do I think about the scalability of the solution?

Datadog is built for hyperscale, as it automatically scales when we add new hosts or containers, and its Monitoring as Code approach via Terraform allows us to scale our monitoring setup instantly as our infrastructure grows.

How are customer service and support?

Their technical documentation is some of the best in the industry, and their support engineers are very proactive, helping us optimize the ingestion cost.

Which solution did I use previously and why did I switch?

I previously used a mix of open-source tools like Prometheus and Grafana , and I switched because manual upkeep was too high and I needed a platform that could handle logs and traces alongside metrics without having to manage the backend storage.

How was the initial setup?

Buying Datadog through the AWS Marketplace was seamless and helped me meet AWS spending commitments, and while Datadog's custom metric pricing can be complex, the setup cost is very low because the agent is easy to deploy.

What was our ROI?

I have seen a strong ROI through a thirty percent reduction in downtime and significant cost savings by identifying under-utilized cloud resources, for example, the ideal EC2 instances through their cloud cost management.

Which other solutions did I evaluate?

I evaluated New Relic , Dynatrace , and Amazon CloudWatch before choosing Datadog, and I chose Datadog because of its massive library of over seven hundred integrations and its superior user interface, which is easier for our developers to use daily.

What other advice do I have?

My biggest advice is to set up ingestion rules and filters early, as you should not send all your logs and metrics at once, and being selective about what you need to store can maximize your ROI from day one. I would rate this review as an eight.

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

View all reviews