Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Datadog Pro

Datadog | 1

Reviews from AWS customer

9 AWS reviews

External reviews

678 reviews
from and

External reviews are not included in the AWS star rating for the product.


4-star reviews ( Show all reviews )

    reviewer2561139

Consistent, centralized service for varied cloud-based applications

  • September 19, 2024
  • Review provided by PeerSpot

What is our primary use case?

The current use case for Datadog in our environment is observability.  We use Datadog as the primary log ingestion and analysis point, along with consolidation of application/infrastructure metrics across cloud environments and realtime alerting to issues that arise in production.  

Datadog integrates within all aspects of our infrastructure and applications to provide valuable insights into Containers, Serverless functions, Deep Logging Analysis, Virtualized Hardware and Cost Optimizations.

How has it helped my organization?

Datadog improved our observability layer by creating a consistent, centralized service for all of our varied cloud-based applications. All of our production and non-production environment applications and infrastructure send metrics directly to Datadog for analysis and determination of any issues that would need to be looked at by the Infrastructure, Platform and Development teams for quick remediation. Using Datadog as this centralized Observability platform has enabled us to become leaner without sacrificing project timelines when issues arise and require triage for efficient resolution.

What is most valuable?

All of Datadog's features have become valuable tools in our cloud environments.

Our primary alerts, based on metrics and synthetic transactions, are the most used and relied upon for decreased MTTA/MTTR across all of our platforms. This is followed by deep log analysis that enables us to quickly and easily get to a preliminary root cause that someone on the infrastructure, platform or development teams can take and focus their attention on the precise target that Datadog revealed as the issue to be remediated.

What needs improvement?

The two areas I could see needing improvement or a feature to add value are building a more robust SIM that would include container scanning to rival other such products on the market so we do not need to extend functionality to another third-party provider. The other expands the alerting functions by creating a new feature to add direct SMS notifications, on-call rotation scheduling, etc., that could replace the need to have this as an external third party solution integration. 

For how long have I used the solution?

I've been a Datadog user for almost ten years.

What do I think about the stability of the solution?

Datadog is very stable, and we've only come across a few items that needed to be addressed quickly when there were issues.

What do I think about the scalability of the solution?

Scalability is very favorable, aside from cost/budget, which limits the scalability of this platform.

How are customer service and support?

Both customer service and support need a little work, as we have had a number of requests/issues that were not addressed as we needed them to be.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

Being an Observability SME, I have used many native and third party solutions, including Dynatrace, New Relic, CloudWatch and Zabbix. As previously mentioned, Datadog provides a superior platform for centralizing and consolidating our Observability layer. Switching to Datadog was a no-brainer when most other solutions either didn't provide the maturity of functions, or have them available, at all.

How was the initial setup?

The initial setup was very straightforward, and the integrations were easily configured.

What about the implementation team?

We implemented Datadog in-house.

What was our ROI?

For the most part, Datadog's ROI is quite impressive when you consider all of the features and functions that are centralized on the platform. It doesn't require us to purchase additional third-party solutions to fill in the gaps.

What's my experience with pricing, setup cost, and licensing?

The setup was dead simple once the cloud integrations and agent components were identified and executed. Licensing falls into our normal third-party processes, so it was a familiar feeling when we started with Datadog. Cost is the only outlier when it comes to a perfect solution. Datadog is expensive, and each add-on drives that cost further into the realm of requiring justifications to finance expanding the core suite of features we would like to enable.

Which other solutions did I evaluate?

Yes, we evaluated several competing platforms that included Dynatrace, New Relic and Zabbix.

What other advice do I have?

They should provide more inclusive pricing, or an "all you can eat" tier that would include all relevant features, as opposed to individual cost increases to let Datadog to become more valuable and replace even more third-party solutions that have a lower cost of entry.

Which deployment model are you using for this solution?

Hybrid Cloud


    Kevin Palmer

Useful log aggregation and management with helpful metrics aggregation

  • September 19, 2024
  • Review provided by PeerSpot

What is our primary use case?

We use Datadog for log aggregation and management, metrics aggregation, application performance monitoring, infrastructure monitoring (serverless (Lambda functions), containers (EKS), standalone hosts (EC2)), database monitoring (RDS) and alerting based on metric thresholds and anomalies, log events, APM anomalies, forecasted threshold breaches, host behaviors and synthetics tests.

Datadog serves a whole host of purposes for us, with an all-in-one UI and integrations between them built in and handled without any effort required from us.

We use Datadog for nearly all of our monitoring and information analysis from the infrastructure level up through the application stack.

How has it helped my organization?

Datadog provides us value in three major ways:

First, Datadog provides best-in-class functionality in many, if not all, of the products to which we subscribe (infrastructure, APM, log management, serverless, synthetics, real user monitoring, DB monitoring). In my experience with other tools that provide similar functionality, Datadog provides the largest feature set with the most flexibility and the best performance.

Second, Datadog allows us to access all of those services in one place. Having to learn and manage only one tool for all of those purposes is a major benefit.

Third, Datadog provides significant connectivity between those services so that we can view, summarize, organize, translate and correlate our data with maximum effect. Not needing to manually integrate them to draw lines between those pieces of information is a huge time savings for us.

What is most valuable?

I use log management and monitors most often.

Log management is a great way for me to identify changes in behavior across services and environments as we make changes or as user behavior evolves. I can filter out excess or not useful logs, in part or in full, I can look for trends and I can group by multiple facets.

Monitors allow me to rest easy knowing that I'll be alerted to unexpected changes in behavior throughout our environments so that I can be proactive without having to dedicate active cycles to watching all facets of our environments.

What needs improvement?

In my four years using the product, the only feature request I, or anyone on my team, has had was the ability to view query parameters in query samples. 

Otherwise, improvements are already released faster than we can give them sufficient time and attention, so I'm very happy with the product and don't have any specific requests at this time.

The cost does add up quickly, so it can be some effort to justify the necessary outlay to those paying the bills. That said, Datadog provides sufficient benefits to warrant our continued use.

For how long have I used the solution?

I've used the solution for four years.

What do I think about the stability of the solution?

In four years of daily use I haven't noticed any periods of downtime.

What do I think about the scalability of the solution?

It's amazing to me how performant Datadog is given how much data we pass to it.

How are customer service and support?

We've opened probably six or eight support tickets in four years of use. In some cases, the problem or question was complex and took some time to resolve. That said, customer support was always able to debug the issue and find a solution for us, so my experience has been very positive.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

I've used New Relic, Honeycomb, Grafana, Splunk, Prometheus, Graylog and others.

How was the initial setup?

Given the breadth of configuration options, the initial setup was fairly involved for us. We also use several services and deploy the agent in various ways because we're using traditional servers, serverless, and K8s.

What about the implementation team?

We implemented the solution in-house.

What's my experience with pricing, setup cost, and licensing?

The solution can be pricey if you're using many services and/or shipping lots of data, but in my opinion, the value is greater than the cost, so I would suggest doing an evaluation before making a decision.

Which deployment model are you using for this solution?

Public Cloud


    Ravel Leite

Proactive, provides user trends, and works harmoniously

  • September 19, 2024
  • Review provided by PeerSpot

What is our primary use case?

From day one, we have seamlessly integrated our new product into Datadog, a comprehensive monitoring and analytics platform. By doing so, we are continuously collecting essential data such as host information, system logs, and key performance metrics. This enables us to gain deep insights into product adoption, monitor usage patterns, and ensure optimal performance. Additionally, we use Datadog to capture and analyze errors in real-time, allowing us to troubleshoot, replay, and resolve production issues efficiently.

How has it helped my organization?

It has proven invaluable in helping us identify early issues within the product as soon as they occur, allowing us to take immediate action before they escalate into more significant problems. This proactive approach ensures that potential challenges are addressed in real-time, minimizing any impact on users. Furthermore, the system allows us to measure product adoption and usage trends effectively, providing insights into how customers are interacting with the product and identifying areas for improvement or enhancement.

What is most valuable?

There isn't any single aspect that stands out in particular; rather, everything is interconnected and works together harmoniously. Each component complements the other, creating a cohesive system where data, logs, and metrics are seamlessly integrated. This interconnectedness ensures that no part operates in isolation, allowing for a more holistic view of the product's performance and health. The way everything binds together strengthens our ability to monitor, analyze, and improve the product efficiently.

What needs improvement?

At the moment, nothing specific comes to mind. Everything seems to be functioning well, and there are no immediate concerns or issues that I can think of. 

The system is operating as expected, and any challenges we've faced so far have been successfully addressed. If anything does come up in the future, we will continue to monitor and assess it accordingly, but right now, there’s nothing that stands out requiring attention or improvement. 

Datadog is too pricey when compared to its competitors, and this is something that its always on my mind during the decision-making process.

For how long have I used the solution?

I've used the solution for nearly two years now.

Which deployment model are you using for this solution?

Public Cloud


    reviewer9816413

Easy, more reliable, and transparent monitoring

  • September 19, 2024
  • Review provided by PeerSpot

What is our primary use case?

We use the solution to monitor and investigate issues with production services at work. We're periodically reviewing the service catalog view for the various applications and I use it to identify any anomalies with service metrics, any changes in user behavior evident via API calls, and/or spikes in errors.  

We use monitors to trigger alerts for on-call engineers to act upon. The monitors have set thresholds for request latency, error rates, and throughput. 

We also use automated rules to block bad actors based on request volume or patterns.

How has it helped my organization?

Datadog has made setting up monitors easier, more reliable, and more transparent. This has helped standardize our on-call process and set all of our on-call engineers up for success.  

It has also standardized the way we evaluate issues with our applications by encouraging all teams to use the service catalog.  

It makes it easier for our platforms and QA teams to get other engineering teams up to speed with managing their own applications' performance. 

Overall, Datadog has been very helpful for us.

What is most valuable?

The service catalog view is very helpful for periodic reviews of our application. It has also standardized the way we evaluate issues with our applications.  Having one page with an easy-to-scan view of app metrics, error patterns, package vulnerabilities, etc., is very helpful and reduces friction for our full-stack engineers.

Monitors have also been very valuable when setting up our on-call processes. It makes it easy to set up and adjust alerting to keep our teams aware of anything going wrong.

What needs improvement?

Datadog is great overall. One thing to improve would be making it easier to see common patterns across traces. I sometimes end up in a trace but have a hard time finding other common features about the error/requests that are similar to that trace. This could be easier to get to; however, in that case, it's actually an education issue.  

Another thing that could be improved is the service list page sometimes refreshes slowly, and I accidentally click the wrong environment since the sort changes late.

For how long have I used the solution?

I've used the solution for about a year.

What do I think about the stability of the solution?

It is very stable. I have not seen any issues with Datadog.

What do I think about the scalability of the solution?

It seems very scalable.

How are customer service and support?

I've had no specific experience with technical support.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

We used Honeycomb before. We switched since Datadog offered more tooling.

How was the initial setup?

Each application has been easy to instrument.

What about the implementation team?

We implemented the solution in-house.

What was our ROI?

Engineers save an unquantifiable amount of time by having one standard view for all applications and monitors.

What's my experience with pricing, setup cost, and licensing?

I am not exposed to this aspect of Datadog.

Which other solutions did I evaluate?

We did not evaluate other options. 

Which deployment model are you using for this solution?

Public Cloud


    ZJ

Very good custom metrics, dashboards, and alerts

  • September 18, 2024
  • Review from a verified AWS customer

What is our primary use case?

Our primary use case for Datadog involves utilizing its dashboards, monitors, and alerts to monitor several key components of our infrastructure. 

We track the performance of AWS-managed Airflow pipelines, focusing on metrics like data freshness, data volume, pipeline success rates, and overall performance. 

In addition, we monitor Looker dashboard performance to ensure data is processed efficiently. Database performance is also closely tracked, allowing us to address any potential issues proactively. This setup provides comprehensive observability and ensures that our systems operate smoothly.

How has it helped my organization?

Datadog has significantly improved our organization by providing a centralized platform to monitor all our key metrics across various systems. This unified observability has streamlined our ability to oversee infrastructure, applications, and databases from a single location. 

Furthermore, the ability to set custom alerts has been invaluable, allowing us to receive real-time notifications when any system degradation occurs. This proactive monitoring has enhanced our ability to respond swiftly to issues, reducing downtime and improving overall system reliability. As a result, Datadog has contributed to increased operational efficiency and minimized potential risks to our services.

What is most valuable?

The most valuable features we’ve found in Datadog are its custom metrics, dashboards, and alerts. The ability to create custom metrics allows us to track specific performance indicators that are critical to our operations, giving us greater control and insights into system behavior. 

The dashboards provide a comprehensive and visually intuitive way to monitor all our key data points in real-time, making it easier to spot trends and potential issues. Additionally, the alerting system ensures we are promptly notified of any system anomalies or degradations, enabling us to take immediate action to prevent downtime. 

Beyond the product features, Datadog’s customer support has been incredibly timely and helpful, resolving any issues quickly and ensuring minimal disruption to our workflow. This combination of features and support has made Datadog an essential tool in our environment.

What needs improvement?

One key improvement we would like to see in a future Datadog release is the inclusion of certain metrics that are currently unavailable. Specifically, the ability to monitor CPU and memory utilization of AWS-managed Airflow workers, schedulers, and web servers would be highly beneficial for our organization. These metrics are critical for understanding the performance and resource usage of our Airflow infrastructure, and having them directly in Datadog would provide a more comprehensive view of our system’s health. This would enable us to diagnose issues faster, optimize resource allocation, and improve overall system performance. Including these metrics in Datadog would greatly enhance its utility for teams working with AWS-managed Airflow.

For how long have I used the solution?

I've used the solution for four months.

What do I think about the stability of the solution?

The stability of Datadog has been excellent. We have not encountered any significant issues so far. 

The platform performs reliably, and we have experienced minimal disruptions or downtime. This stability has been crucial for maintaining consistent monitoring and ensuring that our observability needs are met without interruption.

What do I think about the scalability of the solution?

Datadog is generally scalable, allowing us to handle and display thousands of custom metrics efficiently. However, we’ve encountered some limitations in the table visualization view, particularly when working with around 10,000 data points. In those cases, the search functionality doesn’t always return all valid results, which can hinder detailed analysis.

How are customer service and support?

Datadog's customer support plays a crucial role in easing the initial setup process. Their team is proactive in assisting with metric configuration, providing valuable examples, and helping us navigate the setup challenges effectively. This support significantly mitigates the complexity of the initial setup.

Which solution did I use previously and why did I switch?

We used New Relic before.

How was the initial setup?

The initial setup of Datadog can be somewhat complex, primarily due to the learning curve associated with configuring each metric field correctly for optimal data visualization. It often requires careful attention to detail and a good understanding of each option to achieve the desired graphs and insights

What about the implementation team?

We implemented the solution in-house.


    Aaron J.

Becoming the Gold Standard

  • June 26, 2024
  • Review provided by G2

What do you like best about the product?
DataDog provides thorough insights accross all of the important facets
What do you dislike about the product?
DataDog has an excellent offering and continues to provides new services to keep clients and fulfill the needs of the industry. However, with this comes a premium and in turn hinders its adoption or for those who use it, to use it completely.
What problems is the product solving and how is that benefiting you?
Providing key actionable insights


    Benjamin L.

This is a good product, but is only just starting to bubble up observability. Takes minutes

  • June 26, 2024
  • Review provided by G2

What do you like best about the product?
It is all in one place, the UI is fairly nice. Customer support is bomb! Thanks!
What do you dislike about the product?
Graphs are a bit small, it would be nice to have more datapoints. Also still no real useable HEATMAPS.
What problems is the product solving and how is that benefiting you?
Incident management using workflows/appbuilder/incident management
APM is growing by leaps and bounds


    Marketing and Advertising

Incredibly flexible, intuitive, and accessible

  • June 26, 2024
  • Review provided by G2

What do you like best about the product?
The ease of integration for most of their products like RUM, super easy to set up even with limited knowledge, and due to their wide variety of products you can have all your logs/alerting/monitoring in one place across multiple services.
What do you dislike about the product?
Once you get deep into custom metrics and analysis it can get confusing, their graphing/visualization tool is unfortunately not very intuitive and they are missing a lot of features that something like Tableau might have.
What problems is the product solving and how is that benefiting you?
Being able to track user activity/history makes it much easier when debugging errors


    Fund-Raising

Powerful insights for infrastructure, services, and applications

  • June 26, 2024
  • Review provided by G2

What do you like best about the product?
When implemented well, it has a comprehensive feature set that encompasses the entire SDLC.
What do you dislike about the product?
The documentation only enables a real-world application to a degree, but some hands on support/implementation would be beneficial and drive value for our Datadog investment.
What problems is the product solving and how is that benefiting you?
Datadog is helping us resolve incidents in near real-time.


    Airlines/Aviation

Good

  • June 25, 2024
  • Review provided by G2

What do you like best about the product?
Ability to create dashboard easily and customizing them
What do you dislike about the product?
Not a lot of feature. Available features are pretty basic.
What problems is the product solving and how is that benefiting you?
Improve observability of our platform