Listing Thumbnail

    Cloudera on AWS

     Info
    Sold by: Cloudera 
    Deployed on AWS
    An enterprise data cloud that manages, secures and connects the data lifecycle in AWS. Cloudera delivers powerful self-service analytics across hybrid and multi-cloud environments, along with sophisticated and granular security and governance policies that IT and data leaders demand.

    Overview

    Play video

    Cloudera on AWS is an enterprise data platform that is easy to deploy, manage, and use. By simplifying operations, Cloudera reduces the time to onboard new use cases. Cloudera manages data in any environment, including multiple public clouds, private cloud, and hybrid cloud. With Cloudera's Shared Data Experience (SDX), IT can confidently deliver secure and governed analytics running against data anywhere. Cloudera is a new approach to enterprise data, running anywhere from the Edge to AI.

    Cloudera on AWS delivers easy-to-use analytics that support the most complex, demanding use cases

    Complete: All functions needed to ingest, transform, query, optimize, and make predictions from data are available, eliminating the need for point products

    Integrated: Unified analytic functions work together eliminating data silos and copies of data

    Cloudera SDX technologies ensures and enterprise data cloud is secure by design:

    Consistency: Security and governance policies are set once and applied across all data and workloads

    Portability: Policies stay with the data even as it moves across all supported infrastructures

    Pricing: Use of Cloudera on AWS requires a prepay commitment (in dollars) of cloud credits. For more information on usage rates and instance types, see cloudera.com/products/pricing.html.

    You may use the platform until your commitment is consumed (used against prepaid commitment amount), any additional usage beyond the prepaid commitment will require negotiation with Cloudera for the purchase of additional prepaid credits.

    Highlights

    • Provides elasticity, agility, and ease of use for hybrid and public cloud by intelligently autoscaling workloads up and down for more cost-effective use of cloud infrastructure. Consistent user experience makes it faster and easier to analyze data.
    • Optimizes the data lifecycle with multi-function analytics that solves demanding business use cases. Cloudera on AWS is composed of three primary services with a standardized user experience: Data Warehouse, Machine Learning and Data Hub for custom analytics.
    • Ensures all workloads on the platform share common security, governance, and metadata. Users can efficiently find, curate, and share data, enabling self-service access to trusted data and analytics

    Details

    Delivery method

    Deployed on AWS

    Unlock automation with AI agent solutions

    Fast-track AI initiatives with agents, tools, and solutions from AWS Partners.
    AI Agents

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Cloudera on AWS

     Info
    Pricing is based on the duration and terms of your contract with the vendor, and additional usage. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing is in effect for overages or additional usage not covered in the contract. These charges are applied on top of the contract price. If you choose not to renew or replace your contract before the contract end date, access to your entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    12-month contract (1)

     Info
    Dimension
    Description
    Cost/12 months
    Cloudera
    Subscription Cloudera on AWS
    $50,000.00

    Additional usage costs (1)

     Info

    The following dimensions are not included in the contract terms, which will be charged based on your usage.

    Dimension
    Cost/unit
    Consumption by Customer based on Cloud usage
    $0.01

    Vendor refund policy

    No refunds available

    Custom pricing options

    Request a private offer to receive a custom quote.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Resources

    Support

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    10
    In Data Analysis
    Top
    10
    In Databases & Analytics Platforms, ML Solutions, Data Analytics
    Top
    10
    In Data Warehouses

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Data Platform Architecture
    Enterprise data platform supporting multi-cloud, hybrid cloud, and on-premises data management environments
    Security and Governance Framework
    Shared Data Experience (SDX) technology providing consistent security and governance policies across data workloads and infrastructures
    Multi-Function Analytics
    Integrated analytics platform supporting data ingestion, transformation, querying, optimization, and predictive modeling without requiring separate point products
    Workload Optimization
    Intelligent autoscaling capabilities for dynamically adjusting cloud infrastructure resources based on computational requirements
    Data Lifecycle Management
    Comprehensive platform supporting data processing across multiple services including Data Warehouse, Machine Learning, and custom analytics environments
    Data Platform Architecture
    Unified platform integrating data engineering, analytics, business intelligence, data science, and machine learning on a single architecture
    Open Source Foundation
    Built on open source data projects with support for open standards and data formats
    Lakehouse Infrastructure
    Provides a common data management approach using a lakehouse architecture running on Amazon S3
    Data Intelligence Engine
    Advanced engine capable of interpreting organizational data context and enabling broad data access across teams
    Collaborative Workflow
    Native collaboration capabilities enabling cross-functional data and AI workflow integration
    Data Lake Query Performance
    Provides sub-second query response times using SQL query service on data lake platforms
    Open Standards Support
    Utilizes community-driven standards like Apache Iceberg and Apache Arrow for processing engines
    Multi-Source Data Integration
    Enables joining data from data lakes and external databases without data movement
    Compute Engine Management
    Automatically handles compute engine lifecycle including provisioning, scaling, pausing, and decommissioning
    VPC-Based Data Processing
    Deploys compute engines within customer's Amazon Virtual Private Cloud for secure data processing

    Security credentials

     Info
    Validated by AWS Marketplace
    FedRAMP
    GDPR
    HIPAA
    ISO/IEC 27001
    PCI DSS
    SOC 2 Type 2
    No security profile
    No security profile
    -
    -
    -
    -

    Contract

     Info
    Standard contract
    No
    No
    No

    Customer reviews

    Ratings and reviews

     Info
    0 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    0%
    0%
    0%
    0%
    0%
    0 AWS reviews
    |
    39 external reviews
    Star ratings include only reviews from verified AWS customers. External reviews can also include a star rating, but star ratings from external reviews are not averaged in with the AWS customer star ratings.
    Shan Hasan

    ETL processes benefit from cost-effective offloading and could see improved deployment capabilities

    Reviewed on May 05, 2025
    Review provided by PeerSpot

    What is our primary use case?

    The primary usage of Cloudera Data Platform  is to offload ETL processes because it's cheaper compared to data warehouse solutions like Teradata  or Oracle. Furthermore, basic reporting can be done, and some real-time processes can be managed.

    What is most valuable?

    The foremost benefit is offloading data from the warehouse to Cloudera Data Platform , which allows for cheaper storage. We use it to push transformations and run ETL processes, leveraging tools like Spark. Cloudera also supports various functionalities, including AI and Gen  AI tools. Basic reporting and some real-time functions are manageable on the platform.

    What needs improvement?

    Cloudera Data Platform should include additional capabilities and features similar to those offered by other data management solutions like Azure  and Databricks .

    For how long have I used the solution?

    I have been using Cloudera Data Platform for more than five years.

    What was my experience with deployment of the solution?

    The installation of Cloudera Data Platform had some challenges, but this is common with many products. An improved deployment process would help deliver solutions more quickly.

    What do I think about the stability of the solution?

    I would rate the stability of Cloudera Data Platform as eight out of ten.

    What do I think about the scalability of the solution?

    Integration with other tools works well for us and we successfully scaled the solution after two to three years without any issues. I would rate the scalability as eight out of ten.

    How are customer service and support?

    I have communicated with technical support, and they are responsive and helpful. I would rate their support as seven out of ten.

    How would you rate customer service and support?

    Neutral

    Which solution did I use previously and why did I switch?

    Initially, the decision for Cloudera was driven by pricing and the support they provided.

    How was the initial setup?

    The initial setup may take several hours or days, depending on the challenges faced during installation. It's not always a smooth process due to potential complexities.

    What about the implementation team?

    The implementation involved multiple teams, including Cloudera support, with three to four people from our client's side involved.

    What other advice do I have?

    I recommend Cloudera Data Platform. Overall, I would rate it a seven out of ten despite the complexities in deployment. I suggest including my alternative email address for contact in case of access issues. The overall product rating is seven out of ten.

    Which deployment model are you using for this solution?

    On-premises
    Miodrag-Stanic

    Distributed computing improves data processing while upgrade complexity needs addressing

    Reviewed on Apr 14, 2025
    Review provided by PeerSpot

    What is our primary use case?

    We heavily use Cloudera Data Platform  for data science activities. Various departments in the company utilize it as a sandbox for data discovery. We have multiple data pipelines running on a daily and hourly basis, along with some real-time data pipelines.

    What is most valuable?

    Cloudera Data Platform  has significantly improved our data management. Distributed computing with Spark has enabled many processing types that were not possible before. By using the Hadoop  File System for distributed storage, we have 1.5 petabytes of physical storage with 500 terabytes of effective storage due to a replication factor of three.

    What needs improvement?

    There are challenges with upgrading or updating various services like Spark, Impala, and Hive  on on-premise and bare metal solutions. We aim to address these issues with a Kubernetes-based platform that will simplify the task of upgrading services. We also wish to implement lakehouse capabilities with Iceberg or Delta Lake frameworks.

    For how long have I used the solution?

    I have been using Cloudera Data Platform since 2021. We began with a project a year prior, but it has been in production since then.

    What do I think about the stability of the solution?

    I would rate the stability of Cloudera Data Platform as seven out of ten.

    What do I think about the scalability of the solution?

    For scalability, I rate Cloudera Data Platform at an eight out of ten as it is an on-premise solution.

    How are customer service and support?

    I would rate the technical support from Cloudera as seven out of ten. Their support is helpful.

    How would you rate customer service and support?

    Neutral

    Which solution did I use previously and why did I switch?

    Before Cloudera, we did not work with other big data platforms. This is our first big data platform, and we also have a classical data warehouse.

    What about the implementation team?

    We employed local vendors for the implementation, and from our company's side, around ten to twenty people were involved, including engineers, data scientists, and business personnel.

    What's my experience with pricing, setup cost, and licensing?

    The pricing model for Cloudera Data Platform is complex and has increased significantly compared to CDH. Initially, CDH had a straightforward pricing model based on nodes, but CDP includes factors like processors, cores, terabytes, and drives, making it difficult to calculate costs.

    What other advice do I have?

    For on-premise use, I would not recommend Cloudera Data Platform as it is expensive and complicated to upgrade. However, for cloud usage, I am uncertain as I do not use it on the cloud. Currently, around thirty to forty people use Cloudera Data Platform in our organization. My final rating for Cloudera Data Platform is seven out of ten.

    Which deployment model are you using for this solution?

    On-premises
    Sachin Shukre

    Good for secure containerization, and governance capabilities

    Reviewed on Dec 06, 2023
    Review provided by PeerSpot

    What is our primary use case?

    We use it for multiple domains, including oil & gas, finance (Morgan Stanley), and healthcare. We process around 186 TB of data per day for analytics purposes.

    Currently, we use it for healthcare domain. 

    What is most valuable?

    Distributed computing, secure containerization, and governance capabilities are the most valuable features.

    What needs improvement?

    Since Cloudera acquired HDP, it's been bundled with CBH and HDP. However, the biggest challenge is cloud storage integration with Azure, GCP, and AWS. These platforms offer competitive storage solutions like Gen2, Gen1, Bigtable, BigQuery, Lightstore, S3 buckets, etc., which pose a significant competition to HDP.

    For how long have I used the solution?

    I have experience with this product. The short form is HDP 2.7. I have been using it since 2011. 

    It was on-premises and hybrid for the first three months, then we migrated it to AWS and Azure.

    What do I think about the stability of the solution?

    In terms of storing data in different formats, it's been somewhat unstable. But when compared to Azure Gen2 and its support and features, it's much more advanced. The suitability depends on specific use cases, but overall, HDP seems more mature than it was in the past.

    What do I think about the scalability of the solution?

    From my experience with both HDP and CDH, they are both scalable. Currently, most people in my company have shifted to Azure, so they are using Gen2 primarily and discarding Gen1.  

    How are customer service and support?

    I have frequently contacted technical support for both Cloudera and Hortonworks.

    We have an IT system to raise issues against their team. Issues usually get attended by someone at an L1, L2, or L3 support level. They connect with us directly.

    Which solution did I use previously and why did I switch?

    Previously, we used Cloudera Data Platform (CDP), which turned out to be a cloud-based Azure infrastructure, and implemented metadata solutions like Hive and others.

    How was the initial setup?

    The setup was very difficult on non-cloud platforms. We had to implement a version-based approach. However, it became simpler with the use of Docker. We used to do it HDP sandboxes and VM boxes and then created clusters in the ancient days. Now, on cloud platforms, it's much easier, just a matter of a few clicks. That's another approach we can take.

    What's my experience with pricing, setup cost, and licensing?

    I haven't done a price analysis specifically for HDP. However, when it was first introduced as Hadoop 2.0, there were a few use cases where the price was quite high.

    It was particularly expensive for Cloudera and Hortonworks Data Platform. Both options were quite resource-intensive.

    So, seven, or even nine or ten years ago, it was quite expensive.

    What other advice do I have?

    I recommend a mature decision-making model. Assess your specific needs and use cases. If HDP suits your requirements, use it. Otherwise, there are many advanced options available. Review and choose the best one for your use case.

    Overall, I would rate the solution a nine out of ten. 

    I simply love this technology when it comes to new developments. And I've been working with it for the past twelve to thirteen years. However, with the emergence of new technologies, there might be a chance that I would reduce one point because there's room for improvement.  

    Which deployment model are you using for this solution?

    Hybrid Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Microsoft Azure
    Leslie Mavonyani

    Helps with data management and has good scalability

    Reviewed on Aug 31, 2023
    Review provided by PeerSpot

    What is our primary use case?

    We use Hortonworks Data Platform for data management, significant data ingestion, and analytics.

    What needs improvement?

    Hortonworks Data Platform has a limited user community. I haven't seen much discussion about user experiences. More information could be there to simplify the process of running the product.

    For how long have I used the solution?

    We have been using Hortonworks Data Platform for a couple of months.

    What do I think about the stability of the solution?

    I rate the product's stability an eight out of ten.

    What do I think about the scalability of the solution?

    We have five Hortonworks Data Platform users in our organization. It is a scalable platform.

    How was the initial setup?

    The initial setup could be more straightforward. It would help if you are technically inclined to follow the necessary steps. There could be easy ways to set it up. It takes 45 minutes to complete and requires a team of five people to execute the process.

    What about the implementation team?

    We implement the product in-house.

    What's my experience with pricing, setup cost, and licensing?

    Currently, we are using the product in a sandbox environment, and there is no licensing. We might choose a licensing option once we get the results.

    What other advice do I have?

    I recommend Hortonworks Data Platform to others and rate it an eight out of ten.

    Information Technology and Services

    Dropped the ball and company is in disarray

    Reviewed on Aug 04, 2023
    Review provided by G2
    What do you like best about the product?
    Was the coolest thing In 2014 when Big Data was the trend
    What do you dislike about the product?
    Lacks strategy and vision, chases trends and defers core customers
    What problems is the product solving and how is that benefiting you?
    Hadoop platform
    View all reviews