Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Cloudera on AWS

Cloudera | 1

Reviews from AWS customer

0 AWS reviews
  • 5 star
    0
  • 4 star
    0
  • 3 star
    0
  • 2 star
    0
  • 1 star
    0

External reviews

39 reviews
from and

External reviews are not included in the AWS star rating for the product.


    Shan Hasan

ETL processes benefit from cost-effective offloading and could see improved deployment capabilities

  • May 05, 2025
  • Review provided by PeerSpot

What is our primary use case?

The primary usage of Cloudera Data Platform is to offload ETL processes because it's cheaper compared to data warehouse solutions like Teradata or Oracle. Furthermore, basic reporting can be done, and some real-time processes can be managed.

What is most valuable?

The foremost benefit is offloading data from the warehouse to Cloudera Data Platform, which allows for cheaper storage. We use it to push transformations and run ETL processes, leveraging tools like Spark. Cloudera also supports various functionalities, including AI and Gen AI tools. Basic reporting and some real-time functions are manageable on the platform.

What needs improvement?

Cloudera Data Platform should include additional capabilities and features similar to those offered by other data management solutions like Azure and Databricks.

For how long have I used the solution?

I have been using Cloudera Data Platform for more than five years.

What was my experience with deployment of the solution?

The installation of Cloudera Data Platform had some challenges, but this is common with many products. An improved deployment process would help deliver solutions more quickly.

What do I think about the stability of the solution?

I would rate the stability of Cloudera Data Platform as eight out of ten.

What do I think about the scalability of the solution?

Integration with other tools works well for us and we successfully scaled the solution after two to three years without any issues. I would rate the scalability as eight out of ten.

How are customer service and support?

I have communicated with technical support, and they are responsive and helpful. I would rate their support as seven out of ten.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

Initially, the decision for Cloudera was driven by pricing and the support they provided.

How was the initial setup?

The initial setup may take several hours or days, depending on the challenges faced during installation. It's not always a smooth process due to potential complexities.

What about the implementation team?

The implementation involved multiple teams, including Cloudera support, with three to four people from our client's side involved.

What other advice do I have?

I recommend Cloudera Data Platform. Overall, I would rate it a seven out of ten despite the complexities in deployment. I suggest including my alternative email address for contact in case of access issues. The overall product rating is seven out of ten.

Which deployment model are you using for this solution?

On-premises


    Miodrag-Stanic

Distributed computing improves data processing while upgrade complexity needs addressing

  • April 14, 2025
  • Review provided by PeerSpot

What is our primary use case?

We heavily use Cloudera Data Platform for data science activities. Various departments in the company utilize it as a sandbox for data discovery. We have multiple data pipelines running on a daily and hourly basis, along with some real-time data pipelines.

What is most valuable?

Cloudera Data Platform has significantly improved our data management. Distributed computing with Spark has enabled many processing types that were not possible before. By using the Hadoop File System for distributed storage, we have 1.5 petabytes of physical storage with 500 terabytes of effective storage due to a replication factor of three.

What needs improvement?

There are challenges with upgrading or updating various services like Spark, Impala, and Hive on on-premise and bare metal solutions. We aim to address these issues with a Kubernetes-based platform that will simplify the task of upgrading services. We also wish to implement lakehouse capabilities with Iceberg or Delta Lake frameworks.

For how long have I used the solution?

I have been using Cloudera Data Platform since 2021. We began with a project a year prior, but it has been in production since then.

What do I think about the stability of the solution?

I would rate the stability of Cloudera Data Platform as seven out of ten.

What do I think about the scalability of the solution?

For scalability, I rate Cloudera Data Platform at an eight out of ten as it is an on-premise solution.

How are customer service and support?

I would rate the technical support from Cloudera as seven out of ten. Their support is helpful.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

Before Cloudera, we did not work with other big data platforms. This is our first big data platform, and we also have a classical data warehouse.

What about the implementation team?

We employed local vendors for the implementation, and from our company's side, around ten to twenty people were involved, including engineers, data scientists, and business personnel.

What's my experience with pricing, setup cost, and licensing?

The pricing model for Cloudera Data Platform is complex and has increased significantly compared to CDH. Initially, CDH had a straightforward pricing model based on nodes, but CDP includes factors like processors, cores, terabytes, and drives, making it difficult to calculate costs.

What other advice do I have?

For on-premise use, I would not recommend Cloudera Data Platform as it is expensive and complicated to upgrade. However, for cloud usage, I am uncertain as I do not use it on the cloud. Currently, around thirty to forty people use Cloudera Data Platform in our organization. My final rating for Cloudera Data Platform is seven out of ten.

Which deployment model are you using for this solution?

On-premises


    Sachin Shukre

Good for secure containerization, and governance capabilities

  • December 06, 2023
  • Review provided by PeerSpot

What is our primary use case?

We use it for multiple domains, including oil & gas, finance (Morgan Stanley), and healthcare. We process around 186 TB of data per day for analytics purposes.

Currently, we use it for healthcare domain. 

What is most valuable?

Distributed computing, secure containerization, and governance capabilities are the most valuable features.

What needs improvement?

Since Cloudera acquired HDP, it's been bundled with CBH and HDP. However, the biggest challenge is cloud storage integration with Azure, GCP, and AWS. These platforms offer competitive storage solutions like Gen2, Gen1, Bigtable, BigQuery, Lightstore, S3 buckets, etc., which pose a significant competition to HDP.

For how long have I used the solution?

I have experience with this product. The short form is HDP 2.7. I have been using it since 2011. 

It was on-premises and hybrid for the first three months, then we migrated it to AWS and Azure.

What do I think about the stability of the solution?

In terms of storing data in different formats, it's been somewhat unstable. But when compared to Azure Gen2 and its support and features, it's much more advanced. The suitability depends on specific use cases, but overall, HDP seems more mature than it was in the past.

What do I think about the scalability of the solution?

From my experience with both HDP and CDH, they are both scalable. Currently, most people in my company have shifted to Azure, so they are using Gen2 primarily and discarding Gen1.  

How are customer service and support?

I have frequently contacted technical support for both Cloudera and Hortonworks.

We have an IT system to raise issues against their team. Issues usually get attended by someone at an L1, L2, or L3 support level. They connect with us directly.

Which solution did I use previously and why did I switch?

Previously, we used Cloudera Data Platform (CDP), which turned out to be a cloud-based Azure infrastructure, and implemented metadata solutions like Hive and others.

How was the initial setup?

The setup was very difficult on non-cloud platforms. We had to implement a version-based approach. However, it became simpler with the use of Docker. We used to do it HDP sandboxes and VM boxes and then created clusters in the ancient days. Now, on cloud platforms, it's much easier, just a matter of a few clicks. That's another approach we can take.

What's my experience with pricing, setup cost, and licensing?

I haven't done a price analysis specifically for HDP. However, when it was first introduced as Hadoop 2.0, there were a few use cases where the price was quite high.

It was particularly expensive for Cloudera and Hortonworks Data Platform. Both options were quite resource-intensive.

So, seven, or even nine or ten years ago, it was quite expensive.

What other advice do I have?

I recommend a mature decision-making model. Assess your specific needs and use cases. If HDP suits your requirements, use it. Otherwise, there are many advanced options available. Review and choose the best one for your use case.

Overall, I would rate the solution a nine out of ten. 

I simply love this technology when it comes to new developments. And I've been working with it for the past twelve to thirteen years. However, with the emergence of new technologies, there might be a chance that I would reduce one point because there's room for improvement.  

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure


    Leslie Mavonyani

Helps with data management and has good scalability

  • August 31, 2023
  • Review provided by PeerSpot

What is our primary use case?

We use Hortonworks Data Platform for data management, significant data ingestion, and analytics.

What needs improvement?

Hortonworks Data Platform has a limited user community. I haven't seen much discussion about user experiences. More information could be there to simplify the process of running the product.

For how long have I used the solution?

We have been using Hortonworks Data Platform for a couple of months.

What do I think about the stability of the solution?

I rate the product's stability an eight out of ten.

What do I think about the scalability of the solution?

We have five Hortonworks Data Platform users in our organization. It is a scalable platform.

How was the initial setup?

The initial setup could be more straightforward. It would help if you are technically inclined to follow the necessary steps. There could be easy ways to set it up. It takes 45 minutes to complete and requires a team of five people to execute the process.

What about the implementation team?

We implement the product in-house.

What's my experience with pricing, setup cost, and licensing?

Currently, we are using the product in a sandbox environment, and there is no licensing. We might choose a licensing option once we get the results.

What other advice do I have?

I recommend Hortonworks Data Platform to others and rate it an eight out of ten.


    Information Technology and Services

Dropped the ball and company is in disarray

  • August 04, 2023
  • Review provided by G2

What do you like best about the product?
Was the coolest thing In 2014 when Big Data was the trend
What do you dislike about the product?
Lacks strategy and vision, chases trends and defers core customers
What problems is the product solving and how is that benefiting you?
Hadoop platform


    TonyOladipo

Upgrades and patching are addressed by the solution, and they offer a sandbox for testing

  • July 19, 2023
  • Review provided by PeerSpot

What is our primary use case?

There are a lot of use cases for the Hortonworks Data Platform. We use it alongside GPFS, so most of the information we use for operational analytics is primarily on the Hortonworks Data Platform.

What is most valuable?

The upgrades and patches must come from Hortonworks. Therefore, if we encounter any problems, they will be responsible for addressing them. This is one of the instances where we have to rely on them for all the upgrades.

What needs improvement?

The cost of the solution is high and there is room for improvement.

For how long have I used the solution?

I have been using the Hortonworks Data Platform for two years.

What do I think about the scalability of the solution?

Hortonworks Data Platform is scalable, but it lacks the capability for horizontal scaling. Therefore, we need to add more servers to increase its capacity.

How was the initial setup?

I am responsible for setting up the infrastructure, but I don't handle the engineering work.

What other advice do I have?

I would rate Hortonworks Data Platform an eight out of ten. The solution delivers on its promises, and Hortonworks provides a sandbox for testing before making a purchase.

The maintenance requires a lot of people, including the DRE and IRE teams.

It is not practical for most organizations that lack large amounts of resources to maintain their own data platform. The Hortonworks Data Platform makes it easier for such organizations.


    Ishita G.

Very robust and scalable application

  • June 20, 2023
  • Review provided by G2

What do you like best about the product?
It is wonderful technology to use while working on the confidential data
What do you dislike about the product?
Hope to see more UI improvements in future
What problems is the product solving and how is that benefiting you?
Building big data solutions and working on large scale data


    Suresh S.

My experience with cloudera is good

  • June 20, 2023
  • Review provided by G2

What do you like best about the product?
EASY to use and friendly. Most of the time it works without any issues.
What do you dislike about the product?
Have to improve on providing documentation for various errors
What problems is the product solving and how is that benefiting you?
I have used Cloudera while in a company to solve mobile network related issues using big data analytics


    Ahmed S.

Excelent

  • June 19, 2023
  • Review provided by G2

What do you like best about the product?
It serve my business, help me in reaching more customers and help my customers to find my software .
What do you dislike about the product?
Technical support , mobile app , company location .
What problems is the product solving and how is that benefiting you?
marketing, provide good references , ease to use, ai


    Võ Q.

The most comprehensive big data stack

  • June 15, 2023
  • Review provided by G2

What do you like best about the product?
Fullly implement hadoop ecosystem and control version compability as well. It also has quickly supported on technical and setting up issues. The community is really large and good.
What do you dislike about the product?
The documentation layout is quite unclear. The community version setting up guides are not working. I think cloudera should provide the simple version in docker or helm chart for trial easier.
What problems is the product solving and how is that benefiting you?
Cloudera platform provides the powerful processing tools that help our company save very much time to setup hadoop ecosystem. It also allows us to monitor and control the big data infrastucture easier and more efficient.