Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Reviews from AWS customer

6 AWS reviews

External reviews

627 reviews
from and

External reviews are not included in the AWS star rating for the product.


4-star reviews ( Show all reviews )

    Asna K.

Best platform for data engineering and data science

  • January 07, 2025
  • Review provided by G2

What do you like best about the product?
We used Databricks for its features such asreal time data processing and dat exploration tools for visualizing data.AutoML and Mlflow is one of the best AI integration in this platform.This software is cost efficient
What do you dislike about the product?
Limited tutorials for new users , not beginner freindly interface
What problems is the product solving and how is that benefiting you?
We used this platform analyzing and processing big data and process data from various formats, this tool is really great


    ShubhamSharma7

Capability to integrate diverse coding languages in a single notebook greatly enhances workflow

  • January 03, 2025
  • Review provided by PeerSpot

What is our primary use case?

I am working as a data engineer at Fractal. On a daily basis, I work on Azure Cloud, and I use Databricks frequently. We have EDF pipelines and utilize Synapse for our daily tasks.

What is most valuable?

Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. 

I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.

What needs improvement?

As a data engineer, I see cluster failure in our Databricks user databases as a major issue. I am unsure why, however, our flow, typically involving three to four notebooks, sometimes leads to cluster failure. Despite attempts to identify the problem, there are times when the reason remains unclear. Adjusting features like worker nodes and node utilization during cluster creation could mitigate these failures.

For how long have I used the solution?

I have been using the solution for three years now.

What do I think about the stability of the solution?

Cluster failure is one of the biggest weaknesses I notice in our Databricks.

Which solution did I use previously and why did I switch?

Databricks is beneficial for cost-saving since clients I work for transitioned from AWS Cloud to Azure Cloud for this reason.

How was the initial setup?

The initial setup is very straightforward for us.

What's my experience with pricing, setup cost, and licensing?

I am not very aware of the pricing. We use three to four clusters in our project. Increasing the number or size of clusters, such as adding more workers, would result in higher costs. That's why we limit ourselves to four clusters for our business.

Which other solutions did I evaluate?

In terms of cost efficiency, it's very useful because our clients switched from AWS Cloud to Azure Databricks to save costs.

What other advice do I have?

I would rate the overall product eight out of ten. 

Everything is probably good as far as I have used it, but there's room for improvement in cluster integration. Enhancing cluster capabilities while keeping costs lower would be beneficial.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure


    John S.

The Best Data Engineering Tool uses Delta Lake

  • January 02, 2025
  • Review provided by G2

What do you like best about the product?
This tool is very efficient because it using Delta lake. This supports ETL Pipelines and Machine Learning workflows which Guide to extract and transform data into Various forms. And i like the interactive notebooks supporting python language .
AutoML and Delta Lake is best features.
What do you dislike about the product?
This tool in begining there is complexity for using now it became simople.
What problems is the product solving and how is that benefiting you?
the problems solved this tool , hectic data analysis and processing many type of datas


    Jessica S.

The best Bigdata Processing Tool

  • December 31, 2024
  • Review provided by G2

What do you like best about the product?
I have used this tool for past two years , the attractive feature were faster data processing and data warehousing. i can easily intergrate it with power bi so it become easy to implement it
What do you dislike about the product?
I dont like the interface of this tool , and also latency issues
What problems is the product solving and how is that benefiting you?
My main problem was processing data from clients and upload the processed data to cloud by using this , this task became very easy


    Arsath H.

Revolutionizing Data analytics and AI integration

  • December 31, 2024
  • Review provided by G2

What do you like best about the product?
MLflow , and coloborative notebooks are the main feature of this tool and anothere features i like about his is Data Lake Storage layer nd Auto ml model traing helps for efficient processing.
What do you dislike about the product?
I dont like the SQlanalytics feature , gives error most of time , better improving this .
What problems is the product solving and how is that benefiting you?
We using this tool for Data Warehousing and Dataprocessing in a bulk , by using this tool we can improve time efficently


    Akash K.

Big Data processing using Databricks

  • December 31, 2024
  • Review provided by G2

What do you like best about the product?
the best feature in this tool is end to end machine learning life cycle ,and api for data processing. This tool also have Delta sharing and cross-functional collaboration by this big data can be processed efficiently.
What do you dislike about the product?
We cant process specific use cases in data processing and also this tool is not affordable
What problems is the product solving and how is that benefiting you?
We used this product because of data reliablility and security, this tool has high security and high speed data processing


    Accounting

Designated as Associate Data engineer, sharing my experience as a feedback using this feedback form

  • September 11, 2024
  • Review provided by G2

What do you like best about the product?
The Collaboration of everything on one platform - MLflow, SQL, Warehouse, Data analyst tools and data engineer tools makes learning of different roles and new verticals easy to process.
What do you dislike about the product?
AI integration can be improvised, can provide more credits for their different teir plans, should add more data visualisation support
What problems is the product solving and how is that benefiting you?
Bringing all the team on a single platform makes integration and pipelining things a lot easier, apart from support from databricks having things open-source delta and unity catalog this becomes much more versatile for us


    Information Technology and Services

Easy to build data pipeline

  • September 08, 2024
  • Review provided by G2

What do you like best about the product?
Ease of implementation and modification of scripts
What do you dislike about the product?
Has a bit of a learning curve if new to the field
What problems is the product solving and how is that benefiting you?
User friendly and ease of integration with other services


    Bhagirath S.

Very likely to recommend data brick intelligence platform

  • September 06, 2024
  • Review provided by G2

What do you like best about the product?
i like easy to use interface with strong features supporting and helping in data transformation and implementing data pipeline quickly. i also like it has capability to be used by data analyst using query and result in dashboard , data engineer using notebooks, workflow, CDC, Timetravel, DLT and data scientist with ML model all in one intelligent interface.
What do you dislike about the product?
i do not like only 30 days trail period and missing update mail from databricks.
What problems is the product solving and how is that benefiting you?
Databrick is solving multiple data tools issues for multiple role, IT unified all strong features by provding data lakehouse with ETL, warehouse , ML feature support on top of external cloud host with GEN AI capability too. I am using databrick in my current engagement and it is helping me as simple and strong tool to ingest and transform data from bronze to gold layer using workflows.


    reviewer2514822

Provides resources to users quickly without much hassle

  • July 15, 2024
  • Review provided by PeerSpot

What is our primary use case?

I have recently gotten into Databricks and trained on one model. I started using Databricks because of its hardware support and all the other things that it provides, and it is easier to get into. Earlier, when I had to test some part of my code or test if it was working or not, it was not just a fair, not a full production run, but just a fair testing; I had to get a machine, raise a request, get into the whole process. With Databricks, I can just simply create one myself. I could get the resources, whatever they are required, test it out all there, and then go ahead with that, and that is why I have been using it primarily.

What is most valuable?

The most valuable features of the solution are the hardware and the resources it quickly provides without much hassle.

What needs improvement?

I think setting up the whole account for one person and giving access are areas that can be difficult to manage and should be made a little easier.

For how long have I used the solution?

I have experience with Databricks.

What do I think about the stability of the solution?

I think there's a duration after which our training without any activity would expire, which I think is a fair point, and that is the only place where I think this will stop. I haven't come across a lot of problems with Databricks.

What do I think about the scalability of the solution?

The tool is not used as frequently as PyTorch. I don't know why I am comparing Databricks to PyTorch, but I think around five people use it.

How are customer service and support?

I have not contacted the solution's technical support team.

Which solution did I use previously and why did I switch?

Before Databricks, I used to use a cloud support platform.

How was the initial setup?

The solution is deployed on the cloud.

Which other solutions did I evaluate?

I chose Databricks over other products, considering the hardware support it offers.

What other advice do I have?

A little bit of time will be needed to get comfortable with Databricks.

I rate the tool an eight out of ten.