Helps users with data processing and analytics
What is our primary use case?
I use Databricks to manage the setting up of data lakes for SaaS.
What needs improvement?
The biggest problem associated with the product is that it is quite pricey. We cannot find a better solution than Databricks in the market currently.
For how long have I used the solution?
I have been using Databricks for a year.
What's my experience with pricing, setup cost, and licensing?
It is an expensive tool. The licensing model is a pay-as-you-go one.
What other advice do I have?
The tool helps with data processing and analytics with large-scale data or big data since it is associated with managing data at a large scale.
For my general use cases, I would say that I am not a technical person, so I cannot explain to you how the tool helps with the area of data engineering tasks.
There is another team in my company that is involved in the use of machine learning and AI features in Databricks. My team is mostly into operations. The tool is used in a multi-country project.
For example, in my company, they make some shopping decisions related to solutions based on what is the product chosen by the whole company.
I rate the tool an eight out of ten.
Databricks Review
What do you like best about the product?
The greatest upside to the Databricks Platform that it's constantly being developed. Databricks as well as other companies are developing code and utilities to run on this platform. Notably Mosaic AI, has a tool called Mosaic Composer that is a low-code acellerator for training AI models which has been very benefical to use.
What do you dislike about the product?
I dislike that Databricks is beginning to abstract some of the configurability options available. For example, Databricks serverless. I want to keep the ability to tailor a cluster and libaries specific to my use-case rather than it handled by Databricks.
What problems is the product solving and how is that benefiting you?
Decreasing time from data to model.
Manager Data Science
What do you like best about the product?
Unified platform with lots of capabilities, open source based. No vendor locked.
What do you dislike about the product?
Appears to have a learning curve. Not now-no code environment on par with PowerBI.
What problems is the product solving and how is that benefiting you?
Able to manage large-scale data and easily upkeep the pipelines for it.
Slice through billions of records like butter
What do you like best about the product?
As a data scientist I'm able to analyze billions of records from my web browser without having to worry about memory or CPU limitations on my laptop.
What do you dislike about the product?
There are a lot of ways to configure the platform which can be intimidating for new users.
What problems is the product solving and how is that benefiting you?
Enable analysis of large datasets that was not possible with legacy tools.
Recommend Databricks as the Data Platform for Most use cases
What do you like best about the product?
Ease of Implementation, Prompt Customer Support, Ease of Use
What do you dislike about the product?
Too much focus on exciting user to ramp up DBUs (Cost), Over communicating capabilities even as they are not matured enough
What problems is the product solving and how is that benefiting you?
Capability to bring diverse data together, and capability to process same at scale
It provides unified solutions for end-to-end Data Science project.
What do you like best about the product?
Data processing, scheduling and management, in particular ETL. Big data democratization and collaboration opportunities, the Databricks team aims to make big data analytics easier for enterprises.
What do you dislike about the product?
File management on DBFS can be improved.
Visualization in MLFLOW experiment can be enhanced.
What problems is the product solving and how is that benefiting you?
By providing unified intelligence Data Platform to handle messy, siloed and slow data.
"Unlocking Data Potential: A Comprehensive Review of Databricks"
What do you like best about the product?
Interactive user interface.
A powerful tool that contains Computer power, integration with different sources of data, integrated Machine Learning, and collaboration with team using the workspace.
Provide workflow and Delta live Table pipeline which makes this fantastic.
What do you dislike about the product?
I am not able to migrate Dlt pipelines from the dev workspace to the prod workspace using Azure DevOps, if there is any way please let me know.
What problems is the product solving and how is that benefiting you?
it provides workspace, notebook, Apache spark, Delta Lake, and Scalability which processes large amounts of data easily.
It has a centralized environment by which team can work collaboratively.
Its integration power can integrate with ADF, SYNAPSE and Data warehouse easily.
Cloud resources limit 11
Hi Databricks team
I want to increase AWS Databricks Cloud resources limit. We have limit Cloud resources 11 and VPC limit 5.
Databricks Plan Premium.
1. Storage configuration
2. Credential configuration.
Kindly share the process to increase.
If anything, you need from my side please let me know.
Amazing tool for data processing which uses spark an
What do you like best about the product?
It supports all the cloud platforms i.e azure, Aws and GCP
What do you dislike about the product?
As of now no any negative thing i know about this tool
What problems is the product solving and how is that benefiting you?
We are using databricks platform to run jobs which helps for data processing, Previously job was running on hadoop map reduce which was taking too much of time for processing but now it completes jobs too quickly
Interesting Product for Data Science
What do you like best about the product?
Data lineage is one of the valuable features I like in Unity Catalog. It makes it easy for us to do data debugging and analysis of data calculations with access control.
What do you dislike about the product?
Not much, but for ease of understanding, the data lineage visualization section needed some work.
What problems is the product solving and how is that benefiting you?
In the data science project, many times we needed to verify and recheck the calculations of the KPI matrix and what table and transformation were needed for this. We had difficulty tracing back the flow of calculations through the original data. The Unity Catalog directly provides a data lineage feature for this. We can easily determine what data transfer was required and how this KPI matrix was calculated from the original data through the dashboard and graphs with the access control feature.