We use the solution for data science and machine learning.
Reviews from AWS customer
-
5 star0
-
4 star0
-
3 star0
-
2 star0
-
1 star0
External reviews
External reviews are not included in the AWS star rating for the product.
Data okie for Project Management
Streamlining Data Science Workflows
Helpful for project collaboration
Saves a lot of time because I can quickly handle all the data preparation tasks and concentrate on building my machine learning algorithms
What is our primary use case?
How has it helped my organization?
We were a team of six Dataiku scientists and one data engineer. We focused on fully leveraging Dataiku for all our data science-related tasks. This included data preparation, preprocessing, benchmarking machine learning algorithms, handling everything related to production, and making our algorithms available to stakeholders.
What is most valuable?
The advantage is that you can focus on machine learning while having access to what they call 'recipes.' These recipes allow me to preprocess and prepare data without writing any code. This saves a lot of time because I can quickly handle all the data preparation tasks and concentrate on building my machine learning algorithms.
What needs improvement?
One of the main challenges was collaboration. Developers typically use GitHub to push and manage code, but integrating GitHub with Dataiku was complicated. While it was theoretically possible to use GitHub with Dataiku, in practice, it was difficult to manage our code effectively and push it from Dataiku to GitHub.
Another limitation was its ability to handle different types of data. While Dataiku is powerful for working with structured data, like regular or geospatial data, it struggled with more complex data types such as text and image. In addition to the challenges with GitHub integration, the limited support for diverse data types was another feature lacking at that time.
For how long have I used the solution?
I have been using Dataiku for over a year.
What do I think about the stability of the solution?
Since Dataiku relies on various open-source libraries and tools, updates or upgrades to these components can sometimes impact the stability of Dataiku's features. This can make it challenging to maintain consistent stability, as changes in the underlying open-source tools can affect how Dataiku functions.
I rate the stability as six out of ten.
What do I think about the scalability of the solution?
There are some scalability issues.
I rate the scalability as seven out of ten.
How are customer service and support?
Technical support was very good compared to other tools. We had access to chat and support.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup is very easy. It has many tutorials and many guidelines. After the initial deployment, it took about a week to manage all the setup and resolve various issues before we had a stable version of Dataiku that we could use consistently.
I rate it as eight out of ten, whereas ten is easy.
What's my experience with pricing, setup cost, and licensing?
It is very expensive.
What other advice do I have?
I wouldn't recommend using Dataiku if only one data scientist is on the team. However, having a larger team—let's say more than five data scientists—can be very helpful. Dataiku offers features that are especially useful when multiple people are working on the same project, and it also has tools that make it easier to move from the proof of concept stage to production.
Overall, I rate the solution as seven out of ten.
Which deployment model are you using for this solution?
Gives different aspects of modeling approaches and good for multiple teams' collaboration
What is our primary use case?
My current client has Dataiku. We do sentiment analysis and some small large language models right now. We use Dataiku as a Jupyter Notebook.
We use it a lot for marketing and analytics. The marketing and sales team uses Dataiku.
What is most valuable?
It's got good feature selection and creation of feature stores, and it also gives different aspects of modeling approaches. There are a lot of similarities with DataRobot.
So feature selection, different modeling, and financial metrics are good aspects.
What needs improvement?
The no-code/low-code aspect, where DataRobot doesn't need much coding at all.
Dataiku still needs some coding, and that could be a difference where business data scientists would go for DataRobot more than Dataiku because you still have to code and use either Python or R, or Scala. However, with DataRobot, you don't have to do that at all.
For how long have I used the solution?
I've used Dataiku for about four years.
How are customer service and support?
The company is based in France. But they're more and more in America as well.
So, the support was okay.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I used DataRobot. Dataiku has a different kind of structure to it. It's not financially heavy like DataRobot, which caters more to financial companies, like banks. Dataiku doesn't have that yet. I think they are also working on that area. But yeah, there are some key differences between the two products.
DataRobot has an additional feature with financial firms that it creates all these financial metrics when you run a time series analysis. Those things I have not seen in Dataiku.
If any financial company is choosing between DataRobot and Dataiku, they will definitely go for DataRobot because it creates all these financial metrics. It creates deltas, time series, time difference fields, and things like that. So, that is an added feature that DataRobot has.
What's my experience with pricing, setup cost, and licensing?
Pricing is pretty steep. Dataiku is also not that cheap. It depends on the client and how much they want to spend towards a tool.
What other advice do I have?
Overall, I would rate it an eight out of ten, except for some coding things that are there, which some people may not want to do, like certain business data scientists.
Dataiku is good for multiple teams' collaboration. If many teams are collaborating and sharing Jupyter notebooks, it's very useful. It has a good data processing structure and includes most of the models. I haven't checked the large language models in it yet, but it's a pretty good tool. It does well with analytics and has a sound structure on the back end.
Some coding aspects are necessary, but it generates SQL code, which is an added feature. A lot of data engineers like Dataiku because it generates SQL code on the right side.
Robust tool for data science engineer
Easy to use: this tool is easy to use.
It's very user friendly
Easy to integration: this tool is easy to integration with other platforms.
So that we could collaborate with other team.
No of features: there are no of features .like
1.robust data integration
2. scalability
3. Visual data preparation
4.modelmonitering
Etc
Easy to implementation: dataiku dss is easy to implementation
Customer support: this is very good customer support.
Cost: cost is also high .this is not suitable for small organization .
Limited free version:there are limitations of free version.
It's robust data integration tool for it integrate data from various data sources
Best tool to Analysis and Presenting data in different Dimention
Dataiku is the best tool to present data different views and representation data.
It can made available with other tool where they can connect the date to analyze it.
All-in-one data pipeline management tool
Its recently added features like ML Notebooks are very useful for running Machine Learning classification or regression tasks on a dataset. Dataiku has complete support for training and evaluating Machine Learning models.