Simplifying Complex Data Pipelines
What do you like best about the product?
I like how StreamSets makes building and managing data pipelines very intuitive with its drag-and-drop interface. It supports real-time streaming as well as batch processing, and the built-in monitoring/alerts give great visibility into pipeline health and data quality.
What do you dislike about the product?
Some advanced features have a steep learning curve, and the platform can feel resource-intensive at times. Documentation could be more detailed, and faster response from support would make the experience even better.
What problems is the product solving and how is that benefiting you?
IBM StreamSets helps us handle complex data integration across multiple sources and destinations in real time. It solves the challenge of building scalable, automated pipelines without heavy coding, while ensuring data quality and observability. This has reduced manual effort, improved reliability, and allowed faster decision-making based on up-to-date data.
Overall good experience, I like the ease of using it.
What do you like best about the product?
I like IBM StreamSets ease of use and Customer Support Team.
What do you dislike about the product?
Almost everything is good. Number of interactive features can be improved.
What problems is the product solving and how is that benefiting you?
Currently using it for Data Extraction.
Streamlining Data Pipelines with ease
What do you like best about the product?
I really like how user friendly IBM StreamSets is, especially the drag and drop interface for designing data pipelines. It makes the process much easier without needing to write complex code. The platform supports both real-time and batch processing, and it has a wide range of connectors, which helped me integrate different data sources without much hassle. I also appreciated the built-in monitoring tools that helped me keep an eye on data flows and troubleshoot issues quickly.
What do you dislike about the product?
One downside I experienced was performance lag when handling large volumes of data it wasn’t always as fast as I needed. The error logs were sometimes difficult to interpret, especially for more complex issues. Also, while basic tasks were easy to manage, getting into advanced configurations took more time than I expected, and the documentation didn’t always provide clear guidance. Support response times could also be slow when I needed urgent help.
What problems is the product solving and how is that benefiting you?
IBM StreamSets is solving the challenge of building and managing complex ETL workflows in a fast-changing data environment. It helps me extract data from various sources, transform it on the fly, and load it into target systems all while handling schema changes and data drift automatically. This has been a huge benefit for me because I no longer have to manually adjust pipelines when source formats change. It also supports real-time stream analytics, so I can process and analyze data as it flows in, which improves decision making speed and keeps my data infrastructure responsive and up to date.
Good Product
What do you like best about the product?
I like the ease of use of this tool. Customer support is ok.
What do you dislike about the product?
Number of features can be improved upon.
What problems is the product solving and how is that benefiting you?
For ETL tools, it is easy to use and implement
Powerful and Flexible ETL solution with IBM StreamSets
What do you like best about the product?
I like IBM StreamSets for its easy-to-use visual interface, real-time data handling, and strong integration with various cloud and on-premise systems.
What do you dislike about the product?
While IBM StreamSets is powerful, it can sometimes be complex to troubleshoot issues in large pipelines, and performance tuning may require additional effort for very high-volume data loads.
What problems is the product solving and how is that benefiting you?
IBM StreamSets solves the challenge of building, managing, and scaling complex data pipelines by providing real-time data integration and smart handling of data changes. It benefits me by simplifying pipeline development, reducing maintenance efforts, and enabling faster, more reliable data delivery across systems.
streaming data pipelines through GUI is great
What do you like best about the product?
I like how it makes easy in the use-cases of AI, where you can do the continuous training process.
What do you dislike about the product?
I don't fee that there are any such. Have to use in-order to know.
What problems is the product solving and how is that benefiting you?
Training AI models.
Efficient Data Pipeline Tool with Some Limitations
What do you like best about the product?
The best thing is how simple it is to use. You don’t need to write much code and the drag and drop makes things fast. It connects to lots of sources which is helpful. Also the monitoring tools are good and helps when things go wrong.
What do you dislike about the product?
It can get slow when dealing with big amount of data or when you add many steps. The docs are sometimes confusing or missing stuff. Support takes time to respond sometimes and the price is a bit much for smaller teams.
What problems is the product solving and how is that benefiting you?
We needed a way to move and process data between different systems without building everything from scratch. StreamSets made it easier to connect data sources and automate the flows. It saves us a lot of time and helps catch issues early with built-in alerts, so we don’t have to monitor everything manually all the time. It also helps to scale things easier when data volume grows.
Enables effective batch loading with visual interface and enterprise support
What is our primary use case?
We are using StreamSets for batch loading.
What is most valuable?
StreamSets is GUI-based and takes care of load balancing. It allows a hybrid installation approach, rather than being completely cloud-based or on-premises. Additionally, StreamSets provides good enterprise support with a quick turnaround.
What needs improvement?
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure. I had to switch to a new EC2 box, even though the processor was not fully utilized. It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades. Additionally, it would be a great enhancement if StreamSets could produce a lineage graph to visualize how the data has passed through the system.
For how long have I used the solution?
I started using StreamSets in 2022, so it's been almost four years now.
What do I think about the stability of the solution?
From one to ten, I would rate the stability of the product at eight point five.
What do I think about the scalability of the solution?
For scalability, I would also rate it at eight point five.
How are customer service and support?
IBM technical support sometimes transfers tickets between different teams due to shift changes, which can be frustrating. The transition can make resolution slow, as I have to explain the issue multiple times. Overall, I would rate the technical support as eight out of ten.
How would you rate customer service and support?
How was the initial setup?
The initial setup of StreamSets isn't simple, but it's not too complex either. It’s a standard setup and is fine.
Which other solutions did I evaluate?
StreamSets is the leader in the market. There are many products, and the choice depends on needed features and use cases, but I view StreamSets as the leader due to its capabilities.
What other advice do I have?
If asked, I definitely recommend StreamSets to other users. My overall rating for the solution is nine.
Which deployment model are you using for this solution?
Hybrid Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Good integration tool
What do you like best about the product?
Streamsets is a good and lightweight integration tool with good ease of integration. It's fast and reliable. It has a decent library of connectors which are easy to use. I have been using streamsets for a year now and recently switched to data bricks. Customer support turnaround is decent. Ease of implementation is not that good as the learning curve is high without a good resource to study from
What do you dislike about the product?
lack of documentation and community support
What problems is the product solving and how is that benefiting you?
Stream sets have a good amount of pre-built connectors which accelerates the speed of data ingestion
StreamSets : Review
What do you like best about the product?
I love using streamsets because it helps in moving data and make necessary transformations to the data by using processors.
What do you dislike about the product?
As of now I haven't faced any issues for this.
What problems is the product solving and how is that benefiting you?
StreamSets helps me in transforming the data and move it from one place to another with ease.