
Reviews from AWS customers
0 AWS reviews
5 star: 0
4 star: 0
3 star: 0
2 star: 0
1 star: 0
External reviews
70 reviews
External reviews are not included in the AWS star rating for the product.
Accurate and reliable
What do you like best about the product?
Accurate transcription, reliable service, and great prices. It is easy to integrate, easy to use, and full of valuable insights for your audio.
What do you dislike about the product?
It only supports EU and US data residency. Regional self deployments would be great.
Moreover, for companies that deal with both text and audio data, it would be useful to have the same PII redaction and insights for both data types. AssemblyAI only accepts audio inputs, which forces us either to replicate their PII redaction on text data through other means or to skip their PII redaction and insights for the sake of uniformity.
What problems is the product solving and how is that benefiting you?
Transcription of calls
AssemblyAI: accurate transcriptions, simple API to integrate, advanced features, fast and effective
What do you like best about the product?
AssemblyAI is one of the best choices for automatically transcribing and analyzing audio. It is very accurate, fast, and easy to use. It has many features and is perfect for developers, tech companies, and anyone who wants to manage large amounts of voice data automatically. With the API system, you can create your own software and customize it as you wish. I use the APIs with my own program in Python.
Strengths
Accuracy: among the best accuracy rates in the industry, with a very low Word Error Rate (WER) and consistent performance even on complex audio.
Speed: asynchronous transcription in less than 45 seconds and real-time with latency under 600 ms.
Developer experience: well-documented API, easy to integrate, with practical examples and effective technical support.
Versatility: suitable for both simple use cases (webinar transcription, meetings, podcasts) and complex workflows (sentiment analysis, entity extraction, content moderation).
Accessibility: competitive pay-as-you-go pricing, with no hidden costs.
What do you dislike about the product?
I can't say I've found any problems in the system. Excellent and reliable. The best.
What problems is the product solving and how is that benefiting you?
Audio transcriptions
Excellent support. Low cost.
What do you like best about the product?
Excellent documentation and responsive support that will help you resolve any issues with using the API.
Multiple language support and automatic detection. The ability to upload files directly to their server, which makes it faster than saving them to third-party services.
You pay for usage instead of a subscription, which is very nice.
What do you dislike about the product?
During my time using the service, I haven't found much that I dislike. My main issue is that I would like to see support for video files from services such as YouTube directly via a link. Currently, I have to use third-party services to download and process videos from YouTube before sending them to AssemblyAI.
What problems is the product solving and how is that benefiting you?
I am a mobile and web application developer.
My applications are based on converting video or audio files into text. Therefore, AssemblyAI fully covers all the functionality of my applications.
Great tool, better than most available out there.
What do you like best about the product?
The ability to identify speakers and get a detailed, timestamp-based division of the transcript.
What do you dislike about the product?
The segmentation of dialogues is a little rusty with associating the right speaker.
What problems is the product solving and how is that benefiting you?
Helps with my translation workflow.
Easy and accurate way to implement transcription in your software
What do you like best about the product?
It’s easy to implement and delivers great value for money. The way the API is designed is excellent!
What do you dislike about the product?
I think the delay with some large transcriptions is a downside, but so far it hasn’t affected my users much.
What problems is the product solving and how is that benefiting you?
I need meetings recorded and documented so that I can identify the key points of our projects.
Excellent advanced tool, but simple to use
What do you like best about the product?
...it is a very comprehensive tool that allows not only for a very accurate transcription of the text, but also includes punctuation. It also has various features such as speaker recognition. Very satisfied.
What do you dislike about the product?
I have nothing to report that I don't like at the moment.
What problems is the product solving and how is that benefiting you?
I have built one automation that transcribes my voice notes to create tasks in my to-do list, and another that transcribes the audio from a Telegram chat with a client and records it in the project notes on Trello.
Affordable and Easy-to-Integrate Transcription Service
What do you like best about the product?
I'm impressed with AssemblyAI's transcription service due to its reasonable pricing. For transcribing 243 hours of audio, I paid only $68. In comparison, Google's Chirp_2 model cost $47 for just 35 hours, which would have totaled $326 for the same 243 hours.
Additional benefits include the ability to separate text by different speakers (English only) and automatic language detection. The API is straightforward to use and was easy to integrate into both Flutter and .NET Core Web applications.
Overall, I'm satisfied with the service and plan to continue using it.
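The price comparison above can be checked with a few lines of arithmetic. This is only a sketch that uses the figures quoted in the review; the helper name `hourly_rate` is made up for illustration, and actual pricing may differ.

```python
# Figures quoted in the review above (not official price lists).
assemblyai_cost, assemblyai_hours = 68.0, 243.0
chirp_cost, chirp_hours = 47.0, 35.0


def hourly_rate(total_cost, hours):
    """Return the average cost per hour of audio (hypothetical helper)."""
    return total_cost / hours


assemblyai_rate = hourly_rate(assemblyai_cost, assemblyai_hours)
chirp_rate = hourly_rate(chirp_cost, chirp_hours)

# Project the Chirp_2 rate onto the same 243 hours of audio.
chirp_projected = chirp_rate * assemblyai_hours

print(f"AssemblyAI: ${assemblyai_rate:.2f} per hour")
print(f"Chirp_2: ${chirp_rate:.2f} per hour, about ${chirp_projected:.0f} for 243 hours")
```

At these figures, AssemblyAI works out to roughly $0.28 per hour of audio against roughly $1.34 per hour for Chirp_2.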
What do you dislike about the product?
There are some aspects I'd like to see improved. The API response contains many fields that I don't need, which increases loading times. I would also appreciate faster speech-to-text processing and an increase in the maximum duration limit beyond the current 10-hour restriction. Additionally, the slam-1 model only works with English, and I would like to see it internationalized to support multiple languages.
What problems is the product solving and how is that benefiting you?
AssemblyAI enables me to efficiently convert large volumes of audio data into text, which is highly beneficial for both educational purposes and note-taking.
Powerful and Accurate Speech-to-Text API
What do you like best about the product?
I love how AssemblyAI delivers outstanding transcription accuracy even on noisy or low-quality audio. The SDKs, documentation and code samples made integration into our codebase very easy and almost instantaneous. On top of all that, features like custom vocabulary tuning, topic detection, and sentiment analysis mean I can rely on a single platform for everything from basic transcripts to deep audio insights.
What do you dislike about the product?
Occasionally the API struggles with heavy accents or extremely fast speech, leading to minor mis-transcriptions that require manual correction.
What problems is the product solving and how is that benefiting you?
AssemblyAI makes it possible to transcribe audio that other services struggle with, such as recordings with heavy accents, background noise or low volume. By delivering clean, accurate text from these challenging files, it eliminates the need for manual corrections and keeps our workflows moving smoothly.
High-quality speech recognition with robust diarization and smart API design
What do you like best about the product?
AssemblyAI impresses with its high transcription quality, even when dealing with messy or low-quality audio inputs. The diarization capabilities are particularly strong—accurately distinguishing between speakers in less-than-perfect recordings. The API suite is fast, well-documented, and returns a rich, detailed output format that makes post-processing straightforward and powerful. I also found the Word Boost feature especially helpful: being able to prioritize tricky or uncommon words significantly improves recognition accuracy in niche use cases. Overall, it’s a developer-friendly platform that balances precision with flexibility.
What do you dislike about the product?
Honestly, there’s little to complain about. The pricing model is reasonable for the level of quality and features provided, and I haven’t encountered any significant drawbacks in my usage.
What problems is the product solving and how is that benefiting you?
Transcription and diarization of complex audio
Great transcription for Spanish, quicker than other providers
What do you like best about the product?
It's really great for Spanish specifically and for speaker diarization. It's also quick compared to the Speechmatics API, which is really slow, so kudos on that, and it's been really cost-effective. I must have transcribed 800-1000 calls with the free credits, so that's really great. Overall, super solid.
What do you dislike about the product?
I think the worst part about AssemblyAI has been that the API itself is a bit complicated to work with: with recordings, you've got to turn them into links first and then send the links and transcript IDs to a separate endpoint. I can still work with it and have done lots of things, but it would be easier if there were a single API call for recordings that did this in the background.
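The multi-step flow described above (upload the recording, submit the resulting link, then fetch the transcript by ID) can be wrapped in one function. The following is a minimal sketch, not the official SDK: the base URL, endpoint paths, header name, and JSON field names are assumptions based on the public REST API and should be checked against the current documentation, and `http_json` and `transcribe_recording` are hypothetical helper names.

```python
import json
import time
import urllib.request

BASE_URL = "https://api.assemblyai.com/v2"  # assumed base URL


def http_json(method, url, api_key, body=None, raw=False):
    """Send one request with urllib and decode the JSON reply."""
    headers = {"authorization": api_key}
    if body is None or raw:
        data = body  # no body, or raw audio bytes for the upload
    else:
        data = json.dumps(body).encode("utf-8")
        headers["content-type"] = "application/json"
    request = urllib.request.Request(url, data=data, method=method, headers=headers)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


def transcribe_recording(audio_bytes, api_key, send=http_json, poll_seconds=3.0):
    """Upload a recording, request a transcript, and poll until it finishes."""
    # Step 1: turn the recording into a link (assumed field: upload_url).
    upload = send("POST", f"{BASE_URL}/upload", api_key, body=audio_bytes, raw=True)
    # Step 2: submit the link and keep the transcript ID (assumed field: id).
    job = send("POST", f"{BASE_URL}/transcript", api_key,
               body={"audio_url": upload["upload_url"]})
    # Step 3: poll the transcript by ID until it completes or fails.
    while True:
        result = send("GET", f"{BASE_URL}/transcript/{job['id']}", api_key)
        if result["status"] == "completed":
            return result["text"]
        if result["status"] == "error":
            raise RuntimeError(result.get("error", "transcription failed"))
        time.sleep(poll_seconds)
```

A caller would then need only `text = transcribe_recording(open("call.mp3", "rb").read(), api_key)`. The `send` parameter exists so the network layer can be swapped out, for example with a stub in tests.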
What problems is the product solving and how is that benefiting you?
It is the only API we've found that reliably transcribes some of our lower-quality and foreign-accent calls in Spanish with correct diarization. We haven't found another API that did this well after trying most of the popular APIs (e.g. Deepgram, Speechmatics).