Listing Thumbnail

    AssemblyAI

     Info
    Sold by: AssemblyAI 
    Deployed on AWS
    Free Trial
    AssemblyAI builds AI systems that can understand human speech with superhuman abilities. Starting building with $50 in usage credits during your 90-day free trial. Cancel any time. After your trial ends, you will automatically be enrolled into an Assembly AI pay-as-you-go plan. Request a private offer for discounted pricing based on your usage profile.

    Overview

    Play video

    AssemblyAI offers Speech AI models via an API that product teams and developers can use to build powerful AI solutions based on voice data. Thousands of developers build on AssemblyAI's Speech AI models every day to run Speech-to-Text on multilingual speech, and harness the power of Large Language Models to extract the full value from that voice data - including answering questions from voice data, generating content, and extracting metadata in seconds. AssemblyAI offers async transcription, with most audio files completing in well under 45 seconds regardless of audio duration, as well as real-time transcription with high accuracy and <600 ms of latency.

    AssemblyAI gives you access to state-of-the-art Speech AI models and capabilities for real-world use cases, so you can build smarter applications in a fraction of the time. Models and features include:

    - Speech recognition
    - Speaker diarization
    - Auto punctuation and casing
    - Auto language detection
    - Summarization
    - Content moderation
    - Sentiment analysis
    - Auto highlights
    - PII redaction
    - Topic detection (IAB classification)
    - Entity detection
    - Auto chapters
    - Custom spelling
    - Custom vocabulary
    - Dual channel transcription
    - Export SRT or VTT caption files
    - Filler word filtering
    - Profanity filtering

    In addition, LeMUR, which allows users to leverage the capabilities of Large Language Models, can quickly process audio transcripts for single or multiple audio files for tasks like summarization, question & answer, and AI coaching feedback.

    Our Speech AI products support 33 different audio and video file types and 99+ languages. Our models are used by thousands of breakthrough startups and dozens of global enterprises for mission-critical workloads.
    .

    Highlights

    • Unparalleled Human-Level Accuracy: Our multilingual speech recognition AI models deliver industry-leading performance with the lowest word error rates on the market, outperforming competitors by over 60% when recognizing challenging content like rare words and proper nouns. Trusted by more than 3,000 innovative companies, including Zoom, our platform provides the foundation for mission-critical speech applications at scale.
    • Built for enterprise-grade performance, our APIs deliver unmatched scalability for high-concurrency applications. Security is embedded with SOC 2 Type 2, PCI DSS, and GDPR compliance. For healthcare applications, AssemblyAI offers Business Associate Agreements (BAAs). Choose flexible hosting options in both US and EU regions.
    • Comprehensive Audio Intelligence Suite: Our advanced models summarize conversations, identify speakers through diarization, analyze sentiment, moderate content, automatically redact PII, and much more, all in a single platform. Our LeMUR framework seamlessly connects spoken data with large language models, enabling unlimited possibilities for voice-powered applications.

    Details

    Delivery method

    Deployed on AWS

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Free trial

    Try this product free according to the free trial terms set by the vendor.
    Pricing is based on the duration and terms of your contract with the vendor, and additional usage. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing is in effect for overages or additional usage not covered in the contract. These charges are applied on top of the contract price. If you choose not to renew or replace your contract before the contract end date, access to your entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    1-month contract (6)

     Info
    Dimension
    Description
    Cost/month
    Pay As You Go
    State-of-the-art production-ready AI models
    $0.00
    Slam_1_STT
    Slam-1 speech-to-text (core)
    $0.37
    haiku3_5_input
    Claude 3.5 Haiku 1k token input (LeMur)
    $0.001
    haiku3_5_output
    Claude 3.5 Haiku 1k token output (LeMur)
    $0.004
    sonnet3_7_input
    Claude 3.7 Sonnet 1k token input (LeMur)
    $0.003
    sonnet3_7_output
    Claude 3.7 Sonnet 1k token output (LeMur)
    $0.015

    Additional usage costs (20)

     Info

    The following dimensions are not included in the contract terms, which will be charged based on your usage.

    Dimension
    Cost/unit
    Async Transcription (core)
    $0.37
    Nano Speech-to-Text (core)
    $0.12
    Real-Time Transcription (core)
    $0.47
    Auto Chapters (Audio Intelligence)
    $0.08
    Content Moderation (Audio Intelligence)
    $0.15
    Entity Detection (Audio Intelligence)
    $0.08
    Key Phrases (Auto Highlights)
    $0.01
    PII Redaction (Audio Intelligence)
    $0.08
    PII Audio Redaction (Audio Intelligence)
    $0.05
    Sentiment Analysis (Audio Intelligence)
    $0.02

    Vendor refund policy

    All fees are non-refundable and non-cancellable except as required by law.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Resources

    Support

    Vendor support

    Support is available via chat and email 24/7. support@assemblyai.com 

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Customer reviews

    Ratings and reviews

     Info
    0 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    0%
    0%
    0%
    0%
    0%
    0 AWS reviews
    |
    53 external reviews
    External reviews are sourced from G2  and are not included in the star rating for this product.
    Francesco M.

    Using AssemblyAI to get podcast episodes transcripts

    Reviewed on May 20, 2025
    Review provided by G2
    What do you like best about the product?
    I use AssemblyAI to get transcripts of my podcast episodes, and the accuracy is pretty good.

    The timestamp associated with each word allow us to easily make a connection with the podcast audio and jump right where we need.

    Customer support has been great.
    What do you dislike about the product?
    Nothing to complain.
    Sometimes it's a bit tricky when the podcaster say the spelling of the promo code he uses.

    For example, if the promocode is SUMMER. I may get S-U-M-M-E-R, which is not easy to work with. But I it's an edge case.
    What problems is the product solving and how is that benefiting you?
    Get the podcast episodes transcript, associating each word with a timestamp.
    Give lot of insight to what podcasters are saying and how are promoting our promo codes
    Timur M.

    a great solution to build into your product

    Reviewed on May 20, 2025
    Review provided by G2
    What do you like best about the product?
    We recently started using the AssaemblyAI api to transcribe videos from our educational channels. The API works quickly and reliably. So far we have never encountered any limitations of the platform, although our videos are quite large. The quality of recognition is very high, the price is about the same as with OpenAI analogs, but there is no limit of 25 minutes per video fragment.
    What do you dislike about the product?
    I wish the price was even lower, we have so many more videos to process. Also it is not quite clear how formatting into paragraphs works, according to the api we get exactly the text without paragraphs, although in the version available for free through the interface, the recognized text is already formatted
    What problems is the product solving and how is that benefiting you?
    We are using the AssaemblyAI api to transcribe videos from our educational channels to build RAG system
    Professional Training & Coaching

    AssemblyAI does a good job with transcription

    Reviewed on May 19, 2025
    Review provided by G2
    What do you like best about the product?
    I am a college professor and used the AssemblyAI speech-to-text API for a qualitative research project. I had hundreds of interviews in a non-English language that needed to be transcribed. AssemblyAI solved my problem fairly easily. It cost me a lot less than what I anticipated. The transcriptions were very accurate and included the special characters in that language.
    There are plenty of videos and sources for those who are new to AssemblyAI.
    I could not be happier.
    What do you dislike about the product?
    I did not have anything specific that I disliked about AssemblyAI.
    What problems is the product solving and how is that benefiting you?
    AssemblyAI transcribes audio files fairly easily. As a researcher that uses interviews as data, it makes my job a lot easier.
    Research

    Easy to use speech to text solution

    Reviewed on May 19, 2025
    Review provided by G2
    What do you like best about the product?
    Their documentation was very easy to work with. I was able to hit the ground running within minutes.
    What do you dislike about the product?
    Faster transcription would be appreciated.
    What problems is the product solving and how is that benefiting you?
    Transcribing meeting transcripts that we’ll later to capture insights.
    Rodrigo F.

    Best Speech-to-Text Service Overall

    Reviewed on May 19, 2025
    Review provided by G2
    What do you like best about the product?
    AssemblyAI is seriously impressive. Before I found it, I tried out Google Cloud, Whisper, and some open-source tools for diarization. I even gave Read.ai a shot, but honestly, none of them gave me the results I was looking for.

    Then I saw someone mention AssemblyAI on Reddit, and I decided to give it a try. I’m so glad I did—their transcription and diarization are on another level. I barely ever need to edit the transcripts, which is rare with these kinds of tools.

    The pricing is super reasonable for what you get, and the API is really flexible. I’ve been able to build my own workflows to transcribe meetings, interviews, and videos without any hassle. I use it pretty much every day for transcribing meetings I record on my computer, and I save everything in Markdown format.

    If you’re looking for a solid, reliable transcription service that just works, I can’t recommend AssemblyAI enough.
    What do you dislike about the product?
    It's not that I don't like but I think there is high bareer for non-techs to access the serviece. I know tht they ahve a playground, but it's still scary for peop,e who want to use the service but see the. Some friends who see my workflow wants to mimic but stop when they see the api nterface. The docs are very well detailed, but there are barreres for adoption for certain customer segments still.

    Another thing that I would like would to store the cluster of voicers that are recorded I would like the odel to automatically name them. I think this would be too complicated and probably there's privacy concerns involved. But it would be a quality of life approach. But I guess this is a niche need instead of something the custoemr base would be interested at
    What problems is the product solving and how is that benefiting you?
    AssemblyAI is solving the problem of turning audio into accurate, structured text—especially with speaker diarization and high transcription quality. It saves me a huge amount of time. I use it to transcribe meetings, interviews, and video content recorded locally on my computer, and the results are so good I rarely need to edit them. Having access to a reliable API also means I can fully automate my workflow and store the transcripts in Markdown, exactly the way I need. It’s made transcription effortless and consistent, which is a big deal for someone who works with audio content daily.
    View all reviews