AssemblyAI

AssemblyAI builds AI systems that can understand human speech with superhuman abilities. Start building with $50 in usage credits during your 90-day free trial. Cancel any time. After your trial ends, you will automatically be enrolled in an AssemblyAI pay-as-you-go plan. Request a private offer for discounted pricing based on your usage profile.

0 AWS reviews

Overview

AssemblyAI offers Speech AI models via an API that product teams and developers can use to build powerful AI solutions based on voice data. Thousands of developers build on AssemblyAI's Speech AI models every day to run Speech-to-Text on multilingual speech, and harness the power of Large Language Models to extract the full value from that voice data - including answering questions from voice data, generating content, and extracting metadata in seconds. AssemblyAI offers async transcription, with most audio files completing in well under 45 seconds regardless of audio duration, as well as real-time transcription with high accuracy and <600 ms of latency.
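
The async flow described above (submit a file, then wait for it to complete) can be consumed by polling. The following is a minimal Python sketch of such a polling loop, not AssemblyAI's own client. It makes two assumptions that this listing does not confirm: `fetch` is a hypothetical callable standing in for whatever HTTP request returns the transcript job as a dict, and the status strings `queued`, `processing`, `completed`, and `error` are taken from AssemblyAI's public documentation. Check both against the AssemblyAI Docs before relying on them.

```python
import time
from typing import Callable, Dict

# Assumed terminal status strings; verify against the vendor documentation.
TERMINAL_STATUSES = {"completed", "error"}


def wait_for_transcript(
    fetch: Callable[[], Dict],
    poll_interval_s: float = 3.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict:
    """Call `fetch` until the job reaches a terminal status or the timeout elapses.

    `fetch` is a hypothetical stand-in for the HTTP call that returns the job.
    `sleep` is injectable so the loop can be exercised without real waiting.
    """
    waited = 0.0
    while True:
        job = fetch()
        if job.get("status") in TERMINAL_STATUSES:
            return job
        if waited >= timeout_s:
            raise TimeoutError(f"transcript not ready after {waited:.0f}s")
        sleep(poll_interval_s)
        waited += poll_interval_s
```

Passing the fetch step in as a callable keeps the loop independent of any particular HTTP library and lets the timeout be tuned to the expected completion time.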

AssemblyAI gives you access to state-of-the-art Speech AI models and capabilities for real-world use cases, so you can build smarter applications in a fraction of the time. Models and features include:

- Speech recognition
- Speaker diarization
- Auto punctuation and casing
- Auto language detection
- Summarization
- Content moderation
- Sentiment analysis
- Auto highlights
- PII redaction
- Topic detection (IAB classification)
- Entity detection
- Auto chapters
- Custom spelling
- Custom vocabulary
- Dual channel transcription
- Export SRT or VTT caption files
- Filler word filtering
- Profanity filtering
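
Several of the features listed above are usually switched on per request. The sketch below shows one way to build such a request body in Python. The helper `build_transcript_request` and the `FEATURE_FIELDS` mapping are hypothetical, and the field names (`speaker_labels`, `language_detection`, and so on) are assumptions based on AssemblyAI's public API documentation, not on this listing. Confirm them in the AssemblyAI Docs.

```python
from typing import Any, Dict

# Hypothetical mapping from feature names in the list above to request fields.
# The field names are assumed, not confirmed by this listing.
FEATURE_FIELDS = {
    "speaker_diarization": "speaker_labels",
    "auto_language_detection": "language_detection",
    "summarization": "summarization",
    "content_moderation": "content_safety",
    "sentiment_analysis": "sentiment_analysis",
    "auto_highlights": "auto_highlights",
    "topic_detection": "iab_categories",
    "entity_detection": "entity_detection",
    "auto_chapters": "auto_chapters",
    "profanity_filtering": "filter_profanity",
}


def build_transcript_request(audio_url: str, *features: str) -> Dict[str, Any]:
    """Return a JSON-serializable request body with the chosen features enabled."""
    unknown = [name for name in features if name not in FEATURE_FIELDS]
    if unknown:
        raise ValueError(f"unknown features: {unknown}")
    payload: Dict[str, Any] = {"audio_url": audio_url}
    for name in features:
        payload[FEATURE_FIELDS[name]] = True
    return payload
```

For example, `build_transcript_request("https://example.com/call.mp3", "speaker_diarization", "sentiment_analysis")` returns a body with `speaker_labels` and `sentiment_analysis` set to `True`. Features that need extra configuration, such as PII redaction policies or custom spelling rules, are left out of this sketch.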

In addition, LeMUR, which allows users to leverage the capabilities of Large Language Models, can quickly process audio transcripts for single or multiple audio files for tasks like summarization, question & answer, and AI coaching feedback.

Our Speech AI products support 33 different audio and video file types and 99+ languages. Our models are used by thousands of breakthrough startups and dozens of global enterprises for mission-critical workloads.

Highlights

- Unparalleled Human-Level Accuracy: Our multilingual speech recognition AI models deliver industry-leading performance with the lowest word error rates on the market, outperforming competitors by over 60% when recognizing challenging content like rare words and proper nouns. Trusted by more than 3,000 innovative companies, including Zoom, our platform provides the foundation for mission-critical speech applications at scale.
- Built for enterprise-grade performance, our APIs deliver unmatched scalability for high-concurrency applications. Security is embedded with SOC 2 Type 2, PCI DSS, and GDPR compliance. For healthcare applications, AssemblyAI offers Business Associate Agreements (BAAs). Choose flexible hosting options in both US and EU regions.
- Comprehensive Audio Intelligence Suite: Our advanced models summarize conversations, identify speakers through diarization, analyze sentiment, moderate content, automatically redact PII, and much more, all in a single platform. Our LeMUR framework seamlessly connects spoken data with large language models, enabling unlimited possibilities for voice-powered applications.

Details

Sold by

AssemblyAI

Features and programs

Financing for AWS Marketplace purchases

AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

Pricing

Free trial

Try this product free according to the free trial terms set by the vendor.

Pricing is based on the duration and terms of your contract with the vendor, and additional usage. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing is in effect for overages or additional usage not covered in the contract. These charges are applied on top of the contract price. If you choose not to renew or replace your contract before the contract end date, access to your entitlements will expire.

Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

1-month contract (6)

Dimension	Description	Cost/month
Pay As You Go	State-of-the-art production-ready AI models	$0.00
Slam_1_STT	Slam-1 speech-to-text (core)	$0.37
haiku3_5_input	Claude 3.5 Haiku 1k token input (LeMUR)	$0.001
haiku3_5_output	Claude 3.5 Haiku 1k token output (LeMUR)	$0.004
sonnet3_7_input	Claude 3.7 Sonnet 1k token input (LeMUR)	$0.003
sonnet3_7_output	Claude 3.7 Sonnet 1k token output (LeMUR)	$0.015
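
As a rough illustration of how the LeMUR token dimensions above add up, the Python sketch below estimates the cost of a single request. It assumes each rate is a USD price per 1,000 tokens, as the dimension descriptions suggest. The function name and the model keys (`haiku3_5`, `sonnet3_7`) are hypothetical labels derived from the dimension names.

```python
from decimal import Decimal

# Rates copied from the table above, read as USD per 1,000 tokens (an assumption).
RATES_PER_1K = {
    "haiku3_5": {"input": Decimal("0.001"), "output": Decimal("0.004")},
    "sonnet3_7": {"input": Decimal("0.003"), "output": Decimal("0.015")},
}


def lemur_token_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the token cost in USD for one LeMUR request."""
    rates = RATES_PER_1K[model]
    total = rates["input"] * input_tokens + rates["output"] * output_tokens
    return total / 1000
```

Under these assumptions, a request with 20,000 input tokens and 1,000 output tokens comes to $0.024 on Claude 3.5 Haiku and $0.075 on Claude 3.7 Sonnet. `Decimal` is used instead of floats so that sub-cent rates do not pick up rounding noise.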

Additional usage costs (20)

The following dimensions are not included in the contract terms and are charged based on your usage.

Dimension	Cost/unit
Async Transcription (core)	$0.37
Nano Speech-to-Text (core)	$0.12
Real-Time Transcription (core)	$0.47
Auto Chapters (Audio Intelligence)	$0.08
Content Moderation (Audio Intelligence)	$0.15
Entity Detection (Audio Intelligence)	$0.08
Key Phrases (Auto Highlights)	$0.01
PII Redaction (Audio Intelligence)	$0.08
PII Audio Redaction (Audio Intelligence)	$0.05
Sentiment Analysis (Audio Intelligence)	$0.02
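
The usage rates above can be combined into a simple estimate. The Python sketch below is illustrative only: the listing does not say what a "unit" is for each dimension, so quantities are treated as abstract units to be confirmed with the vendor, and the function name and shortened dimension keys are hypothetical.

```python
from decimal import Decimal
from typing import Mapping

# Rates copied from the table above (USD per unit).
# The listing does not define the unit, so quantities here are abstract.
USAGE_RATES = {
    "Async Transcription": Decimal("0.37"),
    "Nano Speech-to-Text": Decimal("0.12"),
    "Real-Time Transcription": Decimal("0.47"),
    "Auto Chapters": Decimal("0.08"),
    "Content Moderation": Decimal("0.15"),
    "Entity Detection": Decimal("0.08"),
    "Key Phrases": Decimal("0.01"),
    "PII Redaction": Decimal("0.08"),
    "PII Audio Redaction": Decimal("0.05"),
    "Sentiment Analysis": Decimal("0.02"),
}


def estimate_usage_cost(units_by_dimension: Mapping[str, int]) -> Decimal:
    """Sum rate times units over the dimensions used; unknown names raise KeyError."""
    total = Decimal("0")
    for dimension, units in units_by_dimension.items():
        total += USAGE_RATES[dimension] * units
    return total
```

For instance, 100 units each of Async Transcription, Sentiment Analysis, and PII Redaction would total $47.00 (37.00 + 2.00 + 8.00), before any contract entitlements or AWS infrastructure costs.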

Vendor refund policy

All fees are non-refundable and non-cancellable except as required by law.

Legal

Vendor terms and conditions

Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA).

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Usage information

Delivery details

Software as a Service (SaaS)

SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

Resources

Vendor resources

Security

Case Studies

AssemblyAI Docs

Support

Vendor support

Support is available 24/7 via chat and email: support@assemblyai.com

AWS infrastructure support

AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

Product comparison

Updated weekly

Products compared:

- AssemblyAI, by AssemblyAI
- Speech-to-Text & Text-to-Speech GenAI API, by Deepgram
- Contact Center Analytics for Amazon Connect, by SuccessKPI, Inc

Accolades

Top

In Speech to Text, Customer Support, Speech Recognition

Top

In Scheduling & Coordination, Speech Recognition, Sales & Marketing

Top

In Quality Assurance, Speech to Text

Customer reviews

Sentiment is AI generated from actual customer reviews on AWS and G2.

Sentiment categories: Reviews, Functionality, Ease of use, Customer service, Cost effectiveness

Review counts and sentiment shown:

- 85 reviews: Positive
- 2 reviews: Insufficient data
- 0 reviews: Insufficient data

Overview

AI generated from product descriptions

AssemblyAI (by AssemblyAI)

- Speech Recognition: Advanced multilingual speech recognition with high accuracy and low word error rates
- Language Processing: Support for 99+ languages with automatic language detection and custom vocabulary capabilities
- Audio Intelligence: Comprehensive suite of AI models including speaker diarization, sentiment analysis, content moderation, and PII redaction
- Large Language Model Integration: LeMUR framework for processing audio transcripts using advanced language model capabilities
- Transcription Flexibility: Support for async and real-time transcription with multiple file type compatibility across 33 audio and video formats

Speech-to-Text & Text-to-Speech GenAI API (by Deepgram)

- Speech Recognition Speed: Real-time transcription with processing speed of 20x faster than traditional methods, capable of transcribing an hour of audio in approximately 12 seconds
- Latency Performance: Ultra-low latency under 300 milliseconds for near-instantaneous speech-to-text conversion
- Accuracy Metrics: Speech recognition accuracy exceeding 90% across multiple use case categories
- Language Understanding Capabilities: Advanced natural language processing features including summarization, sentiment analysis, speaker diarization, language detection, and translation
- Model Customization: Support for customer-specific custom model training to adapt speech recognition for unique business requirements

Contact Center Analytics for Amazon Connect (by SuccessKPI, Inc)

- Analytics Platform: Pure SaaS analytics platform with real-time and historical reporting capabilities for contact centers
- AI-Powered Data Processing: Artificial intelligence-driven platform that converts call recordings to text and extracts sentiments, brands, events, and topics
- Multi-Platform Integration: Native AWS cloud application with out-of-the-box integration for multiple contact center platforms and data sources
- Advanced Speech Analytics: Comprehensive speech and text analytics with capability to blend metadata from IVR, ACD, and CRM systems
- Security Compliance: Enterprise-grade security compliance including PCI, SOC II, ISO27001, GDPR, and FedRAMP standards

Security credentials

Validated by AWS Marketplace

FedRAMP

GDPR

HIPAA

ISO/IEC 27001

PCI DSS

SOC 2 Type 2

No security profile

Contract

Standard contract

Customer reviews

Ratings and reviews

0 ratings

0 AWS reviews

No customer reviews yet

Be the first to review this product. We've partnered with PeerSpot to gather customer feedback. You can share your experience by writing or recording a review, or scheduling a call with a PeerSpot analyst.