Listing Thumbnail

    MiniMax AI

     Info
    Sold by: MiniMax 
    MiniMax Speech 02 is an advanced AI speech model capable of voice cloning and voice synthesis with high fidelity. It can replicate a speaker unique timbre and generate natural, expressive speech across different languages and styles. Ranked #1 on the Artificial Analyze Text to Speech leaderboard, Speech 02 is designed for audio production, virtual assistants, call center and content creation, delivering realistic, customizable voice experiences at scale.

    Overview

    MiniMax Speech model is a cutting-edge AI speech model that excels in voice cloning and text-to-speech (TTS) synthesis.

    Ranked #1 on the Artificial Analyze Text-to-Speech leaderboard, delivering industry-leading speech quality and intelligibility. https://artificialanalysis.ai/text-to-speech/arena?tab=leaderboard 

    Key Capabilities

    Advanced Voice Cloning: Accurately replicates unique timbre, intonation, and speaking style

    High-Quality Output: Produces natural-sounding speech with exceptional fidelity

    Multilingual Support:

    • Speech-02 model supports 30+ languages with diverse accents and emotional expressions
    • Speech-2.5 model support 50+ language with diverse accents and emotional expressions

    Versatile Styles: Seamlessly switches between formal, casual, and expressive tones Core Innovation

    Our breakthrough Intrinsic Zero Shot Text-to-Speech with Learnable Speaker Encoder enables:

    Seamless cooperation between voice style and content generation Virtually unlimited combinations of language, accent, and voice Enhanced synthesis quality through unified AR Transformer architecture

    Applications

    Perfect for audio production, virtual assistants, call centers, content creation, and media localization. Generate customizable voice content at scale while reducing production costs.

    Resources

    Technical Report: https://minimax-ai.github.io/tts_tech_report/ 

    Experience the Technology: https://www.minimax.io/audio 

    Highlights

    • Ranked #1 on the Artificial Analyze Text-to-Speech leaderboard, delivering industry-leading speech quality and intelligibility.
    • High-fidelity voice cloning and text-to-speech synthesis capable of replicating unique timbres and expressive speech.
    • Supports multi-style, customizable voice generation for applications in audio production, virtual assistants, call centers, and content creation.

    Details

    Delivery method

    Deployed on AWS

    Unlock automation with AI agent solutions

    Fast-track AI initiatives with agents, tools, and solutions from AWS Partners.
    AI Agents

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Pricing is based on the duration and terms of your contract with the vendor. This entitles you to a specified quantity of use for the contract duration. If you choose not to renew or replace your contract before it ends, access to these entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    1-month contract (6)

     Info
    Dimension
    Description
    Cost/month
    Starter
    The MiniMax speech-01/02 series supports up to 10 requests per min (RPM = 10). Each account includes 100,000 TTS character credits for speech synthesis and allows up to 10 voice slots for different voice profiles or styles. This setup is ideal for small to medium-scale speech generation, providing stable performance and flexible voice customization.
    $5.00
    Standard
    The MiniMax speech-01/02/2.5 series supports up to 50 requests per minute (RPM = 50). Each account includes 300,000 TTS character credits for speech synthesis and provides up to 100 voice slots to manage different voice profiles or styles. This configuration is suitable for medium to large-scale speech generation, offering higher throughput and greater flexibility for voice customization.
    $30.00
    Pro
    The MiniMax speech-01/02/2.5 series supports up to 200 requests per minute (RPM = 200). Each account includes 1,100,000 TTS character credits for speech synthesis and offers up to 250 voice slots for managing diverse voice profiles and styles. This configuration is designed for large-scale or high-demand speech generation, delivering high throughput and extensive flexibility for advanced voice customization.
    $99.00
    Scale
    The MiniMax speech-01/02/2.5 series supports up to 500 requests per minute (RPM = 500). Each account includes 3,300,000 TTS character credits for speech synthesis and provides up to 500 voice slots for managing various voice profiles and styles. This configuration is optimized for enterprise-level or large-scale speech generation, offering very high throughput and extensive flexibility for complex voice customization needs.
    $249.00
    Business
    The MiniMax speech-01/02/2.5 series under the Business plan supports up to 800 requests per minute (RPM = 800). Each account includes 20,000,000 TTS character credits for speech synthesis and allows up to 800 voice slots for managing a wide range of custom voice profiles and styles. This configuration is built for enterprise-scale and high-volume applications, delivering exceptional throughput, stability, and flexibility for advanced voice generation needs.
    $999.00
    Customer Pricing
    This plan offers priority access to model updates, unlimited requests per minute (RPM), and exclusive guarantees for security and stability. Users also gain more voice options and enhanced voice cloning capabilities, making it ideal for enterprises or customers with the highest demands for scalability, customization, and reliability.
    $0.01

    Vendor refund policy

    Not currently supported.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    Email:api@hailuoai.com  Description: Customers receive 24x7 email support with a guaranteed response within 24 hours. Technical assistance includes setup guidance, troubleshooting, and usage tips for MiniMax Speech-02.

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Customer reviews

    Ratings and reviews

     Info
    0 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    0%
    0%
    0%
    0%
    0%
    0 AWS reviews
    No customer reviews yet
    Be the first to review this product . We've partnered with PeerSpot to gather customer feedback. You can share your experience by writing or recording a review, or scheduling a call with a PeerSpot analyst.