Listing Thumbnail

    Not Diamond

     Info
    Not Diamond is an AI model router that automatically determines which LLM is best-suited to respond to any query, improving LLM output quality by combining multiple LLMs into a meta-model that learns when to call each LLM.

    Overview

    Not Diamond intelligently identifies which LLM is best-suited to respond to any given query by combining multiple LLMs into a meta-model that learns when to call each LLM.

    Key features

    • Maximize output quality: Not Diamond outperforms every foundation model on major evaluation benchmarks by always calling the best model for every prompt.
    • Reduce cost and latency: Make intelligent cost and latency tradeoffs to efficiently leverage smaller and cheaper models without degrading quality.
    • Personalized routing with feedback: Hyper-personalize routing to each individual end user in real-time based on their feedback.
    • Train your own custom router: Leverage your evaluation data to train your own custom routers optimized to your use case.
    • Not a proxy: Receive recommendations for which LLM to use and then make your LLM requests client-side in whatever way you choose.
    • Python, TypeScript, and REST API support: Easily integrate Not Diamond across a variety of stacks.

    Getting started

    Making your first API request with Not Diamond takes less than 5 minutes. To get started:

    Alternatively, you can try chatting with our Not Diamond-powered chatbot (https://chat.notdiamond.ai ) to see what routing feels like as an end-user. We also have a Not Diamond-powered RAG app (https://rag.notdiamond.ai/ ) that you can use to ask any questions you have about Not Diamond.

    Highlights

    • Maximize output quality: Not Diamond outperforms every foundation model on major evaluation benchmarks by always calling the best model for every prompt. You can also leverage your own evaluation data to train your own custom routers optimized to your use case.
    • Reduce cost and latency: Make intelligent cost and latency tradeoffs to efficiently leverage smaller and cheaper models without degrading quality.
    • Not a proxy: Receive recommendations for which LLM to use and then make your LLM requests client-side using your preferred method, such as AWS Bedrock.

    Details

    Delivery method

    Deployed on AWS

    Unlock automation with AI agent solutions

    Fast-track AI initiatives with agents, tools, and solutions from AWS Partners.
    AI Agents

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Pricing is based on the duration and terms of your contract with the vendor, and additional usage. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing is in effect for overages or additional usage not covered in the contract. These charges are applied on top of the contract price. If you choose not to renew or replace your contract before the contract end date, access to your entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    1-month contract (3)

     Info
    Dimension
    Description
    Cost/month
    Overage cost
    Discovery
    Free up to 100k monthly API routing requests. Train one custom router. Intelligent cost and latency tradeoffs. Joint prompt optimization support. Fallback rerouting.
    $0.00
    -
    Possibility
    Everything in Discovery plus $0.001 per API routing request after the first 100K free. Uncapped API routing requests. Unlimited custom routers. Enhanced data privacy with fuzzy hashing.
    $100.00
    Necessity
    Everything in Possibility. VPC deployments. Custom integration and router training support. Access and permissions management. We will reach out to set this up for you.
    $0.00
    -

    Vendor refund policy

    All Orders are non-cancellable and all fees and other amounts you pay under this Agreement are non-refundable.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Similar products

    Customer reviews

    Ratings and reviews

     Info
    0 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    0%
    0%
    0%
    0%
    0%
    0 AWS reviews
    No customer reviews yet
    Be the first to review this product . We've partnered with PeerSpot to gather customer feedback. You can share your experience by writing or recording a review, or scheduling a call with a PeerSpot analyst.