Overview
Not Diamond intelligently identifies which LLM is best-suited to respond to any given query by combining multiple LLMs into a meta-model that learns when to call each LLM.
Key features
- Maximize output quality: Not Diamond outperforms every foundation model on major evaluation benchmarks by always calling the best model for every prompt.
- Reduce cost and latency: Make intelligent cost and latency tradeoffs to efficiently leverage smaller and cheaper models without degrading quality.
- Personalized routing with feedback: Hyper-personalize routing to each individual end user in real-time based on their feedback.
- Train your own custom router: Leverage your evaluation data to train your own custom routers optimized to your use case.
- Not a proxy: Receive recommendations for which LLM to use and then make your LLM requests client-side in whatever way you choose.
- Python, TypeScript, and REST API support: Easily integrate Not Diamond across a variety of stacks.
Getting started
Making your first API request with Not Diamond takes less than 5 minutes. To get started:
- Create an account at app.notdiamond.ai
- Create a Not Diamond API key (https://app.notdiamond.ai/keys )
- Jump into the quickstart example (https://docs.notdiamond.ai/docs/quickstart )
Alternatively, you can try chatting with our Not Diamond-powered chatbot (https://chat.notdiamond.ai ) to see what routing feels like as an end-user. We also have a Not Diamond-powered RAG app (https://rag.notdiamond.ai/ ) that you can use to ask any questions you have about Not Diamond.
Highlights
- Maximize output quality: Not Diamond outperforms every foundation model on major evaluation benchmarks by always calling the best model for every prompt. You can also leverage your own evaluation data to train your own custom routers optimized to your use case.
- Reduce cost and latency: Make intelligent cost and latency tradeoffs to efficiently leverage smaller and cheaper models without degrading quality.
- Not a proxy: Receive recommendations for which LLM to use and then make your LLM requests client-side using your preferred method, such as AWS Bedrock.
Details
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/month | Overage cost |
---|---|---|---|
Discovery | Free up to 100k monthly API routing requests. Train one custom router. Intelligent cost and latency tradeoffs. Joint prompt optimization support. Fallback rerouting. | $0.00 | - |
Possibility | Everything in Discovery plus $0.001 per API routing request after the first 100K free. Uncapped API routing requests. Unlimited custom routers. Enhanced data privacy with fuzzy hashing. | $100.00 | |
Necessity | Everything in Possibility. VPC deployments. Custom integration and router training support. Access and permissions management. We will reach out to set this up for you. | $0.00 | - |
Vendor refund policy
All Orders are non-cancellable and all fees and other amounts you pay under this Agreement are non-refundable.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products


