Amazon Bedrock

Amazon Bedrock Intelligent Prompt Routing

Overview

Amazon Bedrock Intelligent Prompt Routing routes prompts to different foundational models within a model family, helping you optimize for quality of responses and cost. Intelligent Prompt Routing can reduce costs by up to 30% without compromising on accuracy.

Maximize performance at lower cost

It can be a challenge for developers to understand which queries require more advanced models or could work with a smaller, faster, and cheaper ones. Using advanced prompt matching and model understanding techniques, Intelligent Prompt Routing predicts the performance of each model for each request and dynamically routes each request to the model that it predicts is most likely to give the desired response at the lowest cost. You can configure a prompt router with any two models from the same family with Anthropic (Haiku, Haiku 3.5, Claude Sonnet 3.5 v1, Claude Sonnet 3.5 v2), Meta Llama (3.1 8b, 70b, 3.2 11B, 90B and 3.3 70B ) and Amazon Nova (Nova Lite and Nova Pro).

"Amazon Bedrock interface showing a 'Select models' dialog with options for model providers, prompt routers, and inference types."

Reduce your development effort

To achieve the desired performance and cost for your applications, you must often develop complex orchestration workflows, routing each request to the model best suited for that request based on your experience to achieve the desired performance in terms of accuracy. With Intelligent Prompt Routing, you can save months of effort on testing different models and creating complex orchestration workflows by selecting default prompt routers provided by Amazon Bedrock, or by configuring your own. You can easily configure your router by choosing two models from a model family, and then configuring the routing criteria for your router.

"Amazon Bedrock interface for configuring a prompt router, showing fields for router details, model selection, fallback model, and routing criteria."

Easily debug with fully traceable requests

Each request is fully traceable, enabling you to identify which model handles each request and enabling you to easily understand and debug any issues.

Screenshot of Amazon Bedrock's chat/text playground interface showing a conversation about LK-99 and router metrics, with configuration settings visible on the left.

Amazon Bedrock Intelligent Prompt Routing

Overview

Maximize performance at lower cost

Reduce your development effort

Easily debug with fully traceable requests

Getting Started

Get started building in the console

Learn more with the documentation

See our pricing page for detailed pricing for different model providers

Learn

Resources

Developers

Help