Overview
The Patronus AI Platform enables engineering teams to test, score, and benchmark LLM performance on real world scenarios, generate adversarial test cases at scale, monitor hallucinations and other unexpected and unsafe behavior, and more.
Customers use the Patronus AI Platform as soon as they have any kind of an LLM or LLM system in their hand. The platform is primarily used in 2 key parts of the user journey: AI product pre-deployment and AI product post-deployment. The product is typically used with not just LLMs, but also retrieval-based LLM systems, agents, routing architectures, and more. There are also 2 types of key product offerings: 1) cloud-hosted solution, and 2) on-prem self-hosted offering.
For pre-deployment: Customers use several features in the web platform for offline LLM evaluation and experimentation, all in one place. In the Evaluation Run workflow, customers can select or define parameters like the LLM and its associated settings, evaluation dataset, and criteria.
For post-deployment: Customers use the Patronus API and the LLM Failure Monitoring dashboard for LLM testing and evaluation in CI and production. The API solution allows customers to validate, log, and address LLM failures in real-time. To accompany the API and manage the alerts, there is also an LLM Failure Monitoring dashboard in the web platform to visualize, filter, and aggregate statistics on LLM failures.
Highlights
- Retrieval-Augmented Generation (RAG) Testing: Verify that your LLM-based retrieval systems consistently deliver reliable information using our retrieval evaluation API.
- Evaluation Runs: Leverage our managed service for evaluations to auto-generate test suites, score model performance on real world scenarios, benchmark LLMs, and more.
- LLM Failure Monitoring: Continuously evaluate, track, and visualize LLM system performance for your AI product in production.
Details
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/12 months |
---|---|---|
Evaluation Samples | Total number of samples evaluated using Patronus AI Platform. | $1,000,000.00 |
Vendor refund policy
If you cancel your subscription within 48 hours of purchase, you can get a full refund. All other refunds would happen on a case-by-case basis. Reach out to contact@patronus.ai to request a refund.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Support
Vendor support
We have a 24-hour response time SLA for all buyers. Please reach out to contact@patronus.ai if you are experiencing any issues.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.