Portkey AI

Info

Sold by: Portkey AI

Control Panel for AI Apps

4.6

View purchase options

Overview

Try agent mode

Create proposal

Ask question

Scaling LLM apps in production is hard. You need reliability, security & accuracy and a bunch of DevOps work (Fallbacks, A/B testing, Load Balancing b/w models, etc) to get confidence on production.

Portkey is your AI control panel, bringing full visibility and control over your AI apps. Integrating Portkey's AI Gateway is a 2 line code upgrade. Bringing in full observability, routing control, guardrails, prompt management and security policies, all managed through a single pane of glass.

Portkey is enterprise-ready and is ISO, SOC2, HIPAA & GDPR compliant.

Highlights

AI Gateway & Observability
Prompt Management & Security
Guardrails & Fine-tuning

Details

Sold by

Portkey AI

Introducing multi-product solutions

You can now purchase comprehensive solutions tailored to use cases and industries.

Learn more

Explore multi-product solutions

Features and programs

Buyer guide

Gain valuable insights from real users who purchased this product, powered by PeerSpot.

Get the buyer guide

Financing for AWS Marketplace purchases

AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

View financing details

Pricing

Portkey AI

Info

View purchase options

Pricing is based on the duration and terms of your contract with the vendor. This entitles you to a specified quantity of use for the contract duration. If you choose not to renew or replace your contract before it ends, access to these entitlements will expire.

Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

12-month contract (1)

Info

Dimension	Description	Cost/12 months
PortkeyEnterprise Access	Enterprise Plan	$99,999.00

Vendor refund policy

Custom.

How can we make this page better?

Tell us how we can improve this page, or report an issue with this product.

Legal

Vendor terms and conditions

Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Usage information

Info

Delivery details

Software as a Service (SaaS)

SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

Resources

Vendor resources

Portkey Docs

Support

Vendor support

Custom Enterprise Support.
support@portkey.ai

AWS infrastructure support

AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

Get support

Product comparison

Info

Updated weekly

Portkey AI

By Portkey AI

Lunary Enterprise Edition

By Lunary AI

Karini AI Agentic AI Platform

By Karini AI

Accolades

Info

Top

In AIOps

Top

In Procurement & Supply Chain

Customer reviews

Info

Sentiment is AI generated from actual customer reviews on AWS and G2

Reviews

Functionality

Ease of use

Customer service

Cost effectiveness

21 reviews

Positive

0 reviews

Insufficient data

2 reviews

Insufficient data

Positive reviews

Mixed reviews

Negative reviews

Overview

Info

AI generated from product descriptions

AI Gateway and Request Routing

AI Gateway with load balancing capabilities between multiple language models and fallback mechanisms for production reliability

Observability and Monitoring

Full visibility and observability over AI applications with comprehensive logging and monitoring through a centralized dashboard

Prompt Management

Centralized prompt management system for versioning, organization, and control of prompts across AI applications

Security and Compliance

Enterprise-grade security with ISO, SOC2, HIPAA, and GDPR compliance certifications and security policy enforcement

Guardrails and A/B Testing

Built-in guardrails for output validation and A/B testing capabilities for comparing model performance and behavior

Observability and Monitoring

Track metrics including cost, token usage, and latency with detailed traces for logging and debugging LLM calls and agents.

Prompt Management and Versioning

Collaborate on prompts with versioning capabilities and perform A/B testing to improve LLM prompt performance.

Evaluation and Testing Integration

Test prompts and models for optimal performance with evaluation suite integration into CI pipeline.

Content Safety and Guardrails

Implement guardrails to prevent toxic content, detect prompt injections, and prevent PII data leakage in AI responses.

Security and Compliance Features

Support for on-premises deployment and GDPR-compliant cloud hosting with PII removal and anomaly detection capabilities.

No-Code Application Development

Visual interface with built-in connectors and large language models enabling generative AI application deployment without coding requirements.

Multi-Model Support and Comparison

Access to latest large language models with prompt playground functionality for model comparison and evaluation across different LLM options.

Enterprise Security and Governance

Secure credentials management, personally identifiable information masking, data encryption, and role-based access controls for enterprise-level compliance.

Observability and Cost Management

Operational dashboards providing visibility into model spending, performance metrics, usage patterns, and trends for cost tracking and optimization.

Trust and Safety Controls

Content filtering mechanisms to reduce noise, block harmful content, and include relevant citations with ground truth comparison capabilities using LLM as a judge.

Contract

Info

Standard contract

Customer reviews

Leave a review

Ratings and reviews

Info

4.6

23 ratings

5 star

4 star

3 star

2 star

1 star

83%

17%

2 AWS reviews

21 external reviews

External reviews are from G2 and PeerSpot .

Ravindra N.

Unified LLM Gateway with Powerful Observability and Reliability

Reviewed on Jul 21, 2026

Review provided by G2

What do you like best about the product?

What I like most about Portkey is its ability to unify access to multiple large language models through a single gateway. Instead of managing separate integrations for different AI providers, Portkey provides a consistent API along with powerful observability, caching, and reliability features. Unified API for working with multiple LLM providers. Comprehensive logging, tracing, and monitoring for AI requests Intelligent caching and fallback mechanisms that improve performance and reliability. Support for prompt management, versioning, and experimentation. Analytics that help optimize latency, costs, and model performance. For me, the most valuable feature is the observability dashboard. It provides detailed insights into AI requests, token usage, latency, and errors, making it much easier to debug and optimize AI-powered applications. The biggest benefit is improved developer productivity. Portkey simplifies LLM integration, enhances application reliability, and provides the visibility needed to build and maintain production-grade AI systems with confidence.

What do you dislike about the product?

The biggest drawback is the initial setup complexity. While Portkey simplifies long-term AI operations, configuring gateways, policies, and monitoring for a production environment requires some upfront effort. The platform has a learning curve, especially for teams that are new to AI gateways, observability, and prompt management. Configuring advanced routing, fallback strategies, and caching policies can take time.

What problems is the product solving and how is that benefiting you?

Portkey solves the challenge of managing multiple large language model (LLM) providers in production. Instead of building separate integrations, monitoring systems, and fallback logic for each AI provider, Portkey offers a unified gateway with built-in observability, routing, caching, and reliability features. Provides a single API to connect with multiple LLM providers. Centralizes logging, tracing, and monitoring for AI requests. Improves application reliability through intelligent routing, retries, and fallback mechanisms. Helps optimize token usage, latency, and overall AI costs with detailed analytics. Simplifies prompt management and experimentation without changing application code. In my workflow, Portkey makes it much easier to build and maintain AI-powered applications. Instead of managing provider-specific integrations and troubleshooting each service independently, I can monitor requests, compare model performance, and switch providers when needed through one platform. The biggest benefit is better reliability and operational visibility. Portkey reduces development and maintenance effort while providing the insights needed to optimize performance, control costs, and deliver more stable AI applications in production.

Akashkhurana Hirana

Centralized AI gateway has streamlined model routing, costs tracking, and data protection

Reviewed on Jun 27, 2026

Review provided by PeerSpot

What is our primary use case?

Portkey has many features for deploying an application to production, including an AI gateway, observability, guardrails, governance, and prompt management. We primarily use the AI Gateway, observability, and the guardrails.

We have an application that uses LLM models such as Gemini , OpenAI models, or Claude models. We need to interact with these models, which we have deployed elsewhere, and we use Portkey as the AI/LLM gateway to check observability and monitor how our model is behaving. We also have custom fine-tuned trained models, and we want to check the observability or test these models using Portkey. In this scenario, when any person wants to call the LLM or use the AI, they do not need the API key for that model. They can directly call Portkey and it will provide the response. The API keys for all the models are not exposed to the developers or the engineers. They are stored in the vault and can only be accessed by Portkey.

What is most valuable?

We primarily use three features: the AI Gateway, observability, and the guardrails.

The AI gateway and observability are the two features that are very important for the team. The AI gateway acts as a single point of application for all the AI model requests. It also enhances security since we do not need to provide the API key to all the developers. Additionally, it supports load balancing and automatic retries. When a lot of load is coming in, it balances the load among different models. If one of the models is not available, it handles the failover by redirecting the request to the other model. Regarding observability, it is very useful to monitor and understand how our AI application is behaving or performing. It usually logs all the requests and the responses. We can easily get an idea of the token cost, token usage, API cost, and error rates. It makes debugging easier and tracks the AI spending to show how much we have spent. These are the valuable aspects of both features.

Another feature of the AI Gateway is that it lets you switch between providers without changing your application code. Suppose we are using three models for three different purposes and one model is not available, or I want to switch the model within the application without restarting my code or doing a redeployment. We can easily do this with Portkey. Regarding observability, we usually monitor and check how our AI applications are performing and behaving. Other things that we usually track in observability include token usage, API cost, error rates, success and failure metrics, and response latency. This is an important aspect that we usually prioritize.

Guardrails are safety checks and quality control that we apply to the input and output of AI. Suppose I do not want certain personally identifiable information or personal information data of a user to be sent to the AI. Portkey has guardrails that do not allow sending that data to the LLM gateway. It can also act as a middleware that checks all the data we are sending to the LLM before transmission, so we do not send any personally identifiable information or personal information of the user to the LLM gateway.

What needs improvement?

Portkey is a very good software for AI applications. It is a single place solution that you can use if you want your AI application to be in production. However, one thing I would like to say is that it could have better documentation. As of now, the documentation is very concise and short. If possible, they could provide more beginner-friendly tutorials with step-by-step instructions on how to set up and how to see the things in the dashboard. It would also be great if the documentation included sample projects or example projects so that users can easily see and learn from those.

Another thing I would like to add is that it has a steep learning curve. It takes time to learn Portkey because the dashboard is very complex. There are many things that a user can see in Portkey. If they can create good documentation, it could definitely increase their value in the market.

For how long have I used the solution?

I have been using Portkey for around two years.

What do I think about the stability of the solution?

Portkey is totally stable. It is one of the best software solutions in the market for this purpose. The other software solutions I have read about do not provide all the solutions in a single place. The LLM Gateway is one of them, but Portkey provides all the solutions in a single place.

What do I think about the scalability of the solution?

I have tried around fourteen models integrated with Portkey and it does not break under that load. I have seen millions of token consumption through Portkey, and it works well in that scenario. It is scalable.

How are customer service and support?

Portkey's customer service is good. I remember when I was just setting up Portkey, I encountered a small issue. I tried to connect with customer support, raised a ticket, and they replied back within a few hours.

Which other solutions did I evaluate?

I do not believe we evaluated other options.

What other advice do I have?

Everything is perfect except for the documentation and the steep learning curve that I already mentioned. That is the part for which I deducted one mark. Everything else is perfect. Portkey is a very useful software if you are going forward with your AI application that uses the LLM gateway. I would recommend you use it and you will love it within a few days.

Shravan Revanna

Centralized AI control has standardized our workflows and delivers faster, more reliable features

Reviewed on Jun 26, 2026

Review from a verified AWS customer

What is our primary use case?

Portkey serves as a centralized AI gateway for our organization. At Clydo, we are building multiple AI-powered features, including internal copilots, catalog intelligence, customer support assistance, and agentic workflows. This gives us a single layer for model routing, prompt management, and API key management.

A primary example of our main use case is our internal AI assistant used by our operations and customer support team. Every user query goes through Portkey before reaching the language model. Portkey handles authentication and routes requests to appropriate models. One practical scenario involves our engineering team wanting to evaluate a newer model for better reasoning at a lower cost. Instead of changing code across multiple services, we update the routing configuration in Portkey and compare the responses and performance. Another benefit is observability. When users report that AI responses are slower or inconsistent, we use Portkey's logs to see the request latency. Overall, Portkey has become the control layer for our LLM infrastructure. It allows our engineers to focus on building AI features while providing us with centralized visibility, governance, and flexibility.

The main use case is standardizing how AI is consumed across the organization. As more teams start building AI-powered features, we do not want every service to implement its own integration with retry logic or rate limiting, which becomes very difficult to manage. Portkey gives us a common layer that every AI application uses, whether it is an internal or customer-facing chatbot or any agentic flows. This makes the architecture much cleaner and easier to manage. It gives us the flexibility to experiment. We can compare different models and run A/B evaluations for specific workloads. From an engineering perspective, Portkey has evolved from being just an API gateway to becoming part of our platform. It is always a layer that provides consistency, observability, and governance across all our LLM-based applications.

What is most valuable?

The best feature that Portkey offers is a centralized AI gateway, which is the biggest advantage. Instead of integrating directly with different LLM providers, Portkey gives us a single, consistent interface. This results in a simpler architecture and has reduced maintenance. Another valuable feature is multi-model and multi-provider support, which allows us to work with different providers through the same integration. Observability and logging represent one of the most valuable features for production. We gain visibility into request latency, token consumption, errors, and success rates. Cost monitoring is critical since LLM costs can increase quickly, and we need centralized visibility into token usage and API consumption. Reliability features, including fallback mechanisms and routing to different LLMs when there is an issue with the provider, allow us to recover gracefully.

Without question, observability is followed by the AI gateway itself. Whenever we are developing a new AI feature or investigating an issue, the first place we look is the logs and dashboards. We use it for monitoring request latencies and response times, tracking token usage, and identifying timeout requests. We compare how different models perform for the same use cases. Debugging prompts becomes seamless and easy. For example, when our customer support copilots started responding more slowly, we could quickly determine the bottleneck, whether it was the LLM provider or the application itself. Without a centralized observability system, we would have to piece that information together from multiple systems. Portkey itself is something we benefit from continuously, but it is largely transparent once configured.

Some specific details that stand out include how it is vendor-agnostic. Since AI evolves very quickly and models are releasing every few weeks with frequent pricing changes and different performance characteristics, Portkey gives us the flexibility to evaluate and adapt to newer models without having to redesign our application architecture. Another valuable detail is how it encourages good engineering practices. Since every AI request goes through a centralized layer, it is much easier and standardized. If I had to summarize, Portkey gives engineering teams the flexibility to innovate with AI while maintaining the operational costs, control, visibility, and consistency needed to run AI applications at scale.

There are specific outcomes and changes I have noticed since implementing Portkey. The biggest impact is operational efficiency and maintainability. Before Portkey, each AI service managed its own model integration. After adopting Portkey, we centralized those responsibilities into a single gateway. This reduced duplicate engineering efforts and gave every team a consistent way of consuming LLM services. The outcomes we noticed include faster development, quicker troubleshooting, better cost awareness, greater flexibility, and improved reliability.

What needs improvement?

There are definitely some places where Portkey can be improved. Overall, the experience has been positive, but the first area is analytics and reporting. While the observability feature is excellent, we would like to have richer historical analytics and customizable dashboards. For example, it would be useful to see trends by applications and teams or features over long periods without exporting data to an external BI tool. Another area is governance for large organizations. As AI adoption grows, enterprises need more granular, role-based access control. We would like to see more advanced AI evaluation capability built into the platform itself, including features such as prompt versioning and automated quality scoring and regression testing. Finally, while the platform is supported with multiple providers, we would welcome even more intelligent routing capabilities, such as automatically selecting the best model based on latency, cost, and task complexity using configurable policies. These are not major pain points, but they are enhancements that would make an already strong platform even more valuable for an organization that scales their AI workloads.

Documentation and onboarding could be enhanced. Portkey is developer-friendly, but we need more end-to-end references, architecture, and implementation guides for common AI patterns. This would help teams adopt it even faster.

I did not give Portkey a perfect score because while it has become a foundational component in our AI stack and solves several operational challenges, I would still like to see deeper analytics, stronger enterprise governance features, and more built-in AI evaluation capabilities, especially for prompt testing, regression analysis, and model benchmarking. I would be comfortable recommending Portkey to any organization that is building multiple AI applications or wants to manage a scalable way to handle LLM providers and produce AI traffic.

For how long have I used the solution?

I have been using Portkey for more than a year.

What do I think about the stability of the solution?

Regarding Portkey's accuracy and reliability of output, it has met my expectations. Portkey does not generate AI responses itself, but the quality of output primarily depends on the models. From a reliability standpoint, we have had positive experiences. The routing and retry mechanisms are centralized, and handling our AI applications is far better and easier. From an accuracy perspective, Portkey has helped us improve outcomes indirectly because it centralizes logging. To summarize, Portkey does not determine the intelligence of the model, but it gives the tooling to consistently deliver, evaluate, and improve the outputs.

There have not been any situations where I experienced outages. There have been occasional incidents, but they were only under LLMs. From an operations perspective, Portkey has been seamless, and I have not experienced any downtime. It is almost stable all the time.

What do I think about the scalability of the solution?

Overall, Portkey's performance under heavy workloads or peak usage has been satisfying. AI traffic is not at the scale of a large enterprise, but I have experienced spikes during business hours and while running batch AI workflows. In those scenarios, I have not encountered any major scalability bottlenecks. Latency introduced by the gateway has been minimal. In most cases, the issue is the LLM provider rather than the gateway. During periods of high traffic, the observability features have been particularly valuable. There have been occasional cases where upstream LLM providers experienced slower response times or temporary service degradation. In those situations, the retry and routing capability helped reduce the impact. Overall, Portkey has met my expectations for scalability and reliability.

Portkey's scalability has kept up with my organization's growth and changing needs. When I first adopted Portkey, I was using a smaller number of AI providers, and one biggest advantage is that the gateway scales with the number of AI use cases rather than requiring each team to build its own integration. I have also benefited from Portkey's multi-provider capabilities. From an operational perspective, I have not had any major architectural changes. Overall, Portkey has kept pace with my organization's growth and has provided a scalable foundation for expanding AI adoption.

How are customer service and support?

I have not reached out for customer support very often. Overall, the experience with customer support has been positive. Portkey's team has been responsive, technically knowledgeable, and understood the challenges I faced. Most of my questions were around configuring multiple provider routing and optimizing gateway settings. As the platform has matured, I have needed to contact them less frequently because of improved documentation. If I had one suggestion, it would be to expand the self-service knowledge base with more implementation guides. Overall, the customer experience rating is solid at nine out of ten.

Which solution did I use previously and why did I switch?

Previously, I did not have any dedicated gateway. We were integrating directly through SDKs and APIs. While that worked initially, it became difficult to manage. The main challenge was that comparing different LLMs required a lot of code changes, and there was no visibility on token usage. Troubleshooting was also challenging. I evaluated a few alternatives, but I chose Portkey because it has all the capabilities that I needed. It allowed me to standardize my infrastructure. The switch was not from commercial gateways; it was from direct integration to a centralized integration.

How was the initial setup?

I would describe the learning curve for new users or teams adopting Portkey as seamless and very easy to adapt. Knowledge transfer becomes increasingly easier as team members become familiar with the platform.

What was our ROI?

I have seen a return on investment with Portkey. I can share relevant metrics: thirty to forty percent reduction in development time, fifty percent faster evaluation cycles, twenty to twenty-five percent reduction in unnecessary LLM spending, and twenty to twenty-five percent improvement in engineering productivity.

What's my experience with pricing, setup cost, and licensing?

Regarding pricing and licensing, the cost was relatively low because Portkey sits in front of our existing AI infrastructure. From a licensing perspective, I appreciate that the pricing model is predictable and aligned with how we use the platform, since it is an infrastructure component rather than a per-user productivity tool. One consideration was the engineering time saved. Even with a licensing cost and reduction in development efforts, a startup with a few thousand requests per day and a large enterprise processing millions of requests evaluate ROI very differently. Overall, costing and ownership has been favorable.

Which other solutions did I evaluate?

I have not evaluated much before choosing Portkey, but most of the capabilities were largely found on Portkey itself, and I went with this option.

What other advice do I have?

Regarding governance and security, I would say security is quite high. In fact, one of the reasons we chose to implement Portkey is that it gives us a central control point for AI usage rather than every application managing it independently. From a security perspective, I appreciate that sensitive provider credentials are managed centrally instead of being distributed across multiple services. On the governance side, having a single layer for routing, usage tracking, and policy enforcement helps us standardize how AI is used across different teams. As more applications adopt LLMs, consistency becomes increasingly valuable. Overall, I would describe Portkey as a strong differentiator that provides visibility and centralized control.

Portkey is deployed in my organization as a hybrid cloud deployment. We use AWS as part of our hybrid cloud setup. Our applications and backend services run on AWS , and all AI requests are routed through Portkey before reaching the underlying models. We have backend services and API databases on AWS. Portkey centralizes routing, observability, and cost management. This approach gives us the flexibility of managing foundation models while keeping our application logic simpler. Operational controls are within our own cloud environment, allowing us to change model providers without impacting the applications.

In terms of customizing, I would say Portkey is very flexible because one of the reasons we adopted it is that it fits into the existing architecture. Most of my customization has been through configuring rather than custom development. For example, I customized different routing rules for different AI workloads. I have also integrated Portkey into my internal monitoring and engineering workflows. The best part is that our application code remains largely provider-agnostic. If we want to evaluate a new LLM or change routing for a specific use case, we typically do not need to make significant changes. I have not had to build major custom extensions to make Portkey work for us. Most of what I have needed has been supported through existing configurations. Overall, customizability is highly rated.

For my organization, compliance is important, though we are not heavily regulated. From my perspective, Portkey contributes to a centralized control layer. Since all LLM traffic passes through a single gateway, it is much easier. That said, we do not rely on Portkey alone to meet compliance requirements. We combine it with our own security controls.

I can share some approximate figures for faster development and better cost awareness. For development, I estimate that Portkey reduced engineering efforts by around thirty to forty percent. Previously, each team had to implement provider-specific authentication, retries, logging, and error handling. For model evaluation, we have seen about fifty percent reduction in time. From a cost management perspective, we have reduced unnecessary LLM consumption by around fifteen to twenty percent. I have also noticed thirty to forty percent reduction in troubleshooting time. Overall, Portkey has had a huge business impact in terms of cost savings and operational efficiency. I would rate this review overall at nine out of ten.

Singh Aman

Unified ai gateway has standardized observability and routing for multiple llm applications

Reviewed on Jun 05, 2026

Review from a verified AWS customer

What is our primary use case?

Portkey has been used as an AI gateway and observability layer for LLM applications. The main use case is to route requests across multiple LLM providers through a common API, monitor latency, cost tracking, prompts, and responses, and add reliability features such as fallback, retries, caching, and guardrails. It is especially useful when moving from experimentation to production because it provides a single control layer for model routing, prompt management, usage tracking, and operational visibility.

Whenever an API is called through Portkey , it displays the total time consumption that has been taken. Observability is provided there. When it fails to connect to an LLM model, it attempts to use a fallback mechanism, which is helpful.

What is most valuable?

The most valuable features are the unified AI gateway, observability, price calculations, routing controls, and guardrails. The universal API makes it easier to switch between and compare models without rewriting large parts of the application. For the organization, Portkey was providing two endpoints, one for the US and another for Europe, which was easily manageable. Some models were supported for the US, some not, some for the EU, and some for both. Switching between providers is very easy with Portkey. The fallback and retry capabilities are also useful because production LLM applications need resilience when a provider is slow, rate-limited, or temporarily unavailable. Cost and latency tracking are valuable because LLM usage can become difficult to manage across teams, but these can be easily managed through Portkey.

Observability is the best feature because at the production grid application, the focus is usually on observability, such as how the end user is using it, what the latency is, how many errors occurred within time frames, which models were used, and how many tokens were consumed. These aspects are easy to manage in Portkey.

Portkey has significantly improved the organization by streamlining the AI processes being followed. Portkey improves the workflow for centralizing LLM operations instead of every application team building its own logging, routing, fallback, and cost tracking logic. Portkey provides a shared layer for these capabilities so it can be synced between all teams. It helps reduce engineering efforts when experimenting with different providers and models. It also improves visibility into production behavior because request volumes, latency, cost, errors, and model usage can be seen in one place. For teams building multiple GenAI applications, the biggest improvement is standardization. Portkey makes it easier to enforce common practices across observability, reliability, and governance.

What needs improvement?

The main area for improvement is onboarding and trial experience.

Portkey is very useful for production teams, but it may feel more like infrastructure than needed for small prototypes. More guided setup templates for common use cases such as RAG chatbot or internal assistant, evaluation pipelines, and multi-provider failover could make adoption easier. If the onboarding process could be easier, that would be beneficial, as there have been issues faced during onboarding.

For how long have I used the solution?

Portkey has been used for the last two years as it is an integral part of Rosh.

What do I think about the stability of the solution?

Portkey is reliable approximately 99 percent of the time. Some endpoints will occasionally be down. Over a six-month period, endpoints went down once or twice, but most of the time it works very well.

What do I think about the scalability of the solution?

Portkey is designed for scale production uses. It is useful when multiple teams and applications need a consistent gateway for LLM applications, observability, prompt management, and governance. These aspects are good in terms of scalability. The scalability value becomes more obvious as the number of models, providers, applications, and users grows. For a single prototype, it may feel optional, but for production GenAI platforms with multiple applications, it becomes much more valuable.

How are customer service and support?

For enterprise production, support quality is very important. Support was reached out to once on Discord. There is a support group on Discord where the team is very responsive. They provide a newsletter with updates and features. The support experience has been good.

Which solution did I use previously and why did I switch?

Previous solutions were being used, but they were not considered optimal. AWS Marketplace is also being used for some LLM models, but AWS does not support all the models. Portkey is a better solution because it has everything in one place.

How was the initial setup?

The best aspect in terms of cost saving is using Portkey cache, which was built-in to the packages. This literally saved token costs because cache tokens cost less to process. This was the best aspect in terms of productivity. The setup was also smooth because only a single Portkey library is needed, and then any models can be used at once. Whether using AWS Marketplace models, Google models, or OpenAI models, everything is available in one place. There is no need to worry about multiple platforms or multiple packages.

What was our ROI?

Time has been saved for multiple developers because of having a single package to manage multiple models and multiple providers. This was the main ease gained.

What's my experience with pricing, setup cost, and licensing?

The pricing is great. The organization only pays for the model being used. For Portkey pricing, the organization is covering that cost. Roche is taking care of it. The pricing is very reasonable for an organization already running production LLM workloads that needs observability, routing, guardrails, or other features. Portkey pricing is separate from the underlying LLM provider cost. The model providers are still paid for token usage, and Portkey adds the gateway, observability, caching, guardrails, and management layers on top.

Which other solutions did I evaluate?

Nothing has been purchased from the AWS marketplace. Roche is a partner for Portkey and the organization is an enterprise for Portkey, utilizing Portkey features.

What other advice do I have?

Portkey should be recommended for teams that are moving GenAI applications into production and need more control over reliability, cost, observability, and governance. It is especially useful if multiple LLM providers are being used and if changes to models are expected over time. The overall rating given for Portkey is nine out of ten.

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Support Pan

Prompt testing has become structured and flexible but trial access still needs improvement

Reviewed on May 31, 2026

Review provided by PeerSpot

What is our primary use case?

Our team is searching for an alternative to LiteLLM, which is another monitoring management platform, and we found that Portkey is an alternative solution that we decided to try out.

We want to conduct prompt template testing with Portkey by splitting our business logic along with the prompt, so we need to test that against different models and perform evaluation accordingly. This is one major thing we are trying. The product owner can separately conduct their test for the prompt template on the playground.

There could be some advanced usage with Portkey such as setting up the MCP and guard rail. We are thinking about having that part as well, but so far we haven't touched it yet.

What is most valuable?

Portkey provides the standard interfaces for the major features that interested the industry has. It has a very comprehensive feature support for the MCP agent and its playground, support for prompt templates, and allows the management of traceability analytics. It covers most agentic application usage very comprehensively.

Portkey definitely provides a solid alternative solution for the agent and large model hosting platforms, and it is very helpful for us to explore the possibilities across the industry rather than staying with a few mainstream options.

Portkey successfully provides a standard interface and large model based solutions, and we can see those features in many other platforms as well. This is solid, but so far I haven't found anything super exciting or very innovative on this platform. For our organization, it's a great alternative solution to consider.

What needs improvement?

One major problem I see with Portkey is that when I had not yet started any trial, it started to denote that I had exceeded the prompt limit. Users are usually expecting a trial stage with more tokens to at least work through a few days to understand the products inside out. That might be something that could be improved.

The onboarding process of Portkey is very good. However, it requests that there are no free tokens or trial tokens. That was the limitation.

For how long have I used the solution?

I have been using Portkey for a few weeks.

What do I think about the stability of the solution?

When I was doing a test with Portkey, it was stable enough.

What do I think about the scalability of the solution?

We have not reached the point for considering scalability with Portkey.

How are customer service and support?

We do not have a customer relationship with Portkey.

Which solution did I use previously and why did I switch?

We were using LiteLLM and sometimes Langfuse. Langfuse is also open source, so we can deploy that to cover many such features. We used them and felt that their price was a little bit higher than our budget. This was the reason why we were seeking alternative solutions and eventually found Portkey.

How was the initial setup?

I did not want to quickly dive into too much advanced features. Portkey provides a very quick hands-on experience.

What's my experience with pricing, setup cost, and licensing?

Portkey is requiring production to be $49, I guess that is US dollars per month for 100K logs. I would say it is manageable.

What other advice do I have?

It is definitely a private cloud for Portkey from their provider. Using their online services with Portkey is good. If you are seeking an alternative solution for the product, Portkey is definitely worth a look. My overall review rating for this product is 7.

View all reviews