AI Gateway
How it works
Use cases
FAQs
Cloudflare AI Gateway provides centralized visibility and control for your AI applications. Connect your apps with a single line of code to monitor usage, costs, and errors. Reduce risks and expenses through caching, rate limiting, request retries, and model fallbacks. Ensure reliability, scalability, and productivity with minimal effort.
Connect your AI apps to AI Gateway for a unified dashboard and control costs with usage stats, rate limiting, and caching.
Gain visibility into prompts, AI API requests, errors, token usage, costs, and more. Logs are available for auditing and troubleshooting.
Unify the top AI providers, including Hugging Face, OpenAI, Anthropic, and Workers AI, for comprehensive visibility into your AI applications.
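Connecting an app "with a single line of code" in practice means pointing an existing SDK at a gateway base URL instead of the provider's own endpoint. A minimal sketch, assuming the documented Cloudflare endpoint pattern; the account and gateway IDs are placeholders for your own values:

```python
# Route existing SDK traffic through AI Gateway by swapping the base URL.
# ACCOUNT_ID and GATEWAY_ID are hypothetical placeholders.
ACCOUNT_ID = "your_account_id"
GATEWAY_ID = "my-gateway"

def gateway_base_url(account_id: str, gateway_id: str, provider: str = "openai") -> str:
    """Build the AI Gateway endpoint that fronts a given provider."""
    return f"https://gateway.ai.cloudflare.com/v1/{account_id}/{gateway_id}/{provider}"

# With the OpenAI SDK, for example, the only change is the base_url argument:
# client = OpenAI(api_key=..., base_url=gateway_base_url(ACCOUNT_ID, GATEWAY_ID))
```

Because the gateway is just a different base URL, the rest of the application code (model names, request bodies, response handling) stays unchanged.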
How it works
By shifting features such as rate limiting, caching, and error handling to the proxy layer, organizations can apply unified configurations across AI apps and inference service providers. AI Gateway sits between your application and the AI provider to give you multivendor AI observability and control.
"Without AI Gateway, it’s difficult to see which applications are driving the majority of the costs with the OpenAI API … We can choose to limit the number of requests used by certain tools to control costs."
RightBlogger
Top AI Gateway use cases
Real-time insights and reliability with logs, metrics, rate limiting, caching, and monitoring.
Effortlessly connect the most popular providers, including Workers AI, Hugging Face, OpenAI, Anthropic, and more, with just one line of code.
Optimize costs and reduce latency with custom caching. Control scaling and prevent excessive activity with rate limiting.
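The caching and rate-limiting behaviors described above can be sketched as a toy in-process proxy. This `GatewaySketch` class is purely illustrative, not Cloudflare's implementation, which runs these features at the edge between your app and the provider:

```python
import time
from collections import deque

class GatewaySketch:
    """Toy proxy layer: serves repeated prompts from a cache and
    rate-limits callers with a sliding window. Illustrative only."""

    def __init__(self, backend, max_requests: int, per_seconds: float):
        self.backend = backend            # callable: prompt -> response
        self.cache: dict[str, str] = {}
        self.window = deque()             # timestamps of recent backend calls
        self.max_requests = max_requests
        self.per_seconds = per_seconds

    def handle(self, prompt: str) -> str:
        # Serve redundant prompts from cache without touching the provider,
        # reducing both latency and per-request cost.
        if prompt in self.cache:
            return self.cache[prompt]
        # Sliding-window rate limit: throttle excessive activity.
        now = time.monotonic()
        while self.window and now - self.window[0] > self.per_seconds:
            self.window.popleft()
        if len(self.window) >= self.max_requests:
            raise RuntimeError("rate limit exceeded")
        self.window.append(now)
        response = self.backend(prompt)
        self.cache[prompt] = response
        return response
```

Note that cache hits do not count against the rate limit, which is exactly why caching at the proxy layer both cuts provider spend and absorbs bursts of identical requests.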
What is AI Gateway?
AI Gateway provides centralized visibility and control for AI applications. It acts as a proxy between your application and AI providers to help you monitor usage, control costs, and reduce risks. AI Gateway offers logs, metrics, rate limiting, caching, and monitoring for AI applications.
What are the benefits of AI Gateway?
AI Gateway helps you control costs across all your AI applications, provides easy-to-use analytics for troubleshooting, and unifies management across the most popular AI providers.
Which AI providers does AI Gateway support?
AI Gateway supports the most popular providers, including Hugging Face, OpenAI, Perplexity, Anthropic, Replicate, Groq, and Cloudflare's own Workers AI.
How does AI Gateway help control costs?
AI Gateway helps AI app developers control costs by providing usage statistics, allowing them to implement rate limiting to prevent excessive usage, and using caching to reduce latency and serve redundant requests more efficiently.
What visibility does AI Gateway provide?
You can gain visibility into prompts, AI API requests, errors, token usage, and costs. Logs are also available for auditing and troubleshooting.
How does AI Gateway improve reliability?
AI Gateway makes it easy for AI application developers to improve the resilience of their applications by defining request retries and model fallbacks in case of an error. AI Gateway can serve requests directly from Cloudflare's cache instead of the model providers, helping the application serve users more efficiently at scale. And rate limiting can throttle excessive requests to prevent denial of service to legitimate users.
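The retry-and-fallback behavior described above can be sketched as follows. `call_with_fallbacks` and its parameters are hypothetical names for illustration, not AI Gateway's actual configuration API:

```python
import time

def call_with_fallbacks(prompt, providers, retries_per_provider=2, backoff=0.1):
    """Try each provider in order; retry transient errors with exponential
    backoff before falling back to the next one. `providers` is a list of
    (name, callable) pairs -- an illustrative stand-in for a configured
    retry/fallback policy."""
    last_error = None
    for name, call in providers:
        for attempt in range(retries_per_provider):
            try:
                return name, call(prompt)
            except Exception as err:   # real code would catch specific error types
                last_error = err
                time.sleep(backoff * (2 ** attempt))
    raise RuntimeError(f"all providers failed: {last_error}")
```

If the primary model errors out, the request transparently lands on the next configured model, so the application keeps serving users instead of surfacing the provider outage.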