Chat with GPT-4o-mini, Claude Haiku, or Gemini Flash — all under one API key. LLM proxy with pay-per-message pricing and no subscriptions.
A lightweight LLM proxy that routes your message to the model you choose. The default is GPT-4o-mini via OpenAI. Switch to Claude Haiku (Anthropic) or Gemini Flash (Google) by setting a single field. Supports an optional system prompt, max_tokens, and temperature. Returns the response text plus token usage.
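As a rough illustration of the request described above, the sketch below assembles a JSON body with one required message and the optional settings. Only max_tokens and temperature are named in this listing; the key names "message", "model", and "system" are assumptions made for the example, and the endpoint URL and payment handshake are deliberately left out because they are not documented here.

```python
import json
from typing import Optional


def build_payload(
    message: str,
    model: Optional[str] = None,
    system: Optional[str] = None,
    max_tokens: Optional[int] = None,
    temperature: Optional[float] = None,
) -> dict:
    """Assemble a request body, omitting any optional field that was not set."""
    if not message:
        raise ValueError("message must be a non-empty string")
    # "message", "model", and "system" are hypothetical key names (assumption);
    # "max_tokens" and "temperature" are the names used in the listing.
    payload = {"message": message}
    optional = {
        "model": model,
        "system": system,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    # Keep only the options the caller actually supplied, so the service
    # can apply its own defaults (for example, the default model).
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload


if __name__ == "__main__":
    body = build_payload(
        "Summarize x402 in one sentence.",
        model="claude-haiku",
        temperature=0.2,
    )
    print(json.dumps(body, indent=2))
```

Leaving unset options out of the body, rather than sending nulls, lets the proxy fall back to its defaults without the client having to know them.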
Model selection (default: gpt-4o-mini). See supported models below.

Supported models:
gpt-4o-mini: OpenAI GPT-4o mini (default, fast, cheap)
gpt-4o: OpenAI GPT-4o (more capable)
gpt-4-turbo: OpenAI GPT-4 Turbo
claude-haiku: Anthropic Claude 3 Haiku (fast)
claude-sonnet: Anthropic Claude 3.5 Sonnet
gemini-flash: Google Gemini 1.5 Flash
gemini-pro: Google Gemini 1.5 Pro

Pricing: $0.005 per message via x402 micropayment on Base (USDC). Max tokens: 2048. All models cost the same.
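The model list and flat pricing above lend themselves to a small client-side check before any paid call is made. The following is a minimal sketch, not part of the service: the helper names are hypothetical, and treating 2048 as a hard upper bound on max_tokens is an assumption, since the listing does not say whether larger values are rejected or clamped.

```python
from decimal import Decimal

# Model identifiers and descriptions as given in the listing.
SUPPORTED_MODELS = {
    "gpt-4o-mini": "OpenAI GPT-4o mini",
    "gpt-4o": "OpenAI GPT-4o",
    "gpt-4-turbo": "OpenAI GPT-4 Turbo",
    "claude-haiku": "Anthropic Claude 3 Haiku",
    "claude-sonnet": "Anthropic Claude 3.5 Sonnet",
    "gemini-flash": "Google Gemini 1.5 Flash",
    "gemini-pro": "Google Gemini 1.5 Pro",
}
DEFAULT_MODEL = "gpt-4o-mini"
MAX_TOKENS_LIMIT = 2048  # assumed to be an upper bound on max_tokens
PRICE_PER_MESSAGE_USDC = Decimal("0.005")  # same price for every model


def check_request(model: str = DEFAULT_MODEL, max_tokens: int = MAX_TOKENS_LIMIT) -> None:
    """Raise ValueError if the model is unknown or max_tokens is out of range."""
    if model not in SUPPORTED_MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    if not 1 <= max_tokens <= MAX_TOKENS_LIMIT:
        raise ValueError(f"max_tokens must be between 1 and {MAX_TOKENS_LIMIT}")


def estimate_cost(num_messages: int) -> Decimal:
    """Return the total USDC cost for a number of messages at the flat rate."""
    if num_messages < 0:
        raise ValueError("num_messages must not be negative")
    return PRICE_PER_MESSAGE_USDC * num_messages


if __name__ == "__main__":
    check_request("claude-sonnet", max_tokens=1024)
    print(f"200 messages cost {estimate_cost(200)} USDC")
```

Decimal is used instead of float so that repeated sub-cent amounts add up exactly.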