Simple, Transparent Pricing
Pay only for what you use. No hidden fees or commitments.
Special Offers
Limited time promotions and special deals.
Special Deal
Claude 3.7 Sonnet
$2.00 per 1M input tokens
$2.00 per 1M output tokens
$2.00 per 1M output tokens
Limited time promotion!
- Anthropic's latest model
- Superior reasoning & coding
- Excellent comprehension
- 500 requests/minute
Free
Gemini 2.5 Pro Exp
$2.00 per 1M input tokens
$5.00 per 1M output tokens
$5.00 per 1M output tokens
Limited time promotion!
- Google's experimental model
- Superior reasoning
- Excellent for complex tasks
- 500 requests/minute
Free
LLaMA 4 17B 12E (Scout)
$0.50 per 1M input tokens
$1.00 per 1M output tokens
$1.00 per 1M output tokens
Limited time promotion!
- Meta's experimental model
- 12-expert architecture
- Fast response times
- 500 requests/minute
Standard Models
Our full lineup of AI models at competitive prices
New Release
GPT-4.1
$1.80 per 1M input tokens
$5.40 per 1M output tokens
$5.40 per 1M output tokens
70% cheaper than retail
- OpenAI's latest model
- Enhanced reasoning capabilities
- Improved content generation
- 500 requests/minute
New Release
Gemini 2.5 Pro
$0.50 per 1M input tokens
$2.00 per 1M output tokens
$2.00 per 1M output tokens
80% cheaper than retail
- Google's latest model
- Superior context handling
- Advanced reasoning
- 1,000 requests/minute
New Release
LLaMA 3.3 70B
$0.25 per 1M input tokens
$0.75 per 1M output tokens
$0.75 per 1M output tokens
High-performance open model
- Meta's most powerful model
- Exceptional reasoning
- 8K context window
- 750 requests/minute
Most Popular
GPT-4o
$2.50 per 1M input tokens
$10.00 per 1M output tokens
$10.00 per 1M output tokens
Up to 75% cheaper than retail
- OpenAI's best model
- Superior reasoning
- Advanced code generation
- 500 requests/minute
DeepSeek Reasoner
$0.55 per 1M input tokens
$2.19 per 1M output tokens
$2.19 per 1M output tokens
Best value for reasoning
- Excellent reasoning
- Math & logic focused
- Great price/performance
- Unlimited requests
Gemini 2.0 Flash
$0.10 per 1M input tokens
$0.40 per 1M output tokens
$0.40 per 1M output tokens
90% cheaper than retail
- Google's fastest model
- Excellent for chat
- Multi-modal support
- 2,000 requests/minute
Also Popular
GPT-4o Mini
$0.15 per 1M input tokens
$0.60 per 1M output tokens
$0.60 per 1M output tokens
- Efficient GPT-4
- Great for most tasks
- 750 requests/minute
Gemini 1.5 Pro
$0.10 per 1M input tokens
$0.35 per 1M output tokens
$0.35 per 1M output tokens
- Production ready
- Reliable performance
- 1,500 requests/minute
DeepSeek Coder
$0.20 per 1M input tokens
$0.80 per 1M output tokens
$0.80 per 1M output tokens
- Specialized for coding
- Efficient code generation
- 1,000 requests/minute
Free
Gemini 2.0 Flash Lite
$0.00 per 1M input tokens
$0.00 per 1M output tokens
$0.00 per 1M output tokens
Completely free!
- Google's efficient model
- Good for general tasks
- Limited rate: 50 req/min
- No credit card required
Free
LLaMA 3 8B
$0.00 per 1M input tokens
$0.00 per 1M output tokens
$0.00 per 1M output tokens
Completely free!
- Meta's compact model
- Good reasoning ability
- Limited rate: 50 req/min
- No credit card required