How Credits Work
For most models (chat, completions):Model Multipliers
Different models have different multipliers based on their capabilities and costs:Chat Models
| Model | Multiplier | Plan |
|---|---|---|
gpt-4.1-nano | 0.1 | All |
gpt-4o-mini | 0.25 | All |
gpt-5.1 | 0.75 | All |
gpt-4o | 1.25 | All |
o3-mini | 0.25 | All |
o3 | 0.5 | Basic+ |
o1 | 5.0 | Premium+ |
claude-3-5-haiku-20241022 | 1.0 | All |
claude-sonnet-4-5-20250929 | 1.75 | Basic+ |
claude-opus-4-5-20251101 | 4.0 | Basic+ |
gemini-2.0-flash | 0.5 | All |
gemini-2.5-pro | 1.0 | All |
deepseek-v3 | 0.1 | All |
deepseek-r1 | 0.35 | All |
lumina | 0.3 | All |
Fixed Cost Models
| Model | Credits | Type |
|---|---|---|
gpt-image-1 | 2,000 | Image generation |
imagen-3.0-generate-002 | 2,500 | Image generation |
flux-kontext-pro | 3,000 | Image generation |
midjourney | 75,000 | Image generation |
text-embedding-3-small | 50 | Embeddings |
text-embedding-3-large | 50 | Embeddings |
tts-1 | 75 | Text-to-speech |
tts-1-hd | 150 | Text-to-speech |
whisper-1 | 10 | Transcription |
sora-2 | 5,000 | Video generation |
sora-2-pro | 15,000 | Video generation |
omni-moderation-latest | 0 | Moderation (free) |
Example Calculation
If you send a request usinggpt-5.1 (multiplier: 0.75) with:
- Input tokens: 1,000
- Output tokens: 500
- Total tokens: 1,500
Plans
Different plans have access to different models:| Plan | Access |
|---|---|
| Free | Basic models (gpt-4o-mini, gemini-2.0-flash, etc.) |
| Basic | + Claude Sonnet, o3, Sora |
| Premium | + Claude Opus, o1, Premium image models |
| Pro/Ultra | All models including Midjourney |
Checking Your Balance
Your current credit balance is available in your dashboard.Discounts
VoidAI offers personalized daily discounts on select models. When you have an active discount:- Normal cost: 1,000 credits
- Discounted cost: 500 credits
Rate Limiting
In addition to credits, there’s a rate limit of 100 requests per minute per API key. If exceeded, you’ll receive a429 Too Many Requests error.
Tips to Optimize Costs
Choose the right model
Use smaller models (gpt-4o-mini, deepseek-v3) for simple tasks. Save premium models for complex work.
Use discounts
Check your daily discounts and time high-volume work accordingly.
Cache responses
Cache responses for repeated identical queries to avoid duplicate costs.
Optimize prompts
Shorter prompts = fewer input tokens = lower costs.
