Model List & Capability Comparison

View all AI models supported by the platform and compare their capabilities.

The platform aggregates hundreds of models from major providers including OpenAI, Anthropic, Google, and DeepSeek. The following is an overview of the primary model families.

Text & Chat Models

| Model Family | Representative Models | Context Window | Key Features |
| --- | --- | --- | --- |
| OpenAI GPT | GPT-5.5, GPT-5.4, GPT-5.4-mini, GPT-5.4-nano | 1M | Flagship performance; supports vision and function calling |
| Anthropic Claude | Claude Opus 4.7, Claude Sonnet 4.6, Claude Haiku 4.5 | 1M | Exceptional at long-document comprehension, writing, and coding |
| Google Gemini | Gemini 3.5 Flash, Gemini 3.1 Pro | 1M | Ultra-long context; cutting-edge performance for agents and code |
| DeepSeek | DeepSeek-V4 Pro, DeepSeek-V4 Flash | 64K | Excellent Chinese comprehension and code generation; very cost-effective |
| Alibaba Qwen | Qwen3-235B, Qwen3-72B | 128K | Optimized for Chinese use cases; enterprise-grade applications |
| Meta Llama | Llama 4 Scout, Llama 3.3 | 128K | Open-source models with high value-for-money |

Text Model Comparison

| Model | Context Window | Vision | Function Calling | Reasoning | Best For |
| --- | --- | --- | --- | --- | --- |
| GPT-5.5 | 1M | | | ⭐⭐⭐⭐⭐ | Flagship reasoning, code, professional tasks |
| GPT-5.4 | 1M | | | ⭐⭐⭐⭐⭐ | General chat, code, professional tasks |
| GPT-5.4-mini | 400K | | | ⭐⭐⭐⭐ | High-frequency, low-cost tasks |
| GPT-5.4-nano | 200K | | | ⭐⭐⭐ | Ultra-low cost, high throughput |
| Claude Opus 4.7 | 1M | | | ⭐⭐⭐⭐⭐ | Best-in-class reasoning, code, long documents |
| Claude Sonnet 4.6 | 1M | | | ⭐⭐⭐⭐⭐ | Balanced speed and intelligence |
| Claude Haiku 4.5 | 200K | | | ⭐⭐⭐⭐ | Fastest response; lightweight tasks |
| Gemini 3.5 Flash | 1M | | | ⭐⭐⭐⭐⭐ | Agents, code, frontier performance |
| Gemini 3.1 Pro | 1M | | | ⭐⭐⭐⭐⭐ | Complex tasks, multimodal |
| DeepSeek-V4 Pro | 64K | | | ⭐⭐⭐⭐ | Chinese comprehension, code generation |
| DeepSeek-V4 Flash | 64K | | | ⭐⭐⭐⭐ | Cost-effective reasoning |
| Qwen3-235B | 128K | | | ⭐⭐⭐⭐ | Chinese scenarios, enterprise applications |
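
When a prompt is long, the context window column is usually the first filter to apply. The following is a minimal sketch of that filter, not a platform API: the window labels are copied from the comparison table, while `parse_window`, `models_that_fit`, and the assumption that K means 1,000 tokens and M means 1,000,000 tokens are illustrative. Exact limits may differ, so check the console before relying on them.

```python
# Context window labels copied from the comparison table.
CONTEXT_WINDOWS = {
    "GPT-5.5": "1M",
    "GPT-5.4": "1M",
    "GPT-5.4-mini": "400K",
    "GPT-5.4-nano": "200K",
    "Claude Opus 4.7": "1M",
    "Claude Sonnet 4.6": "1M",
    "Claude Haiku 4.5": "200K",
    "Gemini 3.5 Flash": "1M",
    "Gemini 3.1 Pro": "1M",
    "DeepSeek-V4 Pro": "64K",
    "DeepSeek-V4 Flash": "64K",
    "Qwen3-235B": "128K",
}


def parse_window(label: str) -> int:
    """Convert a label such as '400K' or '1M' to a token count.

    Assumption: K = 1,000 and M = 1,000,000 (hypothetical rounding).
    """
    units = {"K": 1_000, "M": 1_000_000}
    return int(label[:-1]) * units[label[-1]]


def models_that_fit(prompt_tokens: int, reserved_output_tokens: int = 0) -> list[str]:
    """Return the models whose window covers the prompt plus reserved output."""
    needed = prompt_tokens + reserved_output_tokens
    return [
        name
        for name, label in CONTEXT_WINDOWS.items()
        if parse_window(label) >= needed
    ]
```

For example, `models_that_fit(100_000)` excludes both DeepSeek-V4 models (64K) but keeps Qwen3-235B (128K). Reserving output tokens up front avoids picking a model that can read the prompt but has no room left to answer.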

Image Generation Models

| Model | Key Features |
| --- | --- |
| DALL·E 3 | From OpenAI; rich style variety; supports non-English prompts |
| Flux | High-quality photorealistic style with excellent detail rendering |
| Stable Diffusion | Open-source; diverse styles; deployable locally |
| Midjourney | Outstanding for artistic creation and design styles |

Audio Models

| Model | Type | Key Features |
| --- | --- | --- |
| Whisper | Speech-to-Text (STT) | Supports multilingual audio-to-text transcription |
| TTS-1 | Text-to-Speech (TTS) | Fast; suitable for real-time scenarios |
| TTS-1-HD | Text-to-Speech (TTS) | Higher audio quality; suitable for content production |

Embedding & Reranking Models

| Model | Type | Use Case |
| --- | --- | --- |
| text-embedding-3-small | Embedding | Lightweight vectorization; suitable for large-scale text batches |
| text-embedding-3-large | Embedding | High-precision vectorization; suitable for knowledge base retrieval |
| Rerank series | Rerank | Re-ranks retrieval results to improve RAG accuracy |
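
The way embedding and rerank models divide the work in a RAG pipeline can be sketched as a two-stage search. This is only an illustration and calls no platform API: the toy vectors stand in for embedding model output, the `rerank_scores` dictionary stands in for rerank model output, and all function names are hypothetical.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(query_vec: list[float], doc_vecs: dict[str, list[float]], k: int) -> list[str]:
    """Stage 1: rank every document by embedding similarity, keep the top k ids."""
    ranked = sorted(doc_vecs, key=lambda d: cosine(query_vec, doc_vecs[d]), reverse=True)
    return ranked[:k]


def rerank(candidates: list[str], scores: dict[str, float]) -> list[str]:
    """Stage 2: reorder the shortlist by a more precise relevance score."""
    return sorted(candidates, key=lambda d: scores[d], reverse=True)


# Toy data (made up for illustration, not real model output).
query_vec = [1.0, 0.0]
doc_vecs = {"doc_a": [0.9, 0.1], "doc_b": [0.7, 0.7], "doc_c": [0.0, 1.0]}
rerank_scores = {"doc_a": 0.2, "doc_b": 0.8}

shortlist = retrieve(query_vec, doc_vecs, k=2)  # cheap pass over all documents
final = rerank(shortlist, rerank_scores)        # precise pass over the shortlist only
```

The cheap similarity pass runs over every document, while the more expensive rerank pass sees only the shortlist, which is why the two model types are typically used together.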

For the full model list and real-time pricing, see the "Pricing" page in the console. For billing rules, see Billing Rules.

Model Recommendations by Use Case

| Use Case | Recommended Models |
| --- | --- |
| Everyday Q&A, chatbots | GPT-5.4-mini, Gemini 3.5 Flash |
| Code generation & debugging | Claude Sonnet 4.6, DeepSeek-V4 Pro |
| Complex reasoning, math | GPT-5.5, Claude Opus 4.7 |
| Long-document processing | Gemini 3.1 Pro (1M context), Claude Opus 4.7 |
| Chinese language scenarios | DeepSeek-V4 Pro, Qwen3-235B |
| Image understanding | GPT-5.4, Gemini 3.5 Flash, Claude Sonnet 4.6 |
| Image generation | DALL·E 3, Flux, Midjourney |
| Speech recognition | Whisper |
| Text-to-speech | TTS-1, TTS-1-HD |
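
In application code, the recommendations above could be kept as a lookup table with a simple fallback order. The sketch below is one possible arrangement under stated assumptions: the entries use the display names from the table (the actual API model identifiers may differ), and `RECOMMENDED_MODELS` and `pick_model` are hypothetical names, not part of the platform.

```python
from typing import Optional

# Recommendations copied from the table; keys are lowercased use cases.
# Values are display names, which may not match the API model identifiers.
RECOMMENDED_MODELS = {
    "everyday q&a, chatbots": ["GPT-5.4-mini", "Gemini 3.5 Flash"],
    "code generation & debugging": ["Claude Sonnet 4.6", "DeepSeek-V4 Pro"],
    "complex reasoning, math": ["GPT-5.5", "Claude Opus 4.7"],
    "long-document processing": ["Gemini 3.1 Pro", "Claude Opus 4.7"],
    "chinese language scenarios": ["DeepSeek-V4 Pro", "Qwen3-235B"],
    "image understanding": ["GPT-5.4", "Gemini 3.5 Flash", "Claude Sonnet 4.6"],
    "image generation": ["DALL·E 3", "Flux", "Midjourney"],
    "speech recognition": ["Whisper"],
    "text-to-speech": ["TTS-1", "TTS-1-HD"],
}


def pick_model(use_case: str, available: set[str]) -> Optional[str]:
    """Return the first recommended model that is currently available.

    Returns None when the use case is unknown or no recommended model
    is in the `available` set.
    """
    candidates = RECOMMENDED_MODELS.get(use_case.strip().lower(), [])
    for model in candidates:
        if model in available:
            return model
    return None
```

Keeping the list order from the table means the first entry acts as the default and later entries act as fallbacks, for example when a model is disabled for an account.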
