Model List & Capability Comparison

The platform aggregates hundreds of models from major providers including OpenAI, Anthropic, Google, and DeepSeek. The following is an overview of the primary model families.

Text & Chat Models

Model Family	Representative Models	Context Window	Key Features
OpenAI GPT	GPT-5.5, GPT-5.4, GPT-5.4-mini, GPT-5.4-nano	200K to 1M	Flagship performance; supports vision and function calling
Anthropic Claude	Claude Opus 4.7, Claude Sonnet 4.6, Claude Haiku 4.5	200K to 1M	Exceptional at long-document comprehension, writing, and coding
Google Gemini	Gemini 3.5 Flash, Gemini 3.1 Pro	1M	Ultra-long context; cutting-edge performance for agents and code
DeepSeek	DeepSeek-V4 Pro, DeepSeek-V4 Flash	64K	Excellent Chinese comprehension and code generation; very cost-effective
Alibaba Qwen	Qwen3-235B, Qwen3-72B	128K	Optimized for Chinese use cases; enterprise-grade applications
Meta Llama	Llama 4 Scout, Llama 3.3	128K	Open-source models; strong value for money

Text Model Comparison

Model	Context Window	Vision	Function Calling	Reasoning	Best For
GPT-5.5	1M	✅	✅	⭐⭐⭐⭐⭐	Flagship reasoning, code, professional tasks
GPT-5.4	1M	✅	✅	⭐⭐⭐⭐⭐	General chat, code, professional tasks
GPT-5.4-mini	400K	✅	✅	⭐⭐⭐⭐	High-frequency, low-cost tasks
GPT-5.4-nano	200K	✅	✅	⭐⭐⭐	Ultra-low cost, high throughput
Claude Opus 4.7	1M	✅	✅	⭐⭐⭐⭐⭐	Best-in-class reasoning, code, long documents
Claude Sonnet 4.6	1M	✅	✅	⭐⭐⭐⭐⭐	Balanced speed and intelligence
Claude Haiku 4.5	200K	✅	✅	⭐⭐⭐⭐	Fastest response; lightweight tasks
Gemini 3.5 Flash	1M	✅	✅	⭐⭐⭐⭐⭐	Agents, code, frontier performance
Gemini 3.1 Pro	1M	✅	✅	⭐⭐⭐⭐⭐	Complex tasks, multimodal
DeepSeek-V4 Pro	64K	❌	✅	⭐⭐⭐⭐	Chinese comprehension, code generation
DeepSeek-V4 Flash	64K	❌	✅	⭐⭐⭐⭐	Cost-effective reasoning
Qwen3-235B	128K	✅	✅	⭐⭐⭐⭐	Chinese scenarios, enterprise applications
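
The comparison table above can be treated as a small lookup when shortlisting models by requirement. The following is a minimal sketch, not a platform API: the `ModelInfo` class and `candidates` function are hypothetical names introduced for illustration, and the token counts assume that "1M" means 1,000,000 tokens and "64K" means 64,000 tokens. Check the console for the exact limits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelInfo:
    name: str
    context_tokens: int  # assumption: "1M" read as 1_000_000, "64K" as 64_000
    vision: bool
    reasoning_stars: int  # star rating from the comparison table


# Rows copied from the comparison table, in table order.
MODELS = [
    ModelInfo("GPT-5.5", 1_000_000, True, 5),
    ModelInfo("GPT-5.4", 1_000_000, True, 5),
    ModelInfo("GPT-5.4-mini", 400_000, True, 4),
    ModelInfo("GPT-5.4-nano", 200_000, True, 3),
    ModelInfo("Claude Opus 4.7", 1_000_000, True, 5),
    ModelInfo("Claude Sonnet 4.6", 1_000_000, True, 5),
    ModelInfo("Claude Haiku 4.5", 200_000, True, 4),
    ModelInfo("Gemini 3.5 Flash", 1_000_000, True, 5),
    ModelInfo("Gemini 3.1 Pro", 1_000_000, True, 5),
    ModelInfo("DeepSeek-V4 Pro", 64_000, False, 4),
    ModelInfo("DeepSeek-V4 Flash", 64_000, False, 4),
    ModelInfo("Qwen3-235B", 128_000, True, 4),
]


def candidates(min_context: int, need_vision: bool = False) -> list[str]:
    """Return model names whose context window and vision support fit the request."""
    return [
        m.name
        for m in MODELS
        if m.context_tokens >= min_context and (m.vision or not need_vision)
    ]


if __name__ == "__main__":
    # Example: models that accept at least 300K tokens and image input.
    print(candidates(300_000, need_vision=True))
```

Calling `candidates(100_000)` would, under these assumptions, drop the two DeepSeek models (64K) and keep everything else in the table.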

Image Generation Models

Model	Key Features
DALL·E 3	From OpenAI; rich style variety; supports non-English prompts
Flux	High-quality photorealistic style with excellent detail rendering
Stable Diffusion	Open-source; diverse styles; deployable locally
Midjourney	Outstanding for artistic creation and design styles

Audio Models

Model	Type	Key Features
Whisper	Speech-to-Text (STT)	Supports multilingual audio-to-text transcription
TTS-1	Text-to-Speech (TTS)	Fast; suitable for real-time scenarios
TTS-1-HD	Text-to-Speech (TTS)	Higher audio quality; suitable for content production

Embedding & Reranking Models

Model	Type	Use Case
text-embedding-3-small	Embedding	Lightweight vectorization; suitable for large-scale text batches
text-embedding-3-large	Embedding	High-precision vectorization; suitable for knowledge base retrieval
Rerank series	Rerank	Re-ranks retrieval results to improve RAG accuracy

For the full model list and real-time pricing, see the "Pricing" page in the console. For billing rules, see "Billing Rules".

Model Recommendations by Use Case

Use Case	Recommended Models
Everyday Q&A, chatbots	GPT-5.4-mini, Gemini 3.5 Flash
Code generation & debugging	Claude Sonnet 4.6, DeepSeek-V4 Pro
Complex reasoning, math	GPT-5.5, Claude Opus 4.7
Long-document processing	Gemini 3.1 Pro, Claude Opus 4.7 (both 1M context)
Chinese language scenarios	DeepSeek-V4 Pro, Qwen3-235B
Image understanding	GPT-5.4, Gemini 3.5 Flash, Claude Sonnet 4.6
Image generation	DALL·E 3, Flux, Midjourney
Speech recognition	Whisper
Text-to-speech	TTS-1, TTS-1-HD
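
In application code, the recommendations above could be kept as a simple mapping so that a default model is chosen per task. This is an illustrative sketch only: the use-case keys and the `recommend` function are hypothetical, and the model names are copied as display names from the table, which may differ from the identifiers the API expects.

```python
# Use-case keys are hypothetical; model names are the display names from the table.
RECOMMENDED = {
    "everyday_qa": ["GPT-5.4-mini", "Gemini 3.5 Flash"],
    "code": ["Claude Sonnet 4.6", "DeepSeek-V4 Pro"],
    "complex_reasoning": ["GPT-5.5", "Claude Opus 4.7"],
    "long_document": ["Gemini 3.1 Pro", "Claude Opus 4.7"],
    "chinese": ["DeepSeek-V4 Pro", "Qwen3-235B"],
    "image_understanding": ["GPT-5.4", "Gemini 3.5 Flash", "Claude Sonnet 4.6"],
    "image_generation": ["DALL·E 3", "Flux", "Midjourney"],
    "speech_recognition": ["Whisper"],
    "text_to_speech": ["TTS-1", "TTS-1-HD"],
}


def recommend(use_case: str) -> list[str]:
    """Return the recommended models for a use case, in table order."""
    try:
        return list(RECOMMENDED[use_case])
    except KeyError:
        known = ", ".join(sorted(RECOMMENDED))
        raise ValueError(f"unknown use case {use_case!r}; expected one of: {known}") from None


if __name__ == "__main__":
    # Pick the first recommendation as a default for a coding assistant.
    print(recommend("code")[0])
```

Returning a copy of the list keeps callers from accidentally mutating the shared table; taking the first entry as the default is a design choice for this sketch, not a ranking stated by the source.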