72 models in total; 48 currently shown
Latest
NEW
HOT
Doubao Seed 2.0 Pro 260215
LLM
PRO

No description available.

Context: 262.14K
Input: $0.5/M tokens
Output: $3/M tokens
Max output: 131.07K
Cache-Based
Gradient-Based
NEW
HOT
Doubao Seed 2.0 Code Preview 260215
LLM
PREVIEW

No description available.

Context: 262.14K
Input: $0.5/M tokens
Output: $3/M tokens
Max output: 131.07K
Cache-Based
Gradient-Based
NEW
HOT
Doubao Seed 2.0 Lite 260428
LLM

No description available.

Context: 262.14K
Input: $0.25/M tokens
Output: $2/M tokens
Max output: 131.07K
Cache-Based
Gradient-Based
NEW
HOT
Doubao Seed 2.0 Mini 260428
LLM

No description available.

Context: 262.14K
Input: $0.1/M tokens
Output: $0.4/M tokens
Max output: 131.07K
Cache-Based
Gradient-Based
NEW
HOT
Doubao Seed 1.8 251228
LLM

No description available.

Context: 262.14K
Input: $0.25/M tokens
Output: $2/M tokens
Max output: 65.54K
Cache-Based
Gradient-Based
NEW
HOT
Doubao Seed 1.6 Flash 250828
LLM

No description available.

Context: 262.14K
Input: $0.075/M tokens
Output: $0.3/M tokens
Max output: 32.77K
Cache-Based
Gradient-Based
NEW
HOT
Doubao Seed 1.6 251015
LLM

No description available.

Context: 262.14K
Input: $0.25/M tokens
Output: $2/M tokens
Max output: 65.54K
Cache-Based
Gradient-Based
NEW
HOT
Kimi K2.7 Code
LLM

Powerful coding model for programming, debugging, and AI developer workflows.

Context: 262.14K
Input: $0.95/M tokens
Output: $4/M tokens
Max output: 262.14K
Cache-Based
NEW
HOT
Grok Build 0.1
LLM

Specialized coding model optimized for software development, code generation, debugging, refactoring, and developer workflows.

Context: 262.14K
Input: $1/M tokens
Output: $2/M tokens
Max output: 262.14K
Cache-Based
Gradient-Based
NEW
HOT
Grok 4.3
LLM

Advanced conversational AI model optimized for natural dialogue, knowledge exploration, reasoning, and interactive chat experiences.

Context: 1000.00K
Input: $1.25/M tokens
Output: $2.5/M tokens
Max output: 1000.00K
Cache-Based
Gradient-Based
HOT
Claude Opus 4.8
LLM

Anthropic's most capable model, built for advanced reasoning, complex workflows, deep analysis, and high-quality content generation.

Context: 1000.00K
Input: $5/M tokens
Output: $25/M tokens
Max output: 128.00K
Cache-Based
NEW
HOT
Gemini 3.5 Flash
LLM

Fast and cost-efficient multimodal model designed for high-throughput applications, real-time interactions, and everyday AI tasks.

Context: 1048.58K
Input: $1.5/M tokens
Output: $9/M tokens
Max output: 65.54K
Cache-Based
NEW
HOT
DeepSeek V4 Pro
LLM
PRO

DeepSeek V4 Pro is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.

Context: 1048.58K
Input: $1.68/M tokens
Output: $3.38/M tokens
Max output: 393.22K
Cache-Based
NEW
HOT
DeepSeek V4 Flash
LLM

DeepSeek V4 Flash is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.

Context: 1048.58K
Input: $0.14/M tokens
Output: $0.28/M tokens
Max output: 393.22K
Cache-Based
NEW
OWL
LLM

No description available.

Context: 1048.76K
Input: Free
Output: Free
Max output: 262.14K
HOT
Kimi K2.6
LLM

Enhanced model for reasoning, coding, and productivity.

Context: 262.14K
Input: $0.95/M tokens
Output: $4/M tokens
Max output: 262.14K
Cache-Based
NEW
Qwen3.6 35B A3B
LLM

The latest Qwen reasoning model.

Context: 262.14K
Input: $0.161/M tokens
Output: $0.965/M tokens
Max output: 65.54K
NEW
Qwen3.6 Plus
LLM

Versatile model for chat and productivity workflows.

Context: 1000.00K
Input: $0.325/M tokens
Output: $1.95/M tokens
Max output: 65.54K
Cache-Based
Gradient-Based
NEW
HOT
GLM 5.1
LLM

GLM-5.1 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Context: 202.75K
Input: $1.26/M tokens
Output: $3.96/M tokens
Max output: 202.75K
Cache-Based
NEW
HOT
MiniMax M2.7
LLM

MiniMax-M2.7 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Context: 196.61K
Input: $0.3/M tokens
Output: $1.2/M tokens
Max output: 196.61K
Cache-Based
NEW
HOT
MiniMax M3
LLM

MiniMax M3 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Context: 524.30K
Input: $0.42/M tokens
Output: $1.68/M tokens
Max output: 524.29K
Cache-Based
NEW
Qwen3.5 122B A10B
LLM

Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.

Context: 262.14K
Input: $0.3/M tokens
Output: $2.4/M tokens
Max output: 65.54K
NEW
Qwen3.5 35B A3B
LLM

Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.

Context: 262.14K
Input: $0.225/M tokens
Output: $1.8/M tokens
Max output: 65.54K
NEW
Qwen3.5 27B
LLM

Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.

Context: 262.14K
Input: $0.27/M tokens
Output: $2.16/M tokens
Max output: 65.54K
NEW
Qwen3 Coder Next
LLM

Qwen3 Coder represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.

Context: 262.14K
Input: $0.18/M tokens
Output: $1.35/M tokens
Max output: 262.14K
Gradient-Based
NEW
Qwen3.5 397B A17B
LLM

Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility to empower developers and enterprises with unprecedented capability and efficiency.

Context: 262.14K
Input: $0.55/M tokens
Output: $3.5/M tokens
Max output: 65.54K
Cache-Based
HOT
MiniMax M2.5
LLM

MiniMax-M2.5 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Context: 196.61K
Input: $0.295/M tokens
Output: $1.2/M tokens
Max output: 196.61K
Cache-Based
NEW
HOT
GLM 5v Turbo
LLM
TURBO

GLM-5v Turbo is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Context: 202.75K
Input: $1.2/M tokens
Output: $4/M tokens
Max output: 131.07K
Cache-Based
NEW
HOT
GLM 5 Turbo
LLM
TURBO

GLM-5 Turbo is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Context: 262.14K
Input: $1.2/M tokens
Output: $4/M tokens
Max output: 131.07K
Cache-Based
NEW
HOT
GLM 5
LLM

GLM-5 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Context: 202.75K
Input: $0.95/M tokens
Output: $3.15/M tokens
Max output: 202.75K
Cache-Based
Qwen3 VL 30B A3B Thinking
LLM

The latest Qwen reasoning model.

Context: 128.00K
Input: $0.15/M tokens
Output: $1.5/M tokens
Max output: 32.00K
Qwen3 VL 8B Instruct
LLM

The latest Qwen reasoning model.

Context: 128.00K
Input: $0.08/M tokens
Output: $0.5/M tokens
Max output: 32.00K
Qwen3 VL 30B A3B Instruct
LLM

The latest Qwen reasoning model.

Context: 128.00K
Input: $0.15/M tokens
Output: $0.6/M tokens
Max output: 32.00K
HOT
Kimi K2.5
LLM

Powerful model for long-context and intelligent workflows.

Context: 262.14K
Input: $0.49/M tokens
Output: $2.5/M tokens
Max output: 262.14K
Cache-Based
NEW
Qwen3.7 Max
LLM

Flagship model for advanced reasoning, coding, and complex tasks.

Context: 1000.00K
Input: $2.5/M tokens
Output: $7.5/M tokens
Max output: 67.07K
Cache-Based
NEW
Qwen3.7 Plus
LLM

Balanced model combining strong capability, speed, and efficiency.

Context: 1000.00K
Input: $0.4/M tokens
Output: $1.6/M tokens
Max output: 67.07K
Cache-Based
Gradient-Based
NEW
Qwen3.5 Plus
LLM

Efficient model for everyday tasks and AI assistants.

Context: 1000.00K
Input: $0.4/M tokens
Output: $2.4/M tokens
Max output: 67.07K
Cache-Based
Gradient-Based
NEW
Qwen3.5 Flash
LLM

Fast model optimized for instant responses and large-scale usage.

Context: 1000.00K
Input: $0.1/M tokens
Output: $0.4/M tokens
Max output: 67.07K
NEW
Qwen3 Max 20260123
LLM

Qwen3-Max is a flagship large language model designed for ultra-long context understanding, powerful reasoning, and high-performance text and code generation, making it well suited for complex, large-scale, and production-grade AI applications.

Context: 252.00K
Input: $1.2/M tokens
Output: $6/M tokens
Max output: 32.00K
Gradient-Based
HOT
MiniMax M2.1
LLM

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Context: 196.61K
Input: $0.29/M tokens
Output: $0.95/M tokens
Max output: 196.61K
Cache-Based
NEW
HOT
GLM 4.7
LLM

GLM-4.7 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

Context: 202.75K
Input: $0.52/M tokens
Output: $1.85/M tokens
Max output: 202.75K
Cache-Based
NEW
HOT
DeepSeek V3.2
LLM

DeepSeek V3.2 is a state-of-the-art large language model combining efficient sparse attention, strong reasoning, and integrated agent capabilities for robust long-context understanding and versatile AI applications.

Context: 163.84K
Input: $0.26/M tokens
Output: $0.38/M tokens
Max output: 163.84K
Cache-Based
NEW
HOT
GPT 5.4
LLM

Advanced multimodal model optimized for reasoning, coding, content generation, and complex problem-solving with strong accuracy and reliability.

Context: 400.00K
Input: $2.5/M tokens
Output: $15/M tokens
Max output: 128.00K
Cache-Based
Gradient-Based
NEW
HOT
GPT 5.5
LLM

Advanced multimodal model optimized for reasoning, coding, content generation, and complex problem-solving with strong accuracy and reliability.

Context: 1050.00K
Input: $5/M tokens
Output: $30/M tokens
Max output: 128.00K
Cache-Based
Gradient-Based
NEW
HOT
KwaiKAT
LLM
PRO

KAT Coder Pro V2

KAT Coder Pro is KwaiKAT's most advanced agentic coding model in the KAT-Coder series. Designed specifically for agentic coding tasks, it excels in real-world software engineering scenarios, achieving a 73.4% solve rate on the SWE-Bench Verified benchmark.

Context: 262.14K
Input: $0.3/M tokens
Output: $1.2/M tokens
Max output: 144.00K
Cache-Based
NEW
HOT
Gemini 3.1 Pro Preview
LLM
PRO
PREVIEW

Preview version of Google's flagship reasoning model, offering enhanced analytical capabilities, long-context understanding, and advanced multimodal performance.

Context: 1000.00K
Input: $2/M tokens
Output: $12/M tokens
Max output: 64.00K
Cache-Based
Gradient-Based
HOT
MiniMax M2
LLM

MiniMax-M2 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency.

Context: 196.61K
Input: $0.255/M tokens
Output: $1/M tokens
Max output: 196.61K
Cache-Based
NEW
HOT
DeepSeek V3.2 Exp
LLM

The fastest, most cost-effective model from DeepSeek AI.

Context: 163.84K
Input: $0.27/M tokens
Output: $0.41/M tokens
Max output: 163.84K
