DeepSeek

DeepSeek V4 Flash

A faster, more economical option

DeepSeek-V4-Flash trails Pro slightly in world knowledge but shows similar reasoning ability. With fewer total and activated parameters, V4-Flash offers a faster and more economical API service. It comes close to Pro on simpler agent tasks but still trails it on more difficult workloads.

Context: 1M
Released: 2026-04
Relays: 34 sites
Economical API service, Near-Pro reasoning, Strong simple-agent performance, Lower latency and cost

DeepSeek Official Pricing

Currency: CNY
Updated: 2026-04-01T00:00:00.000+08:00

Input: ¥1 / 1M tokens
Output: ¥2 / 1M tokens
Cache read: ¥0.02 / 1M tokens
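
As a rough illustration of how the official rates above translate into a bill, the sketch below estimates the cost of a request in CNY. The function name `estimate_cost_cny` is hypothetical, and the sketch assumes that `input_tokens` counts only uncached input tokens while cached input tokens are billed at the cache read rate; the page itself does not spell out that split.

```python
from decimal import Decimal

# Official list prices quoted above, in CNY per 1M tokens.
PRICE_PER_MILLION = {
    "input": Decimal("1"),
    "output": Decimal("2"),
    "cache_read": Decimal("0.02"),
}


def estimate_cost_cny(input_tokens: int, output_tokens: int,
                      cache_read_tokens: int = 0) -> Decimal:
    """Estimate the cost of one request in CNY.

    Assumption (not stated on the page): input_tokens excludes cached
    tokens, and cached tokens are billed only at the cache read rate.
    """
    total = (
        PRICE_PER_MILLION["input"] * input_tokens
        + PRICE_PER_MILLION["output"] * output_tokens
        + PRICE_PER_MILLION["cache_read"] * cache_read_tokens
    )
    # Prices are per 1M tokens, so scale the weighted token count down.
    return total / Decimal(1_000_000)
```

Under these assumptions, `estimate_cost_cny(200_000, 50_000, 800_000)` comes to ¥0.316: ¥0.20 for input, ¥0.10 for output, and ¥0.016 for cache reads. `Decimal` is used instead of floats so that small per-token amounts do not pick up rounding noise.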

Relay Comparison

Compare token, per-request, or per-second pricing by relay channel.

Bob API (bobdong.cn)
Billing: token
Price range: ¥0.25 - ¥0.3 / 1M tokens

How should DeepSeek V4 Flash relay pricing be compared?

This DeepSeek V4 Flash pricing page compares official pricing with public prices from 34 listed AI gateways. Token prices are shown in CNY per 1M tokens, while per-request, per-second, and per-character rows use the unit shown in the table. Last updated: 06/12/2026, 23:12.
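
One way to apply the unit rule described above is to compare a relay quote with the official price only when both use the same unit. The sketch below is illustrative: the `Quote` type and `ratio_to_official` function are hypothetical names, and which official rate (input or output) a relay's price range should be measured against is an assumption the caller has to make, since the listing does not say.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class Quote:
    relay: str
    unit: str      # e.g. "1M tokens", "request", "second"
    low: Decimal   # lower bound of the listed price range, in CNY
    high: Decimal  # upper bound of the listed price range, in CNY


def ratio_to_official(quote: Quote, official_price: Decimal,
                      official_unit: str = "1M tokens") -> tuple[Decimal, Decimal]:
    """Return the quote's (low, high) as fractions of the official price.

    Raises ValueError when the units differ, because per-request or
    per-second prices cannot be compared directly with per-token prices.
    """
    if quote.unit != official_unit:
        raise ValueError(
            f"cannot compare {quote.unit!r} pricing with {official_unit!r} pricing"
        )
    return quote.low / official_price, quote.high / official_price
```

For instance, a token-billed quote of ¥0.25 to ¥0.3 per 1M tokens measured against an official price of ¥1 per 1M tokens gives ratios of 0.25 and 0.3. A quote billed per request raises an error instead of producing a misleading number.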

Data sources: public price catalogs, official pricing records, and monitoring results.
Metric definitions: uptime is the share of probes that receive a successful response; fake rate signals the risk of a model mismatch or abnormal output; latency is the average API response time.
Risk note: relay gateways are third-party services. Pricing, billing, privacy, and stability can change, so start with a small top-up and verify reliability before continued use.
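
To make the uptime and latency definitions above concrete, here is a minimal sketch of how they could be computed from a list of probe results. The `Probe` record and `summarize` function are hypothetical, and averaging latency over successful probes only is an assumption; the page does not say how failed probes are treated. Fake rate is left out because the detection method is not described.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Optional, Sequence


@dataclass(frozen=True)
class Probe:
    ok: bool           # True if the probe got a successful response
    latency_ms: float  # response time of this probe in milliseconds


def summarize(probes: Sequence[Probe]) -> tuple[float, Optional[float]]:
    """Return (uptime, average latency in ms) for a batch of probes.

    Uptime is the fraction of successful probes. Latency is averaged
    over successful probes only (an assumption) and is None when no
    probe succeeded.
    """
    if not probes:
        raise ValueError("at least one probe is required")
    uptime = sum(p.ok for p in probes) / len(probes)
    ok_latencies = [p.latency_ms for p in probes if p.ok]
    avg_latency = mean(ok_latencies) if ok_latencies else None
    return uptime, avg_latency
```

With three successful probes at 100 ms, 300 ms, and 200 ms plus one failure, this yields an uptime of 0.75 and an average latency of 200 ms.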