together.models
Build with 200+ open-source and specialized multimodal models for chat, images, code, and more.
Featured models
Run any model on the fastest endpoints
Use our API to deploy any open-source model on the fastest inference stack available, with optimal cost efficiency.
Scale into a dedicated deployment anytime, with a custom number of instances, to get optimal throughput.
RUN INFERENCE (cURL)
curl -X POST "https://api.together.xyz/v1/chat/completions" \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "meta-llama/Llama-Vision-Free",
"messages": [{"role": "user", "content": "What are some fun things to do in New York?"}]
}'
RUN INFERENCE (Python)
from together import Together

# Reads the TOGETHER_API_KEY environment variable by default
client = Together()

response = client.chat.completions.create(
    model="meta-llama/Llama-Vision-Free",
    messages=[{"role": "user", "content": "What are some fun things to do in New York?"}],
)
print(response.choices[0].message.content)
RUN INFERENCE (TypeScript)
import Together from "together-ai";

const together = new Together({ apiKey: process.env.TOGETHER_API_KEY });

const response = await together.chat.completions.create({
  messages: [{ "role": "user", "content": "What are some fun things to do in New York?" }],
  model: "meta-llama/Llama-Vision-Free",
});
console.log(response.choices[0].message.content);
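All three examples read the reply from the same OpenAI-style chat-completion schema (`choices[0].message.content`). As a minimal sketch, here is how that field can be extracted from a raw JSON response body; the payload below is illustrative placeholder data, not real API output:

```python
import json

# Illustrative payload mirroring the chat-completion response schema
# used in the examples above; field values are placeholders.
raw = """
{
  "id": "example-id",
  "model": "meta-llama/Llama-Vision-Free",
  "choices": [
    {
      "index": 0,
      "message": {"role": "assistant", "content": "Visit Central Park."},
      "finish_reason": "stop"
    }
  ],
  "usage": {"prompt_tokens": 12, "completion_tokens": 5, "total_tokens": 17}
}
"""

data = json.loads(raw)

# The assistant's reply lives in the first choice's message
reply = data["choices"][0]["message"]["content"]
print(reply)
```

The same path works whether the response comes back from cURL, the Python SDK, or the TypeScript SDK, since all return this shared schema.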
- Testing conducted by Together AI in November 2023 using Llama-2-70B running on Together Inference, TGI, vLLM, Anyscale, Perplexity, and OpenAI. MosaicML comparison based on published numbers in the MosaicML blog. Detailed results and methodology published here.
- Testing conducted by Together AI in November 2023 using Llama-2-70B running on Together Inference. Detailed results and methodology published here.
- Based on published pricing as of November 8th, 2023, comparing OpenAI GPT-3.5-Turbo to Llama-2-13B on Together Inference using Serverless Endpoints. Assumes an equal number of input and output tokens.
- Compared to a standard attention implementation in PyTorch, FlashAttention-2 can be up to 9x faster. Source.
- Testing methodology and results published in this research paper.
- Based on published pricing as of November 8th, 2023, comparing AWS Capacity Blocks and AWS p5.48xlarge instances to Together GPU Clusters configured with an equal number of H100 SXM5 GPUs on our 3200 Gbps InfiniBand networking configuration.