Models / Minimax AI
LLM
Reasoning

MiniMax M1 40K

456B-parameter hybrid MoE reasoning model with 40K thinking budget, lightning attention, and 1M token context for efficient reasoning and problem-solving tasks.

About model

MiniMax M1 40K is a large-scale hybrid-attention reasoning model suitable for complex tasks requiring long input processing and extensive thinking. It supports a context length of 1 million tokens and enables efficient scaling of test-time compute. Ideal for users needing advanced language modeling capabilities for tasks like software engineering and mathematical reasoning.

To run this model you first need to deploy it on a Dedicated Endpoint.

Performance benchmarks

Model

AIME 2025

GPQA Diamond

HLE

LiveCodeBench

MATH500

SWE-bench verified

76.9%

70.0%

96.8%

Related open-source models

Competitor closed-source models

Claude Opus 4.6

90.5%

34.2%

78.7%

OpenAI o3

83.3%

24.9%

99.2%

62.3%

OpenAI o1

76.8%

96.4%

48.9%

GPT-4o

49.2%

2.7%

32.3%

89.3%

31.0%

  • Model card

    Architecture Overview:
    • Hybrid Mixture-of-Experts with 456 billion total parameters and 45.9 billion activated per token
    • Lightning attention mechanism for efficient test-time compute scaling
    • 1 million token context window for extensive document processing and analysis
    • Optimized hybrid attention design balancing performance with computational efficiency

    Training Methodology:
    • Large-scale reinforcement learning on diverse reasoning and engineering problems
    • CISPO algorithm optimization for efficient importance sampling weight management
    • 40K thinking budget providing balanced reasoning capabilities with computational efficiency
    • Trained on diverse problems from mathematical reasoning to real-world software engineering

    Performance Characteristics:
    • Efficient test-time compute scaling with lightning attention mechanism
    • Strong performance on AIME 2024 (83.3), SWE-bench Verified (55.6), and coding benchmarks
    • Superior efficiency compared to larger reasoning models while maintaining quality
    • Optimized for tasks requiring substantial reasoning with moderate computational budgets

  • Prompting

    Reasoning Capabilities:
    • Advanced reasoning model with 40K thinking budget for efficient problem-solving
    • System/user/assistant format optimized for reasoning chains and complex tasks
    • Lightning attention enables efficient scaling while maintaining reasoning quality
    • Balanced approach to extensive reasoning with computational efficiency considerations

    Optimization Settings:
    • Temperature 1.0 and top_p 0.95 for creativity and logical coherence balance
    • General scenarios: "You are a helpful assistant" for broad applications
    • Mathematical reasoning: Step-by-step reasoning with structured output formatting
    • Code generation: Comprehensive web development and engineering assistance

    Efficiency Features:
    • Function calling capabilities for structured external integrations
    • Efficient reasoning budget allocation for cost-effective complex problem-solving
    • Strong performance across diverse domains with moderate computational requirements
    • Optimal balance between reasoning capability and resource utilization

  • Applications & use cases

    Efficient Reasoning Applications:
    • Mathematical problem-solving and competition-level tasks with budget efficiency
    • Software engineering and coding assistance requiring moderate reasoning depth
    • Long-context document analysis with 1M token processing capability
    • Multi-step reasoning tasks with computational efficiency requirements

    Business & Development:
    • Cost-effective reasoning applications for business problem-solving
    • Development environments requiring advanced AI assistance with budget considerations
    • Educational applications requiring step-by-step reasoning and explanation
    • Research and analysis tasks with moderate complexity and reasoning requirements

    Balanced Applications:
    • Applications requiring advanced reasoning capabilities without premium computational costs
    • Complex problem-solving scenarios with efficiency and performance balance
    • Next-generation AI agents for real-world challenges with resource optimization
    • Advanced decision-making systems requiring substantial reasoning with cost-effectiveness

Related models
  • Model provider
    Minimax AI
  • Type
    LLM
    Reasoning
  • Main use cases
    Chat
    Reasoning
  • Deployment
    On-Demand Dedicated
    Monthly Reserved
  • Parameters
    456B
  • Activated parameters
    45.9B
  • Context length
    1M
  • Input modalities
    Text
  • Output modalities
    Text