Skip to main content
9 – 17 UHR +49 8031 3508270 LUITPOLDSTR. 9, 83022 ROSENHEIM
DE / EN
LLM xAI USA

xAI Grok

Grok by xAI - LLM family with 2M token context. Grok 4.3 (GA since May 2026) and the new coding model Grok Build 0.1 are the latest models. EU availability via Azure AI Foundry.

License Apache 2.0 (Grok-1 only)
GDPR Hosting Available
Context 128k - 2M Tokens Tokens
Modality Text, Image → Text, Image

Versions

Overview of available model variants

ModelReleaseEUStrengthsWeaknessesStatus
Grok Build 0.1
29 May 2026
Specialised for agentic coding (planning, execution, debugging) 256K Token Context Window Text and image input (diagrams, mockups, error screenshots) Native MCP support High inference speed (100+ tokens/second) Cost-efficient: $1 input / $2 output per 1M tokens
Focused on coding tasks, not a general-purpose LLM Public beta on the xAI API No AWS/GCP availability
Current
Grok 4.3 Recommended
May 2026 (GA)
Current flagship model, General Availability Native multimodal video understanding Custom Skills: persistent expertise across conversations Integrated code execution environment Knowledge cutoff December 2025 1M Token Context Window
Requires SuperGrok / Premium+ subscription for full access No AWS/GCP availability
Current
Grok 4.3 Beta
17 April 2026
Beta lead-up to final Grok 4.3 Native multimodal video understanding Generates downloadable PDFs, spreadsheets, PowerPoint decks from conversation Knowledge cutoff December 2025
Beta status, superseded by GA version No AWS/GCP availability
Deprecated
Grok 4.20 Beta
17 February 2026
Multi-agent architecture (4 specialised agents in parallel) Rapid learning – weekly weight updates from real-world feedback Medical document analysis via photo upload 2M Token Context Window
No AWS/GCP availability
Current
Grok 4 Heavy
2025
Highest Quality for Complex Tasks
Higher Latency and Cost
Current
Grok Code Fast 1
2025
Specialized for Code Generation Fast Inference
Only Optimized for Code Tasks
Current
Grok 4.1
November 2025
Current Generation Broad Coverage
Availability Varies
Current
Grok 4.1 Thinking
November 2025
Reasoning Focus
Higher Latency
Current
Grok 4.1 Fast
November 2025
Fast and Cost-Efficient 2M Token Context Window
Less Depth
Current
Grok 4
July 2025
Strong General Model
Smaller Context Window (128K)
Current
Grok 3
February 2025
Proven
Current
Grok-2
August 2024
Last generation before Grok 3
Superseded by Grok 3 and newer
Deprecated
Grok-1 (Open Weights)
March 2024
Open Weights
Deprecated

Use Cases

Typical applications for this model

Social Media Analysis
Trend Monitoring
Content Creation
Image Generation
Open-Source Research (Grok-1)

Technical Details

API, features and capabilities

API & Availability
Availability Public (EU available)
Features & Capabilities
Tool Use Function Calling Structured Output Vision Web Browsing File Upload
Training & Knowledge
Knowledge Cutoff 2025-12
Fine-Tuning Not available
Language Support
Best Quality English
Supported 50+ Languages
Best Quality in English

Hosting & Compliance

GDPR-compliant hosting options and licensing

GDPR-Compliant Hosting Options
Azure
West Europe / Germany
Azure AI Foundry
License & Hosting
License Apache 2.0 (Grok-1 only)
Security Filters Minimal
Enterprise Support Yes
Cloud Only On-Premise

Benchmarks

Performance comparison with standardized tests

MMLU
89.5
LiveCodeBench
79.4
SWE-Bench
75
GPQA
88

innFactory AI Consulting from Rosenheim advises companies in the DACH region (Germany, Austria, Switzerland) on GDPR-compliant integration of xAI Grok. With the General Availability of Grok 4.3 (May 2026) and the new coding-specialised Grok Build 0.1 (29 May 2026), xAI has significantly expanded its portfolio in early summer 2026.

New in May/June 2026

Grok 4.3 is now Generally Available

Following the beta launch in April 2026, xAI moved Grok 4.3 into general rollout in early May 2026. In addition to multimodal video understanding and the 1M token context window, the GA version brings:

  • Custom Skills: persistent expertise (formatting rules, workflow steps, document styles) that Grok automatically applies across every conversation
  • Integrated code execution environment: Grok can write code, run it, install dependencies, and produce real files

Grok Build 0.1: Dedicated Coding Model

On 29 May 2026, xAI released Grok Build 0.1 in public beta on the xAI API. The model is specifically trained for agentic coding:

  • 256K token context window
  • Text and image input (e.g. UI mockups, architecture diagrams, error screenshots)
  • Native MCP support and integration with Cursor, Kilo Code, OpenCode and others
  • Inference speed > 100 tokens/second
  • Pricing: $1 per 1M input tokens, $2 per 1M output tokens, cache read $0.20 per 1M tokens

Technical Strengths of Grok 4.3

Extended Context Processing

With a context window of 1 million tokens, Grok 4.3 ranks among the most powerful models for processing extensive documents. For enterprises, this means:

  • Complete analysis of entire codebases without splitting
  • Processing comprehensive contracts and technical documentation in one pass
  • Consistent analysis of long conversation histories and protocols

Benchmark Results

Current performance tests show strong results:

  • MMLU (Multitask Understanding): 89.5% - on par with leading models
  • LiveCodeBench: 79.4% with tool use - surpasses many established competitors
  • SWE-Bench (Software Engineering): 75% - leading in real-world coding tasks
  • GPQA (Graduate Science): 88% - outstanding in scientific questions

Agentic Capabilities

Grok 4.3 offers extended capabilities for autonomous multi-step tasks:

  • Reduced hallucination rate by 65% compared to predecessors
  • Improved generalization of programming logic across language boundaries
  • Native integration of web and X search for current information

EU Availability and GDPR Compliance

Azure AI Foundry

Grok 4 and Grok 4.20 Beta are available via Microsoft Azure AI Foundry in EU regions, including Germany (West Europe, Germany). This enables DACH enterprises to:

  • Process data within the EU for GDPR compliance
  • Utilize enterprise-grade security features through Azure
  • Integrate into existing Microsoft infrastructures

Limitations with Cloud Providers

Important: Grok models are currently not available on:

  • AWS Bedrock (including Frankfurt/eu-central-1)
  • Google Vertex AI (including Frankfurt/europe-west3)

This limits flexibility for companies already heavily invested in these platforms.

Pricing

xAI offers competitive pricing across the entire model family:

Grok 4.3 (Flagship):

  • Input: $1.25 per 1M tokens
  • Output: $2.50 per 1M tokens
  • Context window: 1M tokens

Grok Build 0.1 (Coding):

  • Input: $1.00 per 1M tokens
  • Output: $2.00 per 1M tokens
  • Cache read: $0.20 per 1M tokens
  • Context window: 256K tokens

Grok 4.1 Fast / Grok 4.20 Beta:

  • Input: $0.20 per 1M tokens
  • Output: $0.50 per 1M tokens
  • Context window: 2M tokens

Grok 4 (Standard):

  • Input: $3.00 per 1M tokens
  • Output: $15.00 per 1M tokens
  • Context window: 128K tokens

Tool Calls:

  • Web/X search, code execution: $5 per 1,000 calls
  • Batch API: 50% discount for asynchronous processing

Use Cases for DACH Enterprises

Technical Analysis

  • Software engineering tasks with complete codebase understanding
  • Automated code reviews and refactoring suggestions
  • Technical documentation analysis

Scientific Applications

  • Processing extensive research documents
  • STEM-related questions and calculations
  • Graduate-level scientific analysis

Social Media and Trend Monitoring

  • Integration with X/Twitter for real-time data analysis
  • Content creation with current context
  • Trend identification and market observation

Critical Assessment

Ethical and Practical Concerns

Despite technical strengths, concerns about Grok persist:

  • Controversies: Connection to Elon Musk and political positions
  • Minimal security filters: Can be problematic for regulated industries
  • Platform dependency: Only Azure, no multi-cloud strategy possible
  • Knowledge cutoff: Trained until December 2025, current only via web search

Better Alternatives for Enterprise Use

For professional applications in the DACH region, we often recommend:

ApplicationAlternative
General LLM TasksAnthropic Claude
Code GenerationOpenAI GPT-4
Open Source & FlexibilityMeta Llama or Qwen
GDPR-Compliant SolutionCompanyGPT

Integration with CompanyGPT

Our CompanyGPT solution offers a GDPR-compliant alternative that combines various models while ensuring the highest data protection standards. For companies that want to stay on the safe side, this is often the better choice.

Outlook

Following the GA of Grok 4.3 and the launch of Grok Build 0.1 at the end of May 2026, Elon Musk has announced that further models (including Grok V9-Medium with 1.5 trillion parameters) are already trained and expected to ship during summer 2026. Grok 5, with a targeted 6-10 trillion parameters on the Colossus 2 supercluster, is expected by xAI in Q2/Q3 2026.

Our Recommendation

Grok 4.3 is technically impressive, especially due to its 1M token context window, Custom Skills, and integrated code execution. For specific use cases such as extensive code analysis or scientific research, the model can be useful. For pure coding workloads, the significantly cheaper Grok Build 0.1 is the better choice.

However: The ethical concerns, minimal security filters, and limitation to Azure make Grok a risky choice for many companies in regulated environments. For business-critical applications, we recommend established alternatives with stricter governance standards.

For individual consulting on the appropriate AI strategy and GDPR-compliant implementation, contact innFactory AI Consulting.

Cost estimation for this model

For up-to-date token pricing, model variants and EU availability, see our sister project ai-prices.eu. It helps you compare and estimate the operational cost of leading AI models for your specific use case.

Compare prices on ai-prices.eu

ai-prices.eu is a project by innFactory AI Consulting GmbH and provides transparent cost estimates for leading AI models.

Consultation for this model?

We help you select and integrate the right AI model for your use case.