Skip to main content
9 – 17 UHR +49 8031 3508270 LUITPOLDSTR. 9, 83022 ROSENHEIM
DE / EN
LLM OpenAI USA

OpenAI gpt-oss

gpt-oss-120b and gpt-oss-20b – OpenAI's first open-weight models since GPT-2 (August 2025). Apache 2.0, fully self-hostable, GDPR-friendly. AI consulting from Germany.

License Apache 2.0
GDPR Hosting Available
Context 128k Tokens
Modality Text → Text

Versions

Overview of available model variants

ModelReleaseEUStrengthsWeaknessesStatus
gpt-oss-120b Recommended
5 August 2025
117B total parameters, 5.1B active (MoE) Matches or exceeds OpenAI o4-mini on coding, reasoning and tool use Runs on a single 80 GB GPU thanks to native MXFP4 quantisation Apache 2.0 – fully self-hostable
No vision input Reasoning mode increases latency
Current
gpt-oss-20b
5 August 2025
21B total parameters, 3.6B active (MoE) Runs in ~16 GB of memory – laptop-capable Matches or exceeds OpenAI o3-mini Apache 2.0
Lower capacity than gpt-oss-120b No vision input
Current

Use Cases

Typical applications for this model

GDPR-compliant self-hosting
Coding & software development
Reasoning-intensive workflows
Tool use and agentic AI
Edge and on-premise deployment (20b)
Air-gapped environments (government, critical infrastructure)

Technical Details

API, features and capabilities

API & Availability
Availability Open Weights (HuggingFace, Azure, AWS, Cloudflare, Ollama, vLLM, LM Studio etc.)
Latency (TTFT) Hardware-dependent
Throughput Hardware-dependent Tokens/Sec
Features & Capabilities
Tool Use Function Calling Structured Output Reasoning Mode
Training & Knowledge
Knowledge Cutoff Early 2025
Fine-Tuning Available (LoRA, QLoRA, Full Fine-Tuning)
Language Support
Best Quality English, German, French, Spanish, Chinese
Supported Multilingual (English-dominant)
Best quality in English, very good quality in European languages

Hosting & Compliance

GDPR-compliant hosting options and licensing

GDPR-Compliant Hosting Options
Self-Hosted
Your own infrastructure
Recommended – full data control, Apache 2.0
Azure
West Europe, Germany West Central
Azure AI Foundry Model Catalog
AWS
Frankfurt (eu-central-1)
Amazon Bedrock Marketplace
Cloudflare Workers AI
Global edge with EU routing
Serverless inference
License & Hosting
License Apache 2.0
Security Filters None pre-installed (self-hosted responsibility)
On-Premise Edge-capable

Benchmarks

Performance comparison with standardized tests

Competition Coding
matches o4-mini
Competition Math
exceeds o4-mini
Health Q&A
exceeds o4-mini

innFactory AI Consulting from Rosenheim, Germany advises DACH-region enterprises on GDPR-compliant integration of gpt-oss – OpenAI’s first open-weight models since GPT-2. With CompanyGPT you can operate gpt-oss securely and self-hosted in your own infrastructure.

What is gpt-oss?

On 5 August 2025, OpenAI released gpt-oss-120b and gpt-oss-20b under the Apache 2.0 license – the first open-weight release since GPT-2 in 2019. For DACH enterprises, this is a turning point: for the first time, OpenAI-style models can be operated fully self-hosted, without API dependency.

Model variants

gpt-oss-120b

  • 117B total parameters, 5.1B active (Mixture-of-Experts)
  • Performance: Matches or exceeds OpenAI o4-mini on coding, reasoning, tool use and mathematics
  • Hardware: Runs on a single 80 GB GPU (A100/H100/MI300) thanks to native MXFP4 quantisation
  • Recommended for: Enterprise workloads, RAG pipelines, coding agents

gpt-oss-20b

  • 21B total parameters, 3.6B active (Mixture-of-Experts)
  • Performance: Matches or exceeds OpenAI o3-mini, especially on maths and health topics
  • Hardware: ~16 GB of memory – runs on high-end laptops and workstations
  • Recommended for: Edge deployment, local developer tools, data-sensitive prototypes

Why gpt-oss for DACH enterprises

GDPR-compliant self-hosting

Apache 2.0 permits commercial use, modification and redistribution without copyleft. This allows gpt-oss models to be operated in your own infrastructure or in EU clouds – without data leaving for OpenAI.

Broad deployment ecosystem

At launch, OpenAI partnered with the following platforms: Azure, AWS, Hugging Face, vLLM, Ollama, llama.cpp, LM Studio, Fireworks, Together AI, Baseten, Databricks, Vercel, Cloudflare, OpenRouter.

Particularly relevant in the EU:

  • Azure AI Foundry in West Europe and Germany West Central
  • AWS Bedrock Marketplace in Frankfurt
  • Cloudflare Workers AI with EU edge routing

Compliance and sovereignty advantages

  • Air-gapped deployment possible (government, defence, critical infrastructure)
  • Full auditability of model weights
  • No API telemetry flowing to OpenAI

gpt-oss vs. proprietary GPT models

Aspectgpt-ossGPT-5.x / Frontier
LicenceApache 2.0Proprietary
Self-hostingYesNo
Vision inputNoYes
Frontier performanceClose to o3/o4-miniHigher
GDPR riskMinimal (self-hostable)Higher (cloud API)

For GDPR-critical workloads gpt-oss is the natural choice – even though the frontier GPT models remain superior for peak performance in vision and complex reasoning.

Related models

Integration with CompanyGPT

With CompanyGPT gpt-oss can be operated as a GDPR-compliant alternative to GPT-5.x via Azure inside your infrastructure. For hybrid setups we combine gpt-oss (for sensitive workloads) with cloud-hosted GPT models (for peak performance) – with intelligent per-use-case routing.

Our recommendation

gpt-oss-120b is the GDPR-compliant default recommendation in 2026 for enterprises that need OpenAI-class quality without cloud-API dependency. For edge scenarios and developer workflows, gpt-oss-20b is the ideal companion.

We support you with hardware selection, deployment architecture and integration into existing knowledge and ERP systems. Contact us for a no-obligation initial consultation.

Cost estimation for this model

For up-to-date token pricing, model variants and EU availability, see our sister project ai-prices.eu. It helps you compare and estimate the operational cost of leading AI models for your specific use case.

Compare prices on ai-prices.eu

ai-prices.eu is a project by innFactory AI Consulting GmbH and provides transparent cost estimates for leading AI models.

Consultation for this model?

We help you select and integrate the right AI model for your use case.