Versions

Overview of available model variants

Model	Release	Strengths	Weaknesses	Status
Jamba2 Mini Recommended	January 2026	52B parameters (12B active, MoE) 256k context window Apache 2.0	—	Current
Jamba2 3B	January 2026	Very efficient (3B) Apache 2.0 Edge-capable	Smaller capacity	Current
Jamba Reasoning 3B	October 2025	First Jamba reasoning model Compact (3B), runs on a laptop Context up to 1M tokens Apache 2.0	Small parameter count limits general knowledge	Current
Jamba Large 1.7	July 2025	Open weights Improved accuracy and speed over 1.6 256k context	Superseded by Jamba2 for many use cases	Current
Jamba 1.6 (Large/Mini)	March 2025	Open weights Improved reasoning and tool-use performance over 1.5	Superseded by Jamba 1.7 and Jamba2	Deprecated
Jamba 1.5 Large	August 2024	Open weights Self-hosting possible	Shut down on managed platforms	Deprecated
Jamba 1.5 Mini	August 2024	Open weights Compact	Shut down on managed platforms	Deprecated

Technical Details

API, features and capabilities

API & Availability

Availability Public

Requests/Min 1000

Latency (TTFT) ~300ms

Throughput ~200 Tokens/Sec

Features & Capabilities

Tool Use Function Calling Structured Output File Upload

Training & Knowledge

Knowledge Cutoff 2025 (Jamba2)

Fine-Tuning Available (Fine-tuning API, LoRA (Open Models))

Language Support

Best Quality English, Spanish, French, Portuguese, Italian, German, Arabic, Hebrew

Supported 10+ languages

Strong in English, good in Western languages, native in Hebrew

Hosting & Compliance

GDPR-compliant hosting options and licensing

GDPR-Compliant Hosting Options

License & Hosting

License Apache 2.0 (Jamba2), Jamba Open Model License (Jamba 1.5)

Security Filters Customizable

Enterprise Support Yes

SLA Available Yes

On-Premise

As AI consultants based in Rosenheim, Germany, we recommend AI21 Jamba for enterprises that need to process extremely long documents. With a 256k token context window, you can analyze entire books at once.

Important note (as of June 2026): In May 2026, AI21 Labs executed a strategic pivot and laid off more than 60% of its workforce. The company is now focusing on its Maestro orchestration platform for enterprise AI agents and is scaling back the sale of Jamba models as a standalone API product. However, the open weights remain available under Apache 2.0 on HuggingFace and can be self-hosted. For new production projects relying on a hosted API, we currently recommend evaluating alternative providers or self-hosting the Jamba2 models directly.

Innovative Architecture

AI21 Labs is an Israeli AI company that developed a unique hybrid architecture with Jamba: the combination of Mamba (State Space Model) and Transformer.

Why Jamba for Enterprises?

256k Context: One of the longest contexts on the market
Efficient: MoE architecture reduces resource requirements
Open Models: Jamba 1.5 under the Jamba Open Model License, Jamba 1.6 and Jamba2 under Apache 2.0
Multilingual: Strong support for many languages
EU-Available: Via AWS Bedrock Frankfurt

Key Strengths

Mamba-Transformer Hybrid

Jamba combines two architectures:

Mamba Layers: Efficient processing of long sequences
Transformer Layers: Precise attention for details
MoE: Mixture-of-Experts for efficiency

This combination enables:

3x faster inference on long contexts
2x less memory requirements
Linear instead of quadratic scaling

Extremely Long Context

256,000 tokens mean:

~640 pages of text at once
Entire books to analyze
Complete codebases to understand
Comprehensive legal contracts to review

Open-Source Option

Jamba 1.5 is available as open source:

Jamba Open Model License
Self-hosting possible
Full control over data
Community-driven

Hardware Requirements (Self-Hosted)

Model	VRAM	Recommended GPU
Jamba2 Mini	48 GB	A100 80GB
Jamba 1.5 Large	160 GB	Multi-A100
Jamba 1.5 Mini	24 GB	RTX 4090

Comparison to Other Models

Feature	Jamba2	GPT-4	Claude 3
Context	256k	128k	200k
Architecture	Hybrid	Transformer	Transformer
Open Source	Yes (1.5/1.6/2)	No	No
MoE	Yes	No	No

Integration with CompanyGPT

AI21 Jamba can be integrated in CompanyGPT - ideal for enterprises with extensive document collections.

Our Recommendation

AI21 Jamba2 Mini as an open-weights model under Apache 2.0 remains our recommendation for self-hosted document analysis with very long texts. If you regularly work with very long documents (legal contracts, book manuscripts, code reviews) and need full data sovereignty, Jamba2 on your own infrastructure is an excellent choice.

Due to AI21’s strategic shift to the Maestro platform, we advise caution for API-centric new projects: either commit to self-hosting (HuggingFace, AWS Bedrock) or evaluate alternatives. For general cloud-API applications without a specific focus on long contexts, we recommend Google Gemini (up to 1M context) or OpenAI GPT.

Cost estimation for this model

For up-to-date token pricing, model variants and EU availability, see our sister project ai-prices.eu. It helps you compare and estimate the operational cost of leading AI models for your specific use case.

Compare prices on ai-prices.eu

ai-prices.eu is a project by innFactory AI Consulting GmbH and provides transparent cost estimates for leading AI models.

AI21 Jamba

Versions

Use Cases

Technical Details

Hosting & Compliance

Innovative Architecture

Why Jamba for Enterprises?

Key Strengths

Mamba-Transformer Hybrid

Extremely Long Context

Open-Source Option

Hardware Requirements (Self-Hosted)

Comparison to Other Models

Integration with CompanyGPT

Our Recommendation

Cost estimation for this model

Similar Models

SOOFI (Soofi S)

Tencent Hunyuan (Hy3)

NVIDIA Nemotron

Consultation for this model?