Skip to main content
9 – 17 UHR +49 8031 3508270 LUITPOLDSTR. 9, 83022 ROSENHEIM
DE / EN
LLM AI21 Labs Israel

AI21 Jamba

AI21 Jamba2 - innovative Mamba-Transformer hybrid architecture with 256k context. Open weights (Apache 2.0) on HuggingFace. Note: Since May 2026 AI21 has strategically refocused on its Maestro orchestration platform.

License Apache 2.0 (Jamba2), Jamba Open Model License (Jamba 1.5)
GDPR Hosting Available
Context 256k Tokens
Modality Text → Text

Versions

Overview of available model variants

ModelReleaseEUStrengthsWeaknessesStatus
Jamba2 Mini Recommended
January 2026
52B parameters (12B active, MoE) 256k context window Apache 2.0
Current
Jamba2 3B
January 2026
Very efficient (3B) Apache 2.0 Edge-capable
Smaller capacity
Current
Jamba Reasoning 3B
October 2025
First Jamba reasoning model Compact (3B), runs on a laptop Context up to 1M tokens Apache 2.0
Small parameter count limits general knowledge
Current
Jamba Large 1.7
July 2025
Open weights Improved accuracy and speed over 1.6 256k context
Superseded by Jamba2 for many use cases
Current
Jamba 1.6 (Large/Mini)
March 2025
Open weights Improved reasoning and tool-use performance over 1.5
Superseded by Jamba 1.7 and Jamba2
Deprecated
Jamba 1.5 Large
August 2024
Open weights Self-hosting possible
Shut down on managed platforms
Deprecated
Jamba 1.5 Mini
August 2024
Open weights Compact
Shut down on managed platforms
Deprecated

Use Cases

Typical applications for this model

Document Analysis (long texts)
Book Summaries
Codebase Analysis
Legal Document Review
Research & Science
Multilingual Applications
Enterprise Search

Technical Details

API, features and capabilities

API & Availability
Availability Public
Requests/Min 1000
Latency (TTFT) ~300ms
Throughput ~200 Tokens/Sec
Features & Capabilities
Tool Use Function Calling Structured Output File Upload
Training & Knowledge
Knowledge Cutoff 2025 (Jamba2)
Fine-Tuning Available (Fine-tuning API, LoRA (Open Models))
Language Support
Best Quality English, Spanish, French, Portuguese, Italian, German, Arabic, Hebrew
Supported 10+ languages
Strong in English, good in Western languages, native in Hebrew

Hosting & Compliance

GDPR-compliant hosting options and licensing

GDPR-Compliant Hosting Options
AWS
Frankfurt (eu-central-1)
Amazon Bedrock
Self-Hosted
Own Infrastructure
Jamba 1.5 Open Models
License & Hosting
License Apache 2.0 (Jamba2), Jamba Open Model License (Jamba 1.5)
Security Filters Customizable
Enterprise Support Yes
SLA Available Yes
On-Premise

As AI consultants based in Rosenheim, Germany, we recommend AI21 Jamba for enterprises that need to process extremely long documents. With a 256k token context window, you can analyze entire books at once.

Important note (as of June 2026): In May 2026, AI21 Labs executed a strategic pivot and laid off more than 60% of its workforce. The company is now focusing on its Maestro orchestration platform for enterprise AI agents and is scaling back the sale of Jamba models as a standalone API product. However, the open weights remain available under Apache 2.0 on HuggingFace and can be self-hosted. For new production projects relying on a hosted API, we currently recommend evaluating alternative providers or self-hosting the Jamba2 models directly.

Innovative Architecture

AI21 Labs is an Israeli AI company that developed a unique hybrid architecture with Jamba: the combination of Mamba (State Space Model) and Transformer.

Why Jamba for Enterprises?

  • 256k Context: One of the longest contexts on the market
  • Efficient: MoE architecture reduces resource requirements
  • Open Models: Jamba 1.5 under the Jamba Open Model License, Jamba 1.6 and Jamba2 under Apache 2.0
  • Multilingual: Strong support for many languages
  • EU-Available: Via AWS Bedrock Frankfurt

Key Strengths

Mamba-Transformer Hybrid

Jamba combines two architectures:

  • Mamba Layers: Efficient processing of long sequences
  • Transformer Layers: Precise attention for details
  • MoE: Mixture-of-Experts for efficiency

This combination enables:

  • 3x faster inference on long contexts
  • 2x less memory requirements
  • Linear instead of quadratic scaling

Extremely Long Context

256,000 tokens mean:

  • ~640 pages of text at once
  • Entire books to analyze
  • Complete codebases to understand
  • Comprehensive legal contracts to review

Open-Source Option

Jamba 1.5 is available as open source:

  • Jamba Open Model License
  • Self-hosting possible
  • Full control over data
  • Community-driven

Hardware Requirements (Self-Hosted)

ModelVRAMRecommended GPU
Jamba2 Mini48 GBA100 80GB
Jamba 1.5 Large160 GBMulti-A100
Jamba 1.5 Mini24 GBRTX 4090

Comparison to Other Models

FeatureJamba2GPT-4Claude 3
Context256k128k200k
ArchitectureHybridTransformerTransformer
Open SourceYes (1.5/1.6/2)NoNo
MoEYesNoNo

Integration with CompanyGPT

AI21 Jamba can be integrated in CompanyGPT - ideal for enterprises with extensive document collections.

Our Recommendation

AI21 Jamba2 Mini as an open-weights model under Apache 2.0 remains our recommendation for self-hosted document analysis with very long texts. If you regularly work with very long documents (legal contracts, book manuscripts, code reviews) and need full data sovereignty, Jamba2 on your own infrastructure is an excellent choice.

Due to AI21’s strategic shift to the Maestro platform, we advise caution for API-centric new projects: either commit to self-hosting (HuggingFace, AWS Bedrock) or evaluate alternatives. For general cloud-API applications without a specific focus on long contexts, we recommend Google Gemini (up to 1M context) or OpenAI GPT.

Cost estimation for this model

For up-to-date token pricing, model variants and EU availability, see our sister project ai-prices.eu. It helps you compare and estimate the operational cost of leading AI models for your specific use case.

Compare prices on ai-prices.eu

ai-prices.eu is a project by innFactory AI Consulting GmbH and provides transparent cost estimates for leading AI models.

Consultation for this model?

We help you select and integrate the right AI model for your use case.