innFactory AI Consulting, based in Rosenheim, Germany, supports enterprises across the DACH region with GDPR-compliant self-hosting of Meta Llama. With open weights you retain full control: no data leaves your infrastructure.
Key Strengths
Open Weights (Community License)
- Full Control: Model runs in your infrastructure
- No API Costs: Only hardware/cloud costs
- Customizable: Fine-tuning on your own data is possible
- GDPR-Friendly: No data leaves your company
Flexible Deployment Options
- On-Premise: Own servers or private cloud
- Edge: Local devices, smartphones
- Cloud: AWS, Azure, GCP with your own dedicated instances
Hardware Requirements
| Model | VRAM | Recommended GPU |
|---|---|---|
| Llama 4 Scout | 80+ GB | H100 / A100 |
| Llama 4 Maverick | 400+ GB | Multi-H100 |
| Llama 3.3 70B | 40+ GB | A100 80GB |
| Llama 3.2 11B | 24 GB | RTX 4090 |
| Llama 3.2 3B | 8 GB | RTX 4070 |
| Llama 3.2 1B | 4 GB | Smartphone |
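To see where the table's figures come from, here is a minimal sketch of a back-of-the-envelope VRAM estimate: model weights take roughly one byte per parameter per 8 bits of precision, plus overhead for the KV cache and activations. The 20% overhead factor is an assumption for illustration; real requirements vary with context length, batch size, and serving stack.

```python
def estimate_vram_gb(params_billion: float, bits_per_param: int = 16,
                     overhead: float = 0.2) -> float:
    """Rough VRAM estimate in GB: weights plus ~20% for KV cache/activations.

    The overhead factor is an illustrative assumption, not a guarantee.
    """
    weight_gb = params_billion * bits_per_param / 8  # 1B params at 8 bit = ~1 GB
    return round(weight_gb * (1 + overhead), 1)

# Llama 3.3 70B at fp16: the weights alone exceed a single 80 GB GPU.
print(estimate_vram_gb(70))                      # ≈ 168.0 GB
# With 4-bit quantization it fits on one A100 80GB with room to spare.
print(estimate_vram_gb(70, bits_per_param=4))    # ≈ 42.0 GB
```

This is why quantization (4-bit or 8-bit) is usually the deciding factor in whether a model fits on a single GPU or needs a multi-GPU setup.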
Integration with CompanyGPT
CompanyGPT supports Llama models and enables completely self-hosted operation without external dependencies.
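To illustrate what "no external dependencies" looks like in practice, here is a minimal sketch of calling a self-hosted Llama model over an OpenAI-compatible HTTP API, as commonly exposed by serving stacks such as vLLM or Ollama. The endpoint URL and model name are placeholder assumptions, not CompanyGPT internals.

```python
import json
from urllib import request

# Placeholder: a self-hosted inference server on your own network.
ENDPOINT = "http://localhost:8000/v1/chat/completions"

def build_chat_request(prompt: str,
                       model: str = "meta-llama/Llama-3.3-70B-Instruct") -> dict:
    """Build an OpenAI-style chat completion payload for a local Llama server."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,
    }

def ask(prompt: str) -> str:
    """Send the prompt to the local server; data never leaves your network."""
    payload = json.dumps(build_chat_request(prompt)).encode("utf-8")
    req = request.Request(ENDPOINT, data=payload,
                          headers={"Content-Type": "application/json"})
    with request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

Because the request goes to your own infrastructure rather than an external API, GDPR compliance is simplified: the prompts and responses stay under your control.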
Our Recommendation
Llama 4 Scout is ideal for companies with strict data protection requirements or high request volumes. An investment in your own infrastructure pays off from moderate usage volumes onward. For smaller deployments or edge applications, the compact Llama 3.2 models (1B/3B) are a cost-effective choice.
