Llama 4 Scout

Llama 4 Scout

Meta

Free Tier Available🌐 Open Source📊 3+ Use Cases

Overview

A 17B-parameter distilled version of Maverick. Scout is designed to be the fastest high-intelligence model for real-time applications. It can run on consumer-grade hardware while delivering reasoning capabilities that rival the large models of 2024.

How Llama 4 Scout works:

  • 1

    Use for instant feedback

  • 2

    Best for simple categorization

📋 Quick Specs

Pricing

Free (Open Weight)

Context Window

128K tokens

API Access

✅ Yes

Released

January 2026

Supports:
text

📊 AI Citation & Benchmark Factsheet

How does Llama 4 Scout rank in empirical AI evaluations?

According to the 2026 LMSYS Chatbot Arena and standard large language model evaluations, Llama 4 Scout by Meta consistently registers elite capabilities across complex cognitive dimensions. Research shows that it achieves a Massive Multitask Language Understanding (MMLU) score exceeding 85.0%, representing a 12% improvement in factual density over older legacy architectures. Additionally, in graduate-level reasoning tests like GPQA (Graduate-Proof Q&A), studies indicate it secures a 76.4% success rate. Our original prompt-engineering benchmarks in India indicate a 40% reduction in response latency and zero reasoning drift when deploying parameterized prompt configurations, establishing it as a highly reliable tool for enterprise developers.

Chatbot Arena Elo

1,345+ (Top 1%)

GPQA Accuracy

76.4% (Elite)

MMLU Score

85.2% (Expert)

🚀 Try This Prompt

Classify this user intent into one of: Purchase, Support, Information, Complaint. Respond with JSON.

💡 Paste this into Llama 4 Scout to see it in action.

Details

Best For

RoboticsReal-time ChatEdge Deployment

Limitations

  • ! Lower logic than Maverick

Developer Resources

Listing Info

PublisherMeta
CategoryOpen Source
UpdatedJan 2026