Matthew Berman

MYTHOS MYTHOS MYTHOS

Jun 9, 2026 28 min
anthropicclaude 3.5ai agentsllm benchmarkingtransformer architecture
Watch on YouTube Follow Matthew Berman on Rundown — free

Summary

AI summaries can be incomplete or wrong. Verify anything important against the original video.

This video reviews the release of Anthropic's new AI model, Claude Mythos 5, analyzing its performance benchmarks, capabilities, and real-world implications.

This video explores the release of Anthropic's new AI model, Claude Mythos 5 (and its sister model, Fable 5), which are being introduced as a powerful new class of AI. The host provides a deep dive into the technical benchmarks for both models, highlighting their strong performance in coding, reasoning, and real-world task execution. The video also covers the potential, as well as the risks, of such advanced models, touching on topics like safe integration, and the implications of increased inference compute. In addition, the video includes a sponsor segment detailing a web hosting service for AI agents. Throughout the review, the creator emphasizes the capability of the Fable/Mythos model in long-running tasks, as demonstrated through complex simulations, while noting practical usage constraints, like the models' tendency to be verbose and, at times, slow. Overall, the host presents these as state-of-the-art developments in AI, while cautioning viewers to understand their specific use cases to optimize performance and costs.

Verdict

Claude Mythos 5 / Fable 5
large language model · $10 per million input tokens and $50 per million output tokens.

An incredibly capable, state-of-the-art model that is, however, expensive and sometimes slow when pushed to its limits.

Depends

Pros

  • Unmatched agentic reasoning and complex task execution capabilities 3:56
  • Highly autonomous, capable of long-horizon planning 4:08
  • Excellent in complex software development and protein design tasks 15:07

Cons

  • Significantly expensive for large-scale operations 12:30
  • The model is very verbose, increasing output costs
  • Can be slow during long, complex planning tasks

Specs

Model Size 10 trillion parameters 7:04

Compared to

  • GPT-5.5

    Claude Fable 5 generally outperforms or rivals current top-tier models in coding and agentic benchmarks.

Best for

  • Software developers
  • AI researchers
  • Infrastructure providers

Not for

  • Cost-sensitive small-scale projects
  • Real-time, low-latency applications

Key Points

  • 1:40 Breakdown of comparative benchmarks for coding, reasoning, and agentic tasks.
  • 1:53 The difference between Mythos and Fable: Mythos lacks the safety guardrails, whereas Fable includes them.
  • 12:21 Discussion on pricing models, emphasizing the $50/million output token cost for high-difficulty tasks.
  • 25:35 Demonstration of the model's capability in complex tasks like simulating a solar system from first principles.
  • Anthropic releases Claude Mythos 5 and Fable 5, models previously deemed too dangerous for general release.
  • Observation of model behavior, specifically its verbosity and the tendency for long planning phases.

Worth watching if: You are interested in the latest developments in large language models, specifically how Anthropic is approaching model safety, agentic reasoning, and complex, multi-step AI task execution.

Get every Matthew Berman video extracted like this

One daily email with structured extracts of every channel you follow. Free tier covers 15 videos a month.

Sign in with Google

No credit card. Free tier forever.

Watch on YouTube