NewsNewestAskShowJobs Open on GitHub

Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult

(simonw.substack.com)

4 points | by hackthegibson2 5 hours ago

1 comments

  • ChrisArchitect 3 hours ago
    Discussion: https://news.ycombinator.com/item?id=46037637