News Nug
ICML qr code visible [D]
r/MachineLearning · 1d ago · 5
A system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]
r/MachineLearning · 1d ago · 5
Redeploying Fable 5 (Announcements, Jun 30, 2026): Fable 5 returns globally July 1. We're also proposing an industry-wide framework for scoring jailbreak severity, together with Amazon, Microsoft, Google, and other Glasswing partners.
Anthropic Blog · 1d ago · 5
AIEWF Daily Dispatch: Loops, Software Factories & Forward Deployed Engineers
Latent Space · 1d ago · 5
Anyone looking into the new MARS2 Workshop/Competition @ ECCV 2026? I saw Tec-do posting it. [D]
r/MachineLearning · 2d ago · 5
[AINews] Sonnet 5 today, and Fable 5 tomorrow
Latent Space · 2d ago · 5
[D] Monthly Who's Hiring and Who wants to be Hired?
r/MachineLearning · 2d ago · 5
80TB+ of astronomy for the HDD-poor: crossmatch the Universe from your laptop [R]
r/MachineLearning · 2d ago · 5
REAP: Automatic Curation of Coding Agent Benchmarks from Interactive Production Usage [R]
r/MachineLearning · 2d ago · 5
Forward Deployed Engineers and the future of software engineering
Latent Space · 2d ago · 5
Hugging Face and Cerebras bring Gemma 4 to real-time voice AI
HuggingFace Blog · 2d ago · 5
Quoting Anthropic
Simon Willison · 2d ago · 5
Ahmad Osman on why local AI is catching up
Latent Space · 2d ago · 5
Nano Banana 2 Lite
Simon Willison · 2d ago · 5
What's new in Claude Sonnet 5
Simon Willison · 2d ago · 5
Claude Science, an AI workbench for scientists, is now available (Announcements, Jun 30, 2026): Claude Science is a customizable app that integrates the tools and packages researchers most often use, produces auditable artifacts, and provides flexible access to computing resources.
Anthropic Blog · 2d ago · 5
Introducing Claude Sonnet 5 (Product, Jun 30, 2026): Sonnet 5 delivers frontier performance across coding, agents, and professional work at scale.
Anthropic Blog · 2d ago · 5
How to improve a 5-class Diabetic Retinopathy model (APTOS 2019) – Mixed predictions across classes [P]
r/MachineLearning · 2d ago · 5
ScarfBench: Benchmarking AI Agents for Enterprise Java Framework Migration
HuggingFace Blog · 2d ago · 5
The AI Compass
Simon Willison · 2d ago · 5