News Nug
all agent alignment api update architecture benchmark business cuda dataset deployment eval evaluation fine tuning game dev hardware hype inference infrastructure library monitoring new model open source optimization plugin probe targeted prompt engineering quantization rag research rl training security tool training tutorial workflow
iOS 27 Siri is using WaveRNN and FastSpeech2 [D]
r/MachineLearning · 1d ago · 5
Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech
HuggingFace Blog · 1d ago · 5
AI Epistemic Risks: Emerging Mechanisms & Evidence [R]
r/MachineLearning · 1d ago · 5
Quoting Andrej Karpathy
Simon Willison · 1d ago · 5
What will be the next breakthrough in ASR? [D]
r/MachineLearning · 1d ago · 5
Time Series Forecasting for Agriculture/Crop Volume & Pricing – Looking for Advice [D]
r/MachineLearning · 1d ago · 5
OpenAI S-1 🇺🇸, Siri AI 📱, Xiaomi Ultraspeed ⚡
TLDR AI · 1d ago · 5
Introducing North Mini Code: Cohere’s First Model For Developers
HuggingFace Blog · 1d ago · 5
Fluid, natural voice translation with Gemini 3.5 Live Translate
DeepMind Blog · 1d ago · 5
Rick & Morty
r/LocalLLaMA · 1d ago · 5
Introducing Gemma 4 12B: a unified, encoder-free multimodal model
DeepMind Blog · 1d ago · 5
Powering the future of robotics in Europe
DeepMind Blog · 1d ago · 5
How engineers at Nextdoor use Codex to build without limits
OpenAI Blog · 1d ago · 5
Are privacy-preserving techniques actually being used in production ML systems? [D]
r/MachineLearning · 1d ago · 5
Understanding Pytorch better and Moving forward from papers [D]
r/MachineLearning · 1d ago · 5
How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces
HuggingFace Blog · 1d ago · 5
What Codex unlocks for Notion
OpenAI Blog · 1d ago · 5
Papers figures [D]
r/MachineLearning · 1d ago · 5
[AINews] FrontierCode: Benchmarking for Code Quality over Slop
Latent Space · 1d ago · 5
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute.
r/LocalLLaMA · 1d ago · 5
<12345…48>