News Nug

r/LocalLLaMA · 19h ago · 7 · tool tutorial prompt engineering inference

Guide to using BLOOMZ-P3, a multilingual instruction-following model fine-tuned on 46 languages, with practical examples showing how to deploy it via Transformers, vLLM, SGLang, and Docker. Includes prompt engineering best practices for optimizing zero-shot task performance across languages, emphasizing clear prompt structure and contextual framing.

Quoting Kenton Varda

Simon Willison · 2d ago · 6 · workflow prompt engineering

A technical leader shares practical experience with AI-generated commit messages and PR descriptions, finding them ineffective because they describe low-level code details rather than high-level intent. This highlights a real workflow problem when using AI for documentation tasks and the importance of understanding context in code review practices.

Agentic safety triggers aren't textual safety triggers — MCP attacks that beat SOTA guardrails more than half the time (code + dataset) [R]

r/MachineLearning · 2d ago · 8 · research agent fine tuning tool prompt engineering

Research demonstrates a critical gap in LLM safety alignment: current text-classification-based guardrails fail against adversarial prompts that encode attacks in tool-call sequences rather than linguistic markers. The study evaluates multiple safety approaches (DPO, SafeDPO, training-free methods) against CVE-based attacks on MCP-enabled agents, showing current SOTA methods only block ~48% of attacks while training-free approaches achieve 3x baseline refusal rates without fine-tuning.

LLM agents patch security bugs, pass all tests, but still leave the vulnerability open [R]

r/MachineLearning · 39d ago · 9 · benchmark agent research prompt engineering

CVE-Bench is a rigorous benchmark evaluating five frontier models on real-world vulnerability patching across 18 Python projects with 300 runs in sandboxed containers, scored against maintainer-derived test cases. The study identifies three distinct failure modes (wrong-search drift, budget exhaustion, correct-file-wrong-gadget) and confirms statistically significant cross-family performance gaps (OpenAI vs Laguna, p<0.05) while showing within-family differences are noise; locating vulnerabilities without explicit advisories proves the hardest condition, with all models dropping performance.

The solution might be cancelling my AI subscription

Simon Willison · 41d ago · 5 · workflow prompt engineering

A thoughtful essay examining how AI coding agents can paradoxically reduce productivity by making project creation frictionless, leading to abandoned side projects and attention fragmentation. The post explores the psychological challenge of managing AI tools' efficiency and includes discussion of how some ADHD users find agents helpful for focus, presenting a nuanced perspective on workflow management with AI.

Qwen 3.7 🤖, Cursor Composer 2.5 👨‍💻, Anthropic acquires Stainless 🛠️

TLDR AI · 42d ago · 7 · rag prompt engineering tutorial deployment

A technical guide covering RAG (Retrieval-Augmented Generation) implementation patterns, including code snippets, prompt templates, and production anti-patterns for scaling AI-powered search systems. Provides practical patterns and ready-to-use prompt contracts for building reliable RAG applications.

Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code

r/LocalLLaMA · 42d ago · 6 · prompt engineering agent open source

A Java testing library (jqwik) embedded a prompt injection attack designed to sabotage AI coding agents, raising important security considerations for engineers integrating third-party tools into AI-powered workflows. The incident highlights vulnerabilities in how AI agents parse library outputs and the need for defensive practices when using open-source dependencies with autonomous code tools.

Claude Opus 4.8: "a modest but tangible improvement"

Simon Willison · 43d ago · 9 · new model api update inference prompt engineering agent

Claude Opus 4.8 release brings improved honesty/uncertainty flagging (4x reduction in unsupported claims), mid-conversation system messages for better prompt caching in agentic loops, and lower prompt cache minimums (1,024 tokens down from 1,024). Same pricing as 4.7 ($5/$25 per million tokens) with January 2026 knowledge cutoff and 1M context window.

Microsoft Copilot Cowork Exfiltrates Files

Simon Willison · 46d ago · 7 · agent security prompt engineering

Microsoft Copilot Cowork contained a critical security vulnerability where agentic systems could exfiltrate files through unapproved email messages with external image requests and pre-authenticated OneDrive links. This highlights a major design challenge in building safe autonomous agents: preventing prompt injection attacks from enabling data theft while maintaining agent autonomy.

Quoting Armin Ronacher

Simon Willison · 47d ago · 6 · prompt engineering workflow

Armin Ronacher discusses a growing problem in open-source development where AI-generated issue reports obscure actual user observations with confident but often inaccurate interpretations, making debugging harder. The post highlights practical friction when LLMs are used to process and reword user problems without preserving the original observed facts.

Alignment: Higher order prioritizing over constraints [R]

r/MachineLearning · 49d ago · 6 · research prompt engineering

A researcher observes that transformer models' inherent drive to predict accurate tokens ("clarity-seeking") can prioritize semantic coherence over safety constraints when discussing higher-order topics, potentially explaining constraint bypass behaviors. This touches on model alignment and interpretability but lacks technical depth, experimental validation, or concrete mechanisms.

Is personalized AI memory actually a problem worth solving or am I just coping[D]

r/MachineLearning · 49d ago · 6 · rag workflow prompt engineering

Reddit discussion proposing a personalized cognitive profiling system that tracks not just facts but learning patterns, struggling points, and effective explanation styles to improve LLM context retrieval over time. The idea combines dynamic profiling with RAG-like personalization to create an evolving understanding of how individual users think, rather than basic chat memory.

LLMs are just giant probability machines pretending to think [P]

r/MachineLearning · 49d ago · 7 · tutorial prompt engineering

A hands-on explanation of LLM architecture breaking down how token prediction works through embeddings, positional encoding, attention, and the LM Head—using a simple 4-sentence example to illustrate why models predict contextually appropriate tokens. Demystifies transformer mechanics by focusing on the core probability matching problem rather than advanced concepts, making it accessible for engineers learning from first principles.

CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution [R]

r/MachineLearning · 52d ago · 8 · agent prompt engineering research open source

CANTANTE is a novel framework that automates multi-agent LLM system configuration by solving the credit assignment problem, allowing per-agent prompt optimization from global task rewards rather than manual tuning. The approach outperforms DSPy baselines (GEPA, MIPROv2) by 12-19 points on standard benchmarks while maintaining inference costs, with open-source code available.

Do you agree with Judea that learning from data is not everything? [D]

r/MachineLearning · 56d ago · 6 · research prompt engineering

Judea Pearl discusses fundamental mathematical limits of pure data-driven learning, arguing that causal inference cannot be derived from correlation alone and that machine learning's overreliance on tabula rasa and neural network paradigms ignores proven constraints. The post raises important conceptual limitations software engineers should understand when building ML systems, though it's more philosophical framework than actionable technical guidance.

arXiv implements 1-year ban for papers containing incontrovertible evidence of unchecked LLM-generated errors, such as hallucinated references or results. [N]

r/MachineLearning · 57d ago · 6 · workflow prompt engineering

arXiv moderator Thomas Dietterich clarifies the platform's Code of Conduct regarding AI-generated content in academic papers, emphasizing author responsibility for all submitted material regardless of generation method. The post outlines specific penalties (1-year ban + peer-review requirement) for papers with evidence of unchecked LLM outputs, with concrete examples like hallucinated references and meta-comments left in final submissions.

Learning, Fast and Slow: Towards LLMs That Adapt Continually [R]

r/MachineLearning · 59d ago · 8 · research fine tuning prompt engineering workflow

Fast-Slow Training (FST) combines in-context learning via optimized prompts (fast weights) with parameter updates (slow weights) to achieve 3x better sample efficiency than pure RL while reducing catastrophic forgetting and preserving model plasticity. This dual-timescale approach maintains closer alignment to base models while enabling effective continual learning across multiple tasks.

journal-adapt-writing-skill — Learn any journal's writing conventions from its published papers, then revise your manuscript to match — section by section.

GitHub Trending AI · 59d ago · 6 · tool workflow prompt engineering

journal-adapt is a framework for generating dynamic, corpus-grounded academic writing skills that adapt papers to specific journal standards by analyzing target journal papers and exemplars. It combines optional static writing rules with a dynamic layer that learns actual journal patterns, producing an auditable revision framework (dynamic_writing_skill.md) rather than auto-writing papers.

Using LLM in the shebang line of a script

Simon Willison · 60d ago · 7 · tutorial workflow prompt engineering tool

Simon Willison demonstrates practical patterns for executing LLM-generated code directly from shell scripts using shebang syntax, including examples with tool calls and YAML-defined functions. The post covers workflow techniques for integrating LLM outputs into command-line workflows and debugging with options like --td for tool inspection.

Passing Multidimensional time series to VLM [R]

r/MachineLearning · 60d ago · 6 · workflow prompt engineering

A developer seeks guidance on optimal methods for inputting multidimensional time series data alongside video to VLMs, noting that common approaches (text formatting and line chart visualization) underperformed on their task. This represents a practical workflow challenge in multimodal AI engineering with potential solutions in data representation and prompt engineering.