LLM News Digest

Tag: prompt-engineering

Many-Shot CoT-ICL: Making In-Context Learning Truly Learn
Accessible

Tsz Ting Chung, Lemao Liu, Mo Yu, Dit-Yan Yeung

The paper challenges conventional wisdom about many-shot in-context learning on reasoning tasks. Adding more examples helps on simple tasks, but on reasoning tasks scaling is unstable, and retrieving examples by semantic similarity actually hurts performance. Example order also matters more than previously thought. These findings bear directly on how you structure prompts and manage context in reasoning-heavy production systems.

Takeaways
  • Many-shot scaling rules that hold for non-reasoning tasks don't carry over to reasoning tasks, where adding examples can degrade performance.
  • Semantic similarity poorly predicts procedural compatibility in chain-of-thought reasoning.
  • Example ordering significantly impacts performance and requires careful consideration in production prompt design.
from May 18, 2026 · via api-hf · arXiv:2605.13511
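One way to act on the ordering takeaway is to measure how much accuracy moves when the same demonstrations are shuffled. The sketch below is illustrative only and is not the paper's method: the `Demo` type, the `Q:/Reasoning:/A:` prompt layout, and `model_fn` (any callable that takes a prompt string and returns an answer string) are all assumptions introduced here.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Demo:
    """One chain-of-thought demonstration (hypothetical structure)."""
    question: str
    rationale: str
    answer: str


def build_prompt(demos, query):
    """Concatenate demonstrations in the given order, then append the query."""
    blocks = [
        f"Q: {d.question}\nReasoning: {d.rationale}\nA: {d.answer}"
        for d in demos
    ]
    blocks.append(f"Q: {query}\nReasoning:")
    return "\n\n".join(blocks)


def sample_orderings(demos, n_orderings, seed=0):
    """Return n_orderings reproducible shuffles of the same demonstrations."""
    rng = random.Random(seed)
    orderings = []
    for _ in range(n_orderings):
        shuffled = list(demos)
        rng.shuffle(shuffled)
        orderings.append(shuffled)
    return orderings


def accuracy_by_ordering(demos, eval_set, model_fn, n_orderings=5, seed=0):
    """Score each ordering on eval_set, a list of (query, gold_answer) pairs.

    model_fn is an assumed callable: prompt string in, answer string out.
    """
    scores = []
    for ordering in sample_orderings(demos, n_orderings, seed):
        correct = sum(
            model_fn(build_prompt(ordering, query)).strip() == gold
            for query, gold in eval_set
        )
        scores.append(correct / len(eval_set))
    return scores
```

A wide spread in the returned scores would suggest the prompt is order-sensitive on that evaluation set, in which case the ordering is worth fixing and versioning alongside the prompt rather than leaving to retrieval order.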
Introducing the OpenAI Safety Bug Bounty program
Intermediate

OpenAI's new bug bounty program specifically targets AI safety issues, including prompt injection, agentic vulnerabilities, and data exfiltration, signaling that these attack vectors are now mainstream security concerns. For production teams, this validates that AI-specific security testing should be part of standard security practice, not an afterthought.

Takeaways
  • AI-specific vulnerabilities like prompt injection and agentic exploits are now recognized as legitimate security concerns requiring dedicated testing.
  • Production AI systems need security models that account for both traditional software vulnerabilities and novel AI attack vectors.
from Mar 29, 2026 · via rss-openai