The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got ...
Researchers have uncovered a supply-chain attack that hides in Python packages, propagates like a worm, and tricks LLM-based ...
Hackers compromised 19 packages on the PyPI, collectively downloaded hundreds of thousands of times, in a new Shai-Hulud ...
Dozens of cryptographically verified open source packages from Microsoft were compromised late last week to add advanced credential-stealing code that was triggered when developers opened them in AI ...
CISA added CVE-2026-42271, a high-severity LiteLLM command injection flaw, to its KEV catalog after evidence of active ...
Users probe backup failures find Claude-assisted commits. Veteran engineer retorts: 'I did not just vibe-code 'convert test ...
Python scripts were used to test malware against endpoint detection and response agents from Sophos, CrowdStrike, and Windows ...
A threat actor is using an AI-built ransomware attack toolkit that automates Active Directory discovery and helps evade ...
A threat actor has been observed using AI coding tools to develop and refine malware designed to slip past endpoint detection ...
DeepSWE, created by DataCurve offers a benchmark for assessing AI coding models by focusing on real-world programming challenges rather than synthetic test cases. According to Matthew Berman, one of ...
Researchers who found the bug warn that its Moderate rating understates a threat reaching across LLM gateways, MCP servers and agent infrastructure.