Skip to content
Category

AI

Deep, developer-first coverage of artificial intelligence — from frontier model releases and benchmarks to agents, RAG pipelines, and the AI-native tools changing how we ship software. No hype, just what actually matters to engineers.

Ornith-1.0: Coding Models That Train Their Own Agent Scaffolds
Article 3h ago 0

Ornith-1.0: Coding Models That Train Their Own Agent Scaffolds

By optimizing both the reasoning loop and the code output, these MIT-licensed models bring native agentic capabilities to local hardware.

Priya Nair
Google's Interactions API Shifts the Agent Orchestration Battleground

Google's Interactions API Shifts the Agent Orchestration Battleground

Article · 1w ago0
The Telemetry Trap: Why Employee Surveillance is a Bad Training Strategy

The Telemetry Trap: Why Employee Surveillance is a Bad Training Strategy

Article · 1w ago1
Why Prompt Injection Works: The Role Confusion Theory

Why Prompt Injection Works: The Role Confusion Theory

Article · 1w ago4
Shattering the Scaling Law: Inside Moebius's 0.2B Inpainting Architecture

Shattering the Scaling Law: Inside Moebius's 0.2B Inpainting Architecture

Article · 1w ago1
Running 70B Models on 4GB VRAM: The AirLLM Layer-Swap Hack

Running 70B Models on 4GB VRAM: The AirLLM Layer-Swap Hack

Article · 1w ago1
GLM 5.2 Is a Point Behind Opus — Until the Task Runs for Hours

GLM 5.2 Is a Point Behind Opus — Until the Task Runs for Hours

Article · 1w ago2
DeerFlow 2.0: ByteDance's Sandbox Runtime for Long-Horizon Agents

DeerFlow 2.0: ByteDance's Sandbox Runtime for Long-Horizon Agents

Article · 1w ago0
Google Deprecates Gemini CLI: Inside the Antigravity Agent Shift

Google Deprecates Gemini CLI: Inside the Antigravity Agent Shift

Article · 1w ago0
Apertus: True Open-Source AI for Sovereign Deployments

Apertus: True Open-Source AI for Sovereign Deployments

Article · 1w ago0
Claude Now Wants Your ID — KYC Comes to AI

Claude Now Wants Your ID — KYC Comes to AI

News · 1w ago4
Orchestrating Chaos: Dynamic Multi-Agent Workflows in Claude Code

Orchestrating Chaos: Dynamic Multi-Agent Workflows in Claude Code

Article · 1w ago0
Beyond the Demo: Engineering Reliable, Production-Grade AI Agents

Beyond the Demo: Engineering Reliable, Production-Grade AI Agents

Article · 1w ago0
When to Reject AI Code Even If It Works

When to Reject AI Code Even If It Works

Article · 1w ago0
Gemma 4 12B: The Encoder-Free Shift to Local Multimodal Agents

Gemma 4 12B: The Encoder-Free Shift to Local Multimodal Agents

Article · 1w ago0
Beyond Refusal: The Rise of Agentic AI Penetration Testing

Beyond Refusal: The Rise of Agentic AI Penetration Testing

Article · 1w ago0