Skip to main content

Independent AI Research

Asking the questions
no one else is asking.

Weekly articles exploring the frontiers of AI — from AI-assisted development and computational neuroscience to security, developer psychology, and tool reviews.

Featured

Tool Review March 19, 2026 · 8 min read

Khaos SDK: Chaos Engineering Meets AI Agent Security Testing

Khaos SDK applies chaos engineering to AI agents — testing for prompt injection, tool misuse, and fault resilience. Here's what works and what doesn't.

All Articles

Tool Review March 19, 2026 · 8 min read

Khaos SDK: Chaos Engineering Meets AI Agent Security Testing

Khaos SDK applies chaos engineering to AI agents — testing for prompt injection, tool misuse, and fault resilience. Here's what works and what doesn't.

Experiment March 12, 2026 · 8 min read

Autoresearch: When AI Agents Become Overnight Scientists

Karpathy's autoresearch ran 700 experiments in two days. I dissected the loop, tested its limits, and found the 2.8% hit rate nobody's talking about.

Deep Dive March 5, 2026 · 12 min read

EchoLeak: Zero-Click Exfiltration Through Microsoft 365 Copilot

One email turned Microsoft 365 Copilot into a data exfiltration tool — no clicks, no user interaction. The attack bypasses every defense Microsoft built.

Deep Dive February 26, 2026 · 12 min read

The Homogenization Engine: How LLMs Are Shrinking Cognitive Diversity

AI makes every individual more creative. But groups using AI produce less diverse ideas. The cost is collective, invisible, and measurable.

Deep Dive February 19, 2026 · 11 min read

DeepSeek Writes Worse Code When You Mention Tibet or Taiwan

CrowdStrike found that political trigger words increase DeepSeek-R1's vulnerability rate by 50%. The implications go far beyond one model.

Deep Dive February 12, 2026 · 12 min read

Context Windows Are a Lie: How LLMs Actually Use Long Context

Models claim 1M tokens. Research shows they struggle past 32K. Here's what 'lost in the middle' means for developers stuffing codebases into prompts.

Deep Dive February 5, 2026 · 12 min read

Hybrid RNN-Attention: Efficiency Gains Are Real, Revolution Isn't

Hybrid architectures deliver up to 8x inference speedup, but no model has proved the concept at frontier scale. An optimization, not a paradigm break.

Experiment January 29, 2026 · 13 min read

LLM-Generated Passwords Are Far Weaker Than They Look

I generated passwords across seven LLMs — from Gemini 1.5 to GPT-5.4 — and measured their entropy. Centuries to crack? Try hours.

Deep Dive January 22, 2026 · 11 min read

Clinejection: When a GitHub Issue Title Owns Your Pipeline

A GitHub issue title compromised Cline's CI/CD pipeline, stole npm tokens, and pushed malware to 4,000 devs. The first AI supply chain attack.

Deep Dive January 15, 2026 · 12 min read

The Developer's Dopamine Loop: Why AI Autocomplete Is Addictive

AI code suggestions work like slot machines — variable rewards, dopamine hits, and a feedback loop that's reshaping how developers think and learn.

Experiment January 8, 2026 · 9 min read

The Invisible Prompt: Hunting Hidden LLM Instructions on the Web

Microsoft found 50+ hidden AI instructions in commercial web pages. I built a detection pipeline, replicated the attacks, and scanned live sites.

Deep Dive January 1, 2026 · 10 min read

LLMs Hallucinate Packages. Attackers Are Registering Them.

AI coding tools invent package names that don't exist — and 43% of those names appear consistently across sessions. Attackers are registering them.

Quick Byte August 2, 2025 · 4 min read

Agentic Coding Tools: The Top Ten Ranked and Reviewed

A ranked breakdown of ten agentic coding tools — from Continue and Cline to Claude Code and Cursor — scored on autonomy, context, and friction.

Quick Byte July 15, 2025 · 5 min read

Wired Like a Brain: Neuromorphic Hardware's AI Future

Neuromorphic chips mimic biological neurons in silicon, delivering 25-1000x energy savings over GPUs and opening radical new possibilities for AI hardware.

Newsletter

Get Brain Bytes in your inbox

Weekly articles on AI, development, and the questions no one else is asking. No spam.