blog - page 3 | Subhadip Mitra

All Articles

The Observer Effect in AI: When Models Know They're Being Tested - (Part 1/4)

Frontier AI models from OpenAI, Anthropic, and Google can now recognize when they're being tested. This observer effect undermines AI safety evaluation.

11 min read · September 30, 2025

2025 situational-awareness AI-evaluation frontier-models claude openai AI Safety Machine Learning Research
We Need a Consent Layer for AI (And I'm Trying to Build One)

AI companies are getting sued over training data, agents operate with no permission framework, and users can't control their AI profiles. I wrote four open standards (LLMConsent) to create a decentralized consent protocol for AI - like HTTP but for data rights, agent permissions, and user sovereignty. This is an RFC, not a product.

23 min read · August 16, 2025

2025 ai blockchain standards consent ethics protocol llm agents privacy decentralization ai standards ethics architecture
Why Kimi K2 Stands Out - A Deep Dive into Its Trillion-Parameter MoE

Explore Kimi K2’s trillion-parameter MoE architecture, MuonClip optimizer, and agentic training. Learn why it outperforms GPT-4.1 and DeepSeek-V3

9 min read · July 13, 2025

2025 genai llm llm genai
From 11% to 88% Peak Bandwidth: Writing Custom Triton Kernels for LLM Inference

A hands-on exploration of writing custom GPU kernels with OpenAI Triton, going from PyTorch's 11% bandwidth utilization to 88% on RMSNorm.

12 min read · June 15, 2025

2025 machine-learning llm inference optimization triton gpu AI Infrastructure Research
Implementing Model Context Protocol in Autonomous Multi-Agent Systems - Technical Architecture and Performance Optimization

Discover how to implement Model Context Protocol (MCP) in autonomous multi-agent systems with this technical deep dive. Learn advanced context optimization strategies, distributed architecture patterns, and performance benchmarks with complete Python implementations. Includes hypothetical telecom implementation scenarios showing potential optimization benefits.

60 min read · March 22, 2025

2025 genai architecture system-design architecture genai system-design

binary breakthroughs

Notes on building data platforms, AI systems, and the infrastructure between them

I Trained Probes to Catch AI Models Sandbagging

Why Steering Vectors Beat Prompting (And When They Don't)

Why I Built a Spark-Native LLM Evaluation Framework (And What I Learned)

The MCP Maturity Model: Evaluating Your Multi-Agent Context Strategy

From 11% to 88% Peak Bandwidth: Writing Custom Triton Kernels for LLM Inference

Making LLMs Faster: My Deep Dive into Speculative Decoding

All Articles

The Observer Effect in AI: When Models Know They're Being Tested - (Part 1/4)

We Need a Consent Layer for AI (And I'm Trying to Build One)

Why Kimi K2 Stands Out - A Deep Dive into Its Trillion-Parameter MoE

From 11% to 88% Peak Bandwidth: Writing Custom Triton Kernels for LLM Inference

Implementing Model Context Protocol in Autonomous Multi-Agent Systems - Technical Architecture and Performance Optimization