blog - page 4 | Subhadip Mitra

All Articles

Making LLMs Faster: My Deep Dive into Speculative Decoding

A deep dive into implementing speculative decoding from scratch, with benchmarks on GPT-2 and extensions to diffusion models.

13 min read · March 20, 2025

2025 machine-learning llm inference optimization AI Infrastructure Research
Engineering Autonomous Multi-Agent Systems - A Technical Deep Dive into Telecom Customer Service

Dive into the world of autonomous AI agents with practical implementations, code examples, and real-world scenarios. Learn how to build intelligent systems with advanced memory management, dynamic prompt evolution, and sophisticated monitoring capabilities in telecom customer service.

52 min read · January 05, 2025

2025 genai architecture casestudy system-design
Engineering Multi-Agent Systems - A Retail Banking Case Study

Explore a detailed technical implementation of a multi-agent system for retail banking credit assessment. Learn about agent architecture, distributed systems patterns, error handling, compliance requirements, and performance optimization through actual code examples and system diagrams. Ideal for software architects and engineers building scalable financial systems.

36 min read · December 28, 2024

2024 architecture casestudy
ETLC 2.0 - Building Context-Aware Data Pipelines

Think your data pipelines could do more than just process information? ETLC 2.0 takes data engineering to the next level with Adaptive Context, Contextual Joins, and a scalable Context Store. It's not just about moving data—it's about making it intelligent. Ready to unlock the future of data pipelines? Read on.

10 min read · December 07, 2024

2024 platform genai etlc
The End of Data Warehouses? Enter the Age of Dynamic Context Engines

Traditional data warehouses are struggling to keep up with modern demands. Enter Dynamic Context Engines (DCEs) - real-time, path-aware platforms that enrich data with context for smarter, faster decisions. Discover why they're the future of data analytics.

19 min read · November 18, 2024

2024 platform genai etlc

binary breakthroughs

Notes on building data platforms, AI systems, and the infrastructure between them

I Trained Probes to Catch AI Models Sandbagging

Why Steering Vectors Beat Prompting (And When They Don't)

Why I Built a Spark-Native LLM Evaluation Framework (And What I Learned)

The MCP Maturity Model: Evaluating Your Multi-Agent Context Strategy

From 11% to 88% Peak Bandwidth: Writing Custom Triton Kernels for LLM Inference

Making LLMs Faster: My Deep Dive into Speculative Decoding
