Latest: Confessions vs. CoT Monitoring vs. Probes: Three Bets on Model Honesty

Type to search posts and projects to navigate

Subhadip Mitra
Output 18 papers, 8 packages
Latest ICLR 2026
Now New ventures
Subhadip Mitra
Singapore, 2026
About

Data platforms. AI systems. The infrastructure between them.

delta at Google Cloud — leading Data & AI innovation and transformation across Southeast Asia. Publishing research on multi-agent systems, inference optimization, and AI safety. Exploring what's next.

Building
ICLR 2026 paper on LLM safety. Compute primitives for orbital environments.
Exploring
New ventures at the frontier. Conversations with research labs and founders building what's next.
Writing
Notes on activation steering, circuit tracing in production, and bets on model honesty.

Research Focus

Selected Publications

View all →
  • 2026

    Spark-LLM-Eval: A Distributed Framework for Statistically Rigorous Large Language Model Evaluation

    arXiv
    PDF
  • 2026

    Quality-Diversity Evolution for Discovering Diverse Vulnerabilities in LLM Safety

    ICLR 2026 Workshop AIWILD
    PDF
  • 2026

    Field-Theoretic Memory for AI Agents: Continuous Dynamics for Context Preservation

    arXiv
    PDF
  • 2026

    Automated Rule Generation for Tiered Systems Using Multi-Stage Failure Learning

    Google, Technical Disclosure Commons
    PDF

Recent Writing

View all →

Building something interesting? Let's talk.

Subhadip Mitra