Latest: Confessions vs. CoT Monitoring vs. Probes: Three Bets on Model Honesty

Type to search posts and projects to navigate

About

Data platforms. AI systems. The infrastructure between them.

delta at Google Cloud — Data & AI innovation and transformation across 6 countries in Southeast Asia. Publishing research on multi-agent systems, inference optimization, and AI safety. Exploring what's next.

Building
ICLR 2026 paper on LLM safety. Compute primitives for orbital environments.
Exploring
New ventures at the frontier. Conversations with research labs and founders building what's next.
Writing
Notes on activation steering, circuit tracing in production, and bets on model honesty.

Research Focus

Selected Publications

View all →
  • 2026

    Quality-Diversity Evolution for Discovering Diverse Vulnerabilities in LLM Safety

    ICLR 2026 Workshop AIWILD
    PDF
  • 2026

    Field-Theoretic Memory for AI Agents: Continuous Dynamics for Context Preservation

    arXiv
    PDF
  • 2026

    Automated Rule Generation for Tiered Systems Using Multi-Stage Failure Learning

    Google, Technical Disclosure Commons
    PDF
  • 2026

    Predictive Cost-Benefit Routing for Multi-Tier Risk Decisioning Systems

    Google, Technical Disclosure Commons
    PDF

Recent Writing

View all →

Building something interesting? Let's talk.

Subhadip Mitra