I鈥檝e always been torn between the elegant certainty of mathematics and the beautiful complexity of biology, so I decided to master both. My obsession with healthcare data science began at IIT Delhi, where I spent my Master鈥檚 thesis translating the abstract constructs of evolution into mathematical models. That research spark eventually took me to Harvard University, building computational drug discovery frameworks for DARPA-funded projects, followed by a summer at Vertex Pharmaceuticals while finishing Data Science Degree at Columbia University. Today, I鈥檓 a Senior Data Scientist on the Experimental AI team at AbbVie, where I use complex statistical theory and AI to build enterprise-ready solutions that aid decision-making for clinical leadership. Whether it鈥檚 decoding disease at the discovery stage or building audit-ready AI solutions, I live for the moments where rigorous algorithms meet clinical innovation.

Experience across Drug Lifecycle

Industry Experience

AbbVie Inc (2023 - Present)

Anomaly Detection

Site Anomaly & Fraud Detection

Engineered GLMM-based unsupervised models to identify atypical site behavior and fraud. Deployed portfolio-wide across global trials to ensure high-fidelity data oversight.

Mixed Effects Temporal Data Hypothesis Testing ePRO Statistical Theory
Professional Patient Detection

Professional Patient Detection

Developed novel algorithms to identify duplicate or "professional" trial participants, safeguarding data integrity across international clinical sites.

Patient Embeddings Vector Similarity Statistical Testing EHR
Medical Monitoring

Medical Monitoring

Engineered ensemble models for unsupervised patient monitoring, reducing clinical review and medical escalation times by 90%.

Anomaly Detection Isolation Forests SHAP EHR

Wyss Institute, Harvard University (2019 - 2022)

AI Drug Discovery

AI Drug Discovery

Built AI-driven drug discovery framework using Markov Random Fields and multiomics data to model protein鈥搕arget interactions for DARPA-funded therapeutic projects.

Network Theory Multiomics Data Gene Expression Belief Propagation

Portfolio Projects

Something fun I've been working on

KDrama Sense

KDrama Sense

Agentic taste and theme-aware K-Drama assistant built by combining a custom enrichment pipeline and a RAG-based personalised assessment engine. Extracted and synthesised structured metadata, specialist reviews, and community discussions from TMDB, MyDramaList, and Reddit r/KDRAMA into 1,500 enriched drama profiles with LLM-derived thematic labels.

Generative AI LangChain LangGraph BERT Topic Modeling ChromaDB RAGAS TMDB API Streamlit
Trial Guard

Trial Guard

Real-time adverse event prediction system for enrolled clinical trial patients using a temporal transformer trained on 13,000+ ALS patient trajectories from the PRO-ACT database. Implements timestamp-based positional encoding and an agentic pipeline integrating SHAP-based feature attribution and CTCAE v5.0 grading criteria to generate grounded clinical alerts at each patient visit.

Agentic AI Transformers PyTorch LangGraph SHAP CTCAE v5.0 Streamlit PRO-ACT

Awards

Young Data Scientist of The Year 路 2026 路 Pistoia Alliance Recognized for driving AI innovation and impact in life sciences R&D through Advanced Anomaly Detection in Clinical Trials

Features

Research & Writing