Gaurav Gada

Hi, I'm Gaurav, an Applied Scientist with 8+ years of experience building NLP and LLM systems at scale. Right now I'm working on ad relevance: benchmarking frontier LLMs, distilling calibrated judgments into low-latency production models, and wiring up multi-agent orchestrators that compress multi-day scientific deep-dives into a couple of hours of supervised analysis.

Before that I was a founding scientist on a content moderation and AI safety team, where I grew the science org from 1 to 5 and shipped hate speech detection, social engineering detection, and Responsible AI red-teaming workstreams for products used by hundreds of millions of people. I've filed two patents with the USPTO along the way.

These days I spend most of my time thinking about LLM evaluation, human-in-the-loop systems, and agentic patterns, and about the messy practical tradeoffs (latency, cost, hallucinations, context bloat) that decide whether any of it actually ships. Outside of work I'm a part-time musician, runner, and recovering electrical engineer.

Thanks for stopping by. Apart from deep learning (pun intended), I'm always up for a deep conversation with practitioners in the field, so drop a note if you want to talk shop.

Posts

Understanding Attention: A Code-First Journey Through Transformers

March 12, 2026

Build attention mechanisms from scratch in PyTorch. We'll start with raw tensors and progressively build to multi-head attention, explaining every reshape, transpose, and dimension along the way.

The 10% You Should Never Automate

November 8, 2025

Everyone's asking what AI can do. The better question is what you shouldn't let it do. Frameworks for deciding what to automate and what to protect.

When Should You Build an AI Agent? A Practical Decision Framework

November 5, 2025

A practical framework for deciding when AI agents make sense for your use case. Learn when to build agents and when simpler approaches like prompt engineering or RAG work better.

Mistral 7B on consumer hardware

July 21, 2024

Run Mistral 7B locally on a Mac with Ollama for fast seed data generation. Learn CLI setup, prompt formatting, and downstream parsing to generate thousands of samples on consumer hardware.

Finding the right words

July 6, 2024

Understand how LLMs choose words during generation. Learn temperature, top-k, and top-p sampling strategies to balance coherence, diversity, and task-appropriateness in generated text.

Paper Review - Embers of Autoregression

June 29, 2024

A critical review of LLM limitations in low-probability situations. Explores why AI practitioners should understand autoregressive training pressures before deploying LLMs for tasks requiring precise reasoning or uncommon patterns.

Multi-label text classification

February 16, 2024

Learn to build a multi-label text classifier with DistilBERT on imbalanced classes. Covers binary cross-entropy loss, multi-hot encoding, and practical implementation strategies for handling multiple labels.

Library version mismatches declared not safe

February 2, 2024

Critical lessons on matching Python package versions between model development and inference. Learn about the advantages of the safetensors format and why version mismatches cause production failures.

Mining word collocations

February 1, 2024

Extract common bigrams and trigrams from text using Gensim and NPMI scoring. Learn to mine jargon, phrases, and collocations from customer reviews, feedback, and text corpora.

Science Talk: Generative LLMs

September 1, 2023

A comprehensive introduction to generative LLMs covering basics, training processes, and real-world applications. Slides from a talk delivered to 70+ attendees.

Projects

Skill Quality Coach

July 20, 2022

Amazon Alexa announced Skill Quality Coach (SQC), a personalized guide that helps skill developers build high-quality skills on Alexa.

Data Science: Analyzing crime stats in Seattle and San Francisco

April 26, 2017

An analysis in R of the periodicity of criminal activity and its geospatial distribution by district.
