This final post of the Adaptive RAG series explores methods that treat adaptive retrieval as a learned skill and explicitly teach models when to retrieve. We examine three paradigms in increasing order of sophistication.
This post introduces techniques that probe the LLM’s internal confidence and knowledge boundaries. We explore prompt-based confidence detection, consistency-based uncertainty estimation, and internal state analysis approaches to determine when retrieval is truly necessary.
Building on Part 1’s exploration of naive RAG’s limitations, this post introduces adaptive retrieval frameworks and pre-generation methods that decide whether retrieval is truly necessary.
Retrieval-Augmented Generation (RAG) isn’t a silver bullet. This post highlights the hidden costs of RAG and makes the case for a smarter, adaptive approach.
Learned embeddings often suffer from ‘embedding collapse’, where they occupy only a small subspace of the available dimensions. This article explores the causes of embedding collapse across architectures, from two-tower models to GNN-based systems, and its impact on model scalability and recommendation quality. We discuss methods to detect collapse and examine recent solutions proposed by research teams at Visa, Facebook AI, and Tencent Ads to address this challenge.
This article provides an introduction to online advertising systems and explores research on incorporating ads into LLM responses to user queries of a commercial nature.