Shasvat Desai

Staff Machine Learning Scientist,

Walmart Global Tech

ABOUT THE SPEAKER:

I am currently a Machine Learning Scientist in the Sponsored Products Search team at Walmart that is responsible for powering the advertising technlogy for Walmart’s e-commerce platform. My work spans the domain of semantic query and item understanding, retrieval (traditional IR and neural networks), ranking, and ad auction and monetization. Apart from product dev, I work on applied research. Recently, I got a paper accepted at SIGIR 2026, Industry track: https://arxiv.org/pdf/2604.07930

Prior to that, I was a computer vision scientist at Walmart’s Intelligent Retail Lab (IRL). My work in the retail space focused on scene understanding and fine-grained image retrieval, where I developed solutions that significantly reduce shrinkage at self-checkout systems (SCO), leading to millions of dollars in recovered revenue. These systems utilize multiple sensor inputs, including computer vision cameras, weight sensors, RFID, hand scanners, and barcode readers, to streamline and enhance the checkout process. I was recognized with the prestigious Impact Award for driving extraordinary contributions by proactively identifying critical improvement areas, implementing innovative solutions, and delivering exceptional business results.

Prior to joining Walmart, I worked at Orbital Insight where I worked on development of multi-class object detectors to identify ships, aircraft, and armored vehicles from satellite imagery, supporting strategic intelligence initiatives. My experience also extends to environmental monitoring, where I worked on land use change detection algorithms. My research in geospatial computer vision includes authoring a paper on “Active Learning for Improved Semi-Supervised Semantic Segmentation in Satellite Images,” accepted at WACV 2022.

Earlier, I pursued my master’s research at UMass Amherst, working with Professor Madalina Fiterau. During my time at UMass Amherst, I co-authored a paper titled “Pedestrian Detection in Thermal Images Using Saliency Maps,” published in the CVPR 2019 workshop.

TALK TITLE:

INSPIRE: Intent-aware Neural Sponsored Product Retrieval for E-commerce

TRACK:

Technical / Engineering Talks

SUB TOPIC:

Fine-Tuning & Training – Safety / Governance / Auditability

ABSTRACT:

Walmart holds the largest share of the U.S. e-commerce grocery market, where food and beverage categories generate some of the highest search traffic and, consequently, drive a substantial portion of sponsored search revenue. At this scale, even small mismatches between user intent and retrieved products can lead to significant losses in both user engagement and monetization. Yet, understanding user intent in grocery search is inherently challenging. Queries are typically short, ambiguous, and highly diverse, often underspecifying critical preferences.

For example, a query like schar white bread implicitly encodes a gluten-free preference through brand association, while queries such as chickpea pasta or oatmilk reflect underlying dietary preferences like gluten-free, plant based, or lactose-free alternatives. Failing to capture these signals results in retrieving products that might be semantically similar but misaligned with the user’s true needs.

From the advertiser’s perspective, many products are explicitly designed to target specific intents—such as dietary preferences or size variants—and must be surfaced at the right moment to be effective. For example, a brand like Quest Nutrition, which sells high-protein, low-sugar snacks, wants its products to appear for queries like protein bars, low carb snacks, or keto snacks, even when these attributes might not be explicitly stated in the product title text. When retrieval systems fail to capture these intent signals, relevant products are not shown to the right users at the right time. From an advertiser’s perspective, this means their products are missing high-intent opportunities where conversion is most likely. Over time, this leads to lower returns on ad spend, reduced trust in the platform, and potential advertiser attrition. Losing advertisers directly translates to a loss in advertising revenue and weakens the overall sponsored search ecosystem. This challenge is further amplified in sponsored search, where only a limited number of ad slots are available, making precise relevance essential. Thus, we propose INSPIRE (Intent-aware Neural Sponsored Product Retrieval for E-commerce), an intent aware retrieval framework for sponsored search that leverages structured intent signals to better align user queries with relevant food and beverage products. INSPIRE represents intent as a set of structured, multi-dimensional attributes derived from both user queries and product content, capturing explicit signals (e.g., brand, flavor) as well as implicit preferences (e.g., dietary constraints, cuisine types) that are often not directly expressed in queries.

We develop a weakly supervised intent learning pipeline, where a large language model serves as a teacher to generate structured intent annotations from product titles and descriptions. We then distill these annotations by using them to finetune a lightweight student LLM model through LoRA based supervised finetuning (LoRA-SFT) that predicts intent attributes—such as brand, flavor, dietary preference, ingredient, product subtype, and cuisine type—at Walmart catalog scale. We then introduce an intent-augmented dense retrieval framework, where predicted intents are incorporated into query and product representations within a bi-encoder, enabling more precise matching between queries and sponsored products. To support real-world usage, we deploy the system as a scalable inference service. The distilled student model is served via a high-throughput API powered by vLLM, enabling efficient intent prediction over large product catalogs with low latency. This design ensures that
intent-aware retrieval can be applied in production settings while maintaining efficiency and scalability.

WHAT YOU’LL LEARN:

Treat retrieval failures as an intent problem, not only a semantic matching problem. In domains like grocery, many bad results are lexically similar but violate hidden constraints such as dietary needs, brand-linked expectations, or cuisine type.
Use LLMs first as labelers, not necessarily as serving-time models. A practical pattern is to let larger teacher models generate structured intent annotations, then distill that knowledge into a much smaller deployable model.
Add a consensus stage before training on LLM outputs. Cross-model agreement helps filter noisy or hallucinated labels and makes weak supervision much more usable in downstream retrieval.
Parameter-efficient fine-tuning is a strong fit for this setting. LoRA-based SFT gives a practical way to adapt a compact model for domain-specific intent extraction without the cost of full fine-tuning.
Inject structured intents directly into retrieval representations. Appending predicted intent attributes to both query and item text is a simple but effective way to improve alignment in a bi-encoder setup.
Expect the biggest gains on implicit-intent queries. The approach is especially useful when user needs are underspecified and item titles do not explicitly state the attributes that matter.
Plan for operational tradeoffs early. To make this usable in production, precompute item intents offline, cache frequent query intents, and reserve online inference for cases where caching is not enough.

Shasvat Desai

Who Attends

2023 Event Demographics

2023 Technical Background

2023 Attendees & Thought Leadership