Sriram Selvam

Senior Software Engineer,

Microsoft

ABOUT THE SPEAKER:

Sriram Selvam is a Senior Software Engineer at Microsoft AI with over 14 years of industry experience, specializing in generative search and the deployment of Large Language Model (LLM) applications across distributed systems. He was a core founding team member behind Bing’s Generative Search framework and continues to build AI solutions that enhance user experiences at scale.

Alongside his engineering role, Sriram is an independent researcher deeply invested in the ethical challenges of AI, particularly long-term privacy, sensitive data memorization, and responsible model behavior. His recent work includes co-developing PANORAMA, a large-scale synthetic dataset of 384,000 samples from realistic human profiles, built to model the distribution and context of Personally Identifiable Information (PII) in online content. This work enables robust model auditing and provides researchers with the open-source tooling needed to evaluate privacy-preserving mitigation strategies. Sriram holds an M.S. in Computer Science from the University of Utah.

TALK TITLE:

Emulating Real-World PII with a Large-Scale Synthetic Dataset to Audit LLM Memorization

TRACK:

Fundamental Research (No Direct Business ROI)

SUB TOPIC:

Safety / Interpretability

ABSTRACT:

To address the critical gap in privacy risk assessment, we introduce PANORAMA (Profile-based Assemblage for Naturalistic Online Representation and Attribute Memorization Analysis). PANORAMA is a large-scale, fully synthetic text corpus containing 384,789 samples derived from 9,674 internally consistent synthetic human profiles. Generated using constrained selection and reasoning LLMs, the dataset spans six distinct online modalities, including social media posts, forum discussions, reviews, and marketplace listings. This session will explore how PANORAMA accurately emulates the naturalistic distribution and variety of sensitive data, enabling researchers to systematically study PII memorization, conduct rigorous model auditing, and benchmark privacy-preserving techniques without exposing real user data.

WHAT YOU’LL LEARN:

TBA

Sriram Selvam

Who Attends

2023 Event Demographics

2023 Technical Background

2023 Attendees & Thought Leadership