ABOUT THE SPEAKER:
Jess is a devrel engineer at Braintrust and builds a bunch of educational content around evals and AI. She creates technical content on her own social platforms and hosts a podcast on all things tech. Previously, Jess worked as a software engineer at Microsoft and DoorDash. Outside of work, she enjoys playing pickleball, tennis, flag football, and ultimate frisbee.
TALK TITLE:
TRACK:
SUB TOPIC:
ABSTRACT:
AI evals let you replace gut feelings with quantifiable decisions. This talk breaks the basic concepts of evals, including the four core components: datasets, tasks, scoring, and experiments. Then, to solidify the concept, we’ll walk through a real eval comparing agentic search versus vector search for coding agents. We’ll also cover practical challenges like tracing Claude Code subprocess calls and why a single eval run is never enough. You’ll leave with a concrete framework for building evals that actually inform your ship decisions.
WHAT YOU’LL LEARN:
Business Leaders: C-Level Executives, Project Managers, and Product Owners will get to explore best practices, methodologies, principles, and practices for achieving ROI.
Engineers, Researchers, Data Practitioners: Will get a better understanding of the challenges, solutions, and ideas being offered via breakouts & workshops on Natural Language Processing, Neural Nets, Reinforcement Learning, Generative Adversarial Networks (GANs), Evolution Strategies, AutoML, and more.
Job Seekers: Will have the opportunity to network virtually and meet over 30+ Top Al Companies.
Ignite what is an Ignite Talk?
Ignite is an innovative and fast-paced style used to deliver a concise presentation.
During an Ignite Talk, presenters discuss their research using 20 image-centric slides which automatically advance every 15 seconds.
The result is a fun and engaging five-minute presentation.
You can see all our speakers and full agenda here