Helena Yu
Strategic Lead,
Sprout Cilmate

ABOUT THE SPEAKER:

Helena founded Sprout (formerly Climate Resilient Communities), empowering equity-deserving communities with data and climate literacy tools and programs, and has led global data science initiatives with Statistics Without Borders on women’s rights advocacy and human trafficking intervention. Her work bridges technical rigor with social impact, driving AI adoption that serves both business outcomes and the greater good.

She is also Head of Data & AI at EnStream, building data-centric and AI-powered digital trust infrastructure in Canada. With experience spanning financial services, telecommunications, healthcare, and the public sector, she has led cross-functional teams in developing enterprise-grade AI/ML solutions—from RAG systems to fraud detection and personalized recommendation engines.

TALK TITLE:

Lean by Design: How a Three-Person Nonprofit AI Team Shipped a Production-Grade Multilingual RAG and Conversational System

TRACK:

Technical / Engineering Talks

SUB TOPIC:

Data Engineering / Rag Pipelines – Search / Recommendation Systems

ABSTRACT:

Sprout is a Toronto nonprofit operating as an AI- and data-first organization that works alongside urban Canadian communities and grassroots organizations, providing data tools, local insights, and storytelling support that reflects community resilience building work already happening on the ground. With a core team of three, we built and run the Multilingual Climate Chatbot (MLCC) — a production RAG system serving 200+ languages to communities that don’t speak English or French as their first language across Toronto. We did it on a near-zero budget, under real infrastructure constraints, with no option to optimize later. And we did it without compromising on our values: every model and infrastructure choice was evaluated not just on cost and quality, but on environmental footprint. This talk covers four things: team design as architecture, responsible model selection under constraint, what broke in production, and a retrieval architecture built from failure. Every design decision traces back to a constraint the team couldn’t ignore: budget, latency, language quality, team size, or environmental responsibility. Infrastructure cost: $26/month. Languages served: 200+. Team size: 3.

WHAT YOU’LL LEARN:

  1. A team ownership model with specific role boundaries and deployment authority decisions that eliminates coordination overhead.
  2. A model selection framework from a real multi-model, multi-language bakeoff showing how to define “good enough” when cost, latency, and environmental responsibility are simultaneous constraints.
  3. A retrieval architecture pattern (open source, adaptable in an afternoon) using adaptive quality gating, diversification, and backfill logic, built directly from named production failures.

Who Attends

Attendees
0 +
Data Practitioners
0 %
Researchers/Academics
0 %
Business Leaders
0 %

2023 Event Demographics

Technical practitioners working directly with ML/AI systems
0 %
Currently Working in Industry*
0 %
Attendees Looking for Solutions
0 %
Currently Hiring
0 %
Attendees Actively Job-Searching
0 %

2023 Technical Background

Expert/Researcher
14%
Advanced
37%
Intermediate
28%
Beginner
7%

2023 Attendees & Thought Leadership

Attendees
0 +
Speakers
0 +
Company Sponsors
0 +

Business Leaders: C-Level Executives, Project Managers, and Product Owners will get to explore best practices, methodologies, principles, and practices for achieving ROI.

Engineers, Researchers, Data Practitioners: Will get a better understanding of the challenges, solutions, and ideas being offered via breakouts & workshops on Natural Language Processing, Neural Nets, Reinforcement Learning, Generative Adversarial Networks (GANs), Evolution Strategies, AutoML, and more.

Job Seekers: Will have the opportunity to network virtually and meet over 30+ Top Al Companies.

Ignite what is an Ignite Talk?

Ignite is an innovative and fast-paced style used to deliver a concise presentation.

During an Ignite Talk, presenters discuss their research using 20 image-centric slides which automatically advance every 15 seconds.

The result is a fun and engaging five-minute presentation.

You can see all our speakers and full agenda here

Get our official conference app
For Blackberry or Windows Phone, Click here
For feature details, visit Whova