Suhas Pai
Co-Founder & CTO, Hudson Labs
(formerly Bedrock AI)
Denys Linkov
Head of ML, Wisedocs
Sameer Mohan
Senior Staff Architect, Solutions & AI, OMERS
Talk Title: AI in the Pension Pipeline: OMERS' GenAI Revolution in a Regulated Contact Centre
Diederik van Liere
CTO, Wealthsimple
Panel: The New Career Playbook: Agents, LLMs, and What Comes Next?
Ehsan Amjadian
Head of Artificial Intelligence Acceleration & Innovation, RBC
Panel: The New Career Playbook: Agents, LLMs, and What Comes Next?
Graham Toppin
Co-Founder, Peerlabs.ai
Panel: The New Career Playbook: Agents, LLMs, and What Comes Next?
Sarah Sun
Senior Director, Risk Modelling, RBC
Panel: The New Career Playbook: Agents, LLMs, and What Comes Next?
David Hughes
Principal Solution Architect - Engineering & AI, Enterprise Knowledge
Workshop: Advancing GraphRAG - Multimodal Integration
Presenter:
Christos Melidis, Staff Machine Learning Scientist, Ada Support
About the Speaker:
Christos is a Machine Learning Scientist at Ada. His work revolves around Natural Language Processing and Conversational AI, now with a strong focus on LLMs. He has researched, designed, and developed most parts of the Conversational AI stack, including retrieval and knowledge management systems, reasoning architectures, hallucination detection, explainability, and testing methodologies for conversational agents.
Track: Advanced RAG
Technical Level: 300 – Advanced
Abstract:
Everyone’s talking about how to make generative AI more accurate. Fewer hallucinations. Better answers. Faster response times. But here’s the thing: you can’t fix bad answers with better models alone. You need better inputs. And that starts with better retrieval.
At Ada, we don’t just build AI agents—we build the platform that empowers companies to deploy their own. That means we give businesses the tools to create, train, and manage agents that not only generate responses, but generate them with precision.
Retrieval is a critical part of that equation, because how you find, rank, and pass context to a model determines how effective the agent will be in production. Standard RAG (Retrieval-Augmented Generation) methods aren’t always enough. That’s where GARAGe comes in.
GARAGe (Generative-Augmented Retrieval-Augmented Generation) flips the RAG script. Instead of using retrieval to improve generation, we use generation to improve retrieval. Sounds recursive? It is. And it works.
What You’ll Learn:
A new architecture optimization for RAG, focused on the Customer Support domain.
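For readers unfamiliar with the general pattern, the sketch below shows one published way of using generation to improve retrieval: embedding an LLM-drafted answer alongside the question, in the spirit of hypothetical-document embedding. It is an illustration of the general idea only, not a description of Ada's GARAGe architecture, and embed_fn and generate_fn are hypothetical stand-ins for your own embedding model and LLM client.

```python
# Illustration only: generation-assisted retrieval via a hypothetical draft answer.
# NOT Ada's GARAGe architecture; embed_fn and generate_fn are hypothetical stand-ins.
from typing import Callable, List, Tuple
import numpy as np

def generation_augmented_retrieve(
    question: str,
    corpus: List[str],
    corpus_vecs: np.ndarray,                     # precomputed embeddings, shape (n_docs, dim)
    embed_fn: Callable[[str], np.ndarray],
    generate_fn: Callable[[str], str],
    top_k: int = 5,
) -> List[Tuple[str, float]]:
    # 1. Ask the LLM to draft a plausible answer (it may be wrong; that is fine).
    draft = generate_fn(f"Draft a short answer to this support question: {question}")
    # 2. Embed the question and the draft, then average them into one query vector.
    q_vec = (embed_fn(question) + embed_fn(draft)) / 2.0
    # 3. Rank documents by cosine similarity to the combined query vector.
    sims = corpus_vecs @ q_vec / (
        np.linalg.norm(corpus_vecs, axis=1) * np.linalg.norm(q_vec) + 1e-9
    )
    order = np.argsort(-sims)[:top_k]
    return [(corpus[i], float(sims[i])) for i in order]
```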
Presenter:
Abhimanyu Anand, Senior Data Scientist, Elastic
About the Speaker:
Abhimanyu is a Sr. Data Scientist at Elastic, where he works on the development of search solutions powered by GenAI. He holds an M.Sc. in Big Data Analytics from Trent University, with a specialization in natural language processing. He has developed and implemented robust AI solutions throughout his career across diverse domains, including internet-scale platforms, metals and mining, oil and gas, and e-commerce.
Track: Advanced RAG
Technical Level: 200 – Intermediate
Abstract:
Deploying RAG systems at scale brings unique challenges. This session tackles the critical transition from prototype to production, focusing on how to reduce latency and cost without compromising performance.
The session is divided into two parts:
First, the presentation will cover practical strategies for optimizing different stages of the RAG pipeline: Data Preparation, Retrieval & Ranking, and Generation & Observability. We’ll walk through high-impact techniques including:
1. Embedding quantization to reduce memory footprint and compute cost
2. Context highlighting to improve relevance and reduce latency
3. Reciprocal Rank Fusion (RRF) for combining rankings in low-latency use cases (see the sketch at the end of this section)
4. Context compression to reduce latency
Second, the hands-on coding lab will guide participants through implementing these strategies in a real-world workflow. Using Python, Google Colab, Elasticsearch, and Hugging Face models, attendees will apply techniques including filtered search, embedding quantization, and context highlighting.
Prerequisites: A working knowledge of Python is required. Prior exposure to RAG concepts is helpful but not mandatory.
What You’ll Learn:
A set of practical strategies for optimizing their RAG pipelines, including high-impact techniques like embedding quantization, hybrid search, and context compression.
The framework to analyze their RAG architecture to identify the critical optimization points for both latency and cost.
An understanding of how to use observability to measure the impact of their optimizations and make informed, data-driven decisions.
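As a companion to the Reciprocal Rank Fusion technique listed above, here is a minimal sketch of RRF in plain Python. The k = 60 constant is the value commonly used in the RRF literature, and the two input rankings are hypothetical lexical and vector search results rather than real Elasticsearch output.

```python
# Minimal Reciprocal Rank Fusion (RRF) sketch: merge several ranked lists of
# document IDs into one ranking without tuning score scales.
from collections import defaultdict
from typing import Dict, List

def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> Dict[str, float]:
    scores: Dict[str, float] = defaultdict(float)
    for ranked_list in rankings:
        for rank, doc_id in enumerate(ranked_list, start=1):
            scores[doc_id] += 1.0 / (k + rank)    # each list contributes 1/(k + rank)
    return dict(sorted(scores.items(), key=lambda kv: kv[1], reverse=True))

bm25_hits = ["doc_3", "doc_1", "doc_7"]           # hypothetical lexical results
vector_hits = ["doc_1", "doc_9", "doc_3"]         # hypothetical semantic results
print(reciprocal_rank_fusion([bm25_hits, vector_hits]))
```

Because RRF works on ranks rather than raw scores, it needs no score normalization, which is part of what makes it attractive for low-latency hybrid search.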
Presenter:
Bang Liu, Associate Professor, University of Montreal & Mila
About the Speaker:
Bang Liu is an Associate Professor in the Department of Computer Science and Operations Research (DIRO) at the University of Montreal (UdeM). He is a member of the RALI laboratory (Applied Research in Computer Linguistics) of DIRO, a member of Institut Courtois of UdeM, an associate member of Mila – Quebec Artificial Intelligence Institute, and a Canada CIFAR AI Chair. His research interests primarily lie in the areas of natural language processing, multimodal & embodied learning, theory and techniques for AGI (e.g., understanding and improving large language models and intelligent agents), and AI for science (e.g., material science, health).
Track: Agents Zero To Hero
Technical Level: 100 – Beginner
Abstract:
The advent of large language models (LLMs) has revolutionized artificial intelligence, laying the foundation for sophisticated intelligent agents capable of reasoning, perceiving, and acting across diverse domains. These agents are increasingly central to advancing AI research and applications, yet their design, evaluation, and enhancement pose intricate challenges. In this talk, we will offer a fresh perspective by framing intelligent agents through a modular and cognitive science-inspired lens, bridging AI design with insights from different disciplines to propose a unified framework for understanding their core functionalities and future potential. We will explore the modular design of intelligent agents and present a framework for cognition, perception, action, memory, reward systems, and so on. Then we will discuss each module in detail. Our talk aims to provide a holistic and interdisciplinary perspective for intelligent agent research.
What You’ll Learn:
The concept and architecture of foundation agents.
Presenter:
Millin Gabani, CEO, Keyflow
About the Speaker:
Millin Gabani is the co-founder and CEO of Keyflow, where he helps businesses implement AI agent systems with a human-in-the-loop approach to automate operations and drive efficiency. With a unique blend of engineering, UX, and agent orchestration expertise, Millin has been building with agents since the earliest days of tool calling and interface-driven AI.
Previously, he worked on machine learning systems at Pinterest, contributing to the Home Feed recommendations team, and was part of the Search Infrastructure group at Google. He also helped develop AI-driven medical coding solutions at Fathom.
Millin has led workshops and lectures across Toronto focused on practical applications of LLM agents, Model Context Protocol, and the design of effective human-agent interfaces. His current work focuses on making agent systems reliable, modular, and aligned with real business workflows.
Track: Agents Zero To Hero
Technical Level: 400 – Expert
Abstract:
Most agent systems today either try to be too autonomous and end up brittle, or they are overly scripted and fail to adapt. In this talk, we’ll walk through how to design a deterministic network of modular agents that can reliably automate complex workflows in production.
Instead of relying on a single general-purpose agent, we structure the system as a directed acyclic graph (DAG), where each node represents a narrow, task-specific agent. This architecture ensures clarity in execution paths, supports debugging and retries, and enables clean handoffs between agents. We’ll explore how this design balances structure with flexibility, allowing for controlled decision-making while maintaining overall system reliability.
As a case study, we’ll demonstrate an agent system built for private equity and venture capital workflows. The system continuously ingests pitch decks, performs deep market research, and generates interactive reports. This approach has significantly reduced manual effort while maintaining quality and interpretability.
This talk is for builders and AI product teams looking to move beyond simple prompt-based agents and build scalable, production-grade systems that automate real work.
What You’ll Learn:
1. Agent systems are more reliable in production when structured as deterministic workflows rather than monolithic autonomous agents.
2. A directed acyclic graph (DAG) of modular agents enables clean handoffs, scoped decision-making, and scalable automation.
3. Balancing determinism with controlled autonomy is key to building agent systems that are both flexible and production-ready.
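A minimal sketch of the DAG-of-agents pattern described above, using Python's standard-library topological sorter. The node names, prompts, and call_llm() helper are hypothetical illustrations of the pattern, not the presenter's production system.

```python
# Minimal sketch of a deterministic DAG of narrow, task-specific agents.
from graphlib import TopologicalSorter   # Python 3.9+ standard library

def call_llm(prompt: str) -> str:
    """Stand-in for whatever LLM client you use."""
    return f"<llm output for: {prompt[:40]}...>"

def ingest_deck(state):      state["deck_text"] = "parsed pitch deck text"; return state
def market_research(state):  state["research"] = call_llm(f"Research the market for: {state['deck_text']}"); return state
def financial_check(state):  state["financials"] = call_llm(f"Extract key financials: {state['deck_text']}"); return state
def write_report(state):     state["report"] = call_llm(f"Write a report from {state['research']} and {state['financials']}"); return state

# Edges: each node lists its prerequisites, so the execution path is explicit,
# debuggable, and retryable one node at a time.
dag = {
    "ingest_deck": set(),
    "market_research": {"ingest_deck"},
    "financial_check": {"ingest_deck"},
    "write_report": {"market_research", "financial_check"},
}
agents = {"ingest_deck": ingest_deck, "market_research": market_research,
          "financial_check": financial_check, "write_report": write_report}

state = {}
for node in TopologicalSorter(dag).static_order():   # deterministic execution order
    state = agents[node](state)
print(state["report"])
```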
Presenter:
Amna Jamal, National Data and AI Expert, IBM Canada
About the Speaker:
Amna Jamal is a seasoned Data and AI expert at IBM, with over eight years of experience in data management, data science, and AI. As Watsonx Technical Leader for Canada, she drives innovation at the intersection of data and AI, helping organizations unlock their full potential.
Specializing in Information Management, DataOps, and ModelOps, Amna delivers customized solutions that optimize business processes and enhance revenue streams. Amna is a trusted advisor to senior leadership, guiding strategic decision-making and solving technical challenges. She leads a team of technical specialists, ensuring they meet client requirements and business objectives. Her ability to align diverse stakeholders around a shared vision has resulted in transformative solutions.
With a Ph.D. in Electrical and Computer Engineering from the National University of Singapore, Amna combines research expertise with hands-on experience to deliver impactful AI-driven solutions.
Track: Agents Zero To Hero
Technical Level: 200 – Intermediate
Abstract:
Agentic AI represents the next frontier in enterprise intelligence: AI systems capable of autonomously perceiving their environment, reasoning through complex decisions, and acting with minimal human intervention to deliver real business outcomes. Unlike traditional AI models that require human orchestration, agentic systems are designed to operate with a degree of autonomy and adaptability, making them especially valuable in fast-paced, data-rich business environments.
In this session, we'll explore what Agentic AI truly means in a business context, separating hype from reality. We will discuss the architecture of agentic systems, the importance of goal-driven behavior, and how these agents are being deployed across industries today. From intelligent HR agents that resolve issues end-to-end, to supply chain optimizers that make real-time adjustments, to agents that automate complex knowledge work, Agentic AI is reshaping the way businesses operate.
We'll also address practical considerations: What risks should organizations be aware of when implementing agentic solutions? How can enterprises balance autonomy with governance and compliance?
Through a live demo and a real industry use case, we'll review the design and development of Agentic AI systems that are not just intelligent, but truly purpose-driven for business. Attendees will leave with tangible examples, actionable insights, and a roadmap to begin or scale their agentic AI journey.
What You’ll Learn:
Understanding the concept and benefits of Agentic AI
Learning about real-world applications and use cases
Receiving a roadmap to implement or scale Agentic AI in their own organizations
Presenter:
John Gilhuly, Head of Developer Relations, Arize
About the Speaker:
John is Head of Developer Relations at Arize AI, where he works on open-source LLM observability and evaluation tooling. He holds an MBA from Stanford—where he focused on the ethical, social, and business implications of open vs. closed AI development—and a B.S. in Computer Science from Duke. He is passionate about ensuring the benefits stemming from AI and ML are felt equally across socio-economic divides. Prior to joining Arize, John led GTM activities at Slingshot AI and served as a venture fellow at Omega Venture Partners. In his pre-AI life, John built out and ran technical go-to-market teams at Branch Metrics.
Track: Agents Zero To Hero
Technical Level: 200 – Intermediate
Abstract:
As AI systems shift from static language generation to dynamic, agentic behavior (reasoning over time, taking actions, and interacting with tools), our evaluation methods need to evolve. Traditional metrics like BLEU, ROUGE, or even hallucination detection fall short when applied to multi-step, goal-driven agents.
This talk introduces a modern toolkit for evaluating AI agents using LLMs themselves. We’ll explore techniques such as code-based validation, LLM-as-judge approaches, human-in-the-loop assessments, and benchmarking against ground truth. You’ll learn how to design high-quality evaluations aligned with real-world task goals, structure interpretable outputs, and scale evaluation processes across teams and systems. The session also covers telemetry best practices and emerging standards like OpenInference that aim to bring consistency and rigor to agent evaluation.
What You’ll Learn:
Evaluating AI agents requires rethinking traditional LLM metrics—effective evaluation must align with real-world tasks, multi-step behavior, and agent-specific goals.
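A minimal sketch of the LLM-as-judge idea described in the abstract, paired with a simple code-based check. Here call_llm() is a hypothetical stand-in for whichever LLM client you use, and the rubric and labels are illustrative rather than a prescribed Arize or OpenInference schema.

```python
# Minimal LLM-as-judge sketch for grading an agent run against a task goal.
import json

def call_llm(prompt: str) -> str:
    # Hypothetical stand-in; a real client would send `prompt` to an LLM.
    return '{"label": "correct", "explanation": "The steps achieve the stated goal."}'

JUDGE_TEMPLATE = """You are grading an AI agent.
Task goal: {goal}
Agent transcript (tool calls and final answer): {transcript}
Respond with JSON: {{"label": "correct" | "incorrect", "explanation": "..."}}"""

def judge_agent_run(goal: str, transcript: str) -> dict:
    raw = call_llm(JUDGE_TEMPLATE.format(goal=goal, transcript=transcript))
    verdict = json.loads(raw)                      # structured, interpretable output
    assert verdict["label"] in {"correct", "incorrect"}
    return verdict

# Code-based validation can run alongside the judge, e.g. checking that the
# agent actually called a required tool before answering.
def called_required_tool(transcript: str, tool_name: str) -> bool:
    return tool_name in transcript

run = "call: search_flights(...) -> ... final answer: booked YYZ->SFO"
print(judge_agent_run("Book the cheapest YYZ->SFO flight", run),
      called_required_tool(run, "search_flights"))
```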
Presenter:
Rose Genele, CEO, The Opening Door
About the Speaker:
As CEO of The Opening Door, Rose specializes in responsible artificial intelligence integration for investors and enterprise companies. Her work emphasizes the importance of safe and ethical AI—where technologies are designed with fairness, transparency, accountability, and human-centred design in mind. With years of experience in the tech industry, Rose has developed a reputation as an AI transformationalist with a penchant for data, ethics, and futures-forward thinking.
Rose sits on the boards of the Canadian Centre for Ethics and Corporate Policy and Volcano Theatre. She was a 2024 nominee for the RBC Canadian Women Entrepreneur Awards and a recipient of Canada's Top 100 Black Women to Watch 2024 Award. Rose is also an alumna of Toronto Metropolitan University, with a Bachelor of Commerce in Law.
Track: AI Ethics And Governance Within The Organization
Technical Level: 200 – Intermediate
Abstract:
As AI systems move from research labs into real-world applications, the question of responsibility becomes increasingly urgent: who owns AI ethics inside the organization? Is it the data scientists building the models? The legal team writing the policies? Or leadership setting the vision?
This talk dives into the organizational dimension of Responsible AI, unpacking what it takes to move ethical principles off the page and into the workflows of technical teams. Drawing from real-world examples across industries, we’ll explore how leading organizations are structuring cross-functional governance, distributing ethical responsibilities, and embedding accountability into the AI development lifecycle.
Attendees will leave with a clear understanding of:
-How to define and assign roles in Responsible AI initiatives
-Organizational models for AI governance (and when to use them)
-Practical strategies to empower ML professionals to make ethically-informed decisions
-Common pitfalls when ethics becomes a “check-the-box” activity—and how to avoid them
Whether you’re part of an ML research team, a startup shipping AI products, or a mature enterprise scaling AI operations, this session will help you reimagine AI ethics as a team sport, and not a compliance burden.
What You’ll Learn:
1. Governance models for AI within startups vs. enterprises
2. What effective RAI teams look like: roles, rituals, and decision gates
3. How to create a culture of accountability across technical and non-technical staff
Presenter:
Mehrin Kiani, Machine Learning Scientist, Protect AI
About the Speaker:
Mehrin Kiani, PhD, has over a decade of experience as a Machine Learning (ML) researcher specializing in the development of ML algorithms. Dr. Kiani's influential research has been featured in renowned journals, including IEEE Transactions on Artificial Intelligence and Nature. In her current role as an ML Scientist at Protect AI, Dr. Kiani focuses on leveraging open-source software to enhance the security of ML systems. She has presented her work at multiple IEEE AI conferences as well as in industry at RSA 2024.
Track: AI Ethics And Governance Within The Organization
Technical Level: 300 – Advanced
Abstract:
Together with Hugging Face (HF), Protect AI is paving the way for ML users to verify a machine learning (ML) model's safety before use. As of April 1, 2025, Protect AI has scanned around 1.41 million HF model repositories. From the scan results, Protect AI has identified a total of 352,000 unsafe or suspicious issues across 51,700 ML models. The scans show that the most prevalent form of attack on ML models is the Model Serialization Attack (MSA). To encourage all ML developers and users to adopt zero trust for ML models, this talk will cover MSAs as well as the findings of our HF scans. The scan results are also available in Protect AI's publicly available database, Insights DB.
What You’ll Learn:
Main message: always scan an ML model before you use or load it. Just like any other file from the internet, ML models should be treated with zero trust.
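A minimal, self-contained demonstration of why this matters, assuming nothing beyond the Python standard library: pickle-based model files can execute arbitrary code at load time, which is exactly the Model Serialization Attack described above. The payload here is a harmless echo command, and this is a generic illustration, not code from the Protect AI or Hugging Face scanning pipeline.

```python
# Minimal demonstration of a Model Serialization Attack (MSA): pickle runs
# arbitrary code at load time via __reduce__. The payload is a harmless `echo`,
# but it could be anything.
import os
import pickle

class MaliciousWeights:
    def __reduce__(self):
        # Whatever this returns is executed when the file is unpickled.
        return (os.system, ("echo 'model file executed code on load!'",))

payload = pickle.dumps(MaliciousWeights())

# An unsuspecting consumer "loads the model" -> the command runs immediately.
pickle.loads(payload)

# Mitigations: scan artifacts before loading (e.g., with an open-source model
# scanner), prefer non-executable formats such as safetensors, and treat every
# downloaded model file with zero trust, like any other file from the internet.
```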
Presenters:
Fion Lee-Madan, Co-Founder, Fairly AI | Natalia Nygren, Practice Lead, AI, Omdia | Jeremy Souffir, Director of Engineering and GenAI, Loblaw Digital | Valence Howden, Advisory Fellow, Info-Tech Research Group | Himanshu Joshi, Applied AI, IBM | Moderator: Mayan Murray, Senior Managing Consultant, Integrated AI Governance, Vector Institute
About the Speakers:
Fion Lee-Madan is a technology leader and innovator with over 25 years of experience in telecommunications, e-commerce, and artificial intelligence. As Co-Founder of Fairly AI, she drives AI governance, risk management, and compliance initiatives, earning recognition as a Globe and Mail Best Executive in 2023. With a global career spanning North America and Asia, she started as a software developer and held different technical roles at Nortel Networks, Sapient, and Intuit, specializing in emerging technologies and digital transformation. A strong advocate for diversity in tech, Fion mentors young innovators and contributes to AI trust and safety efforts worldwide through organizations like Partnership on AI, AI Verify Foundation, and Vector Institute. She holds an Honours BSc from the University of Toronto and an MBA from Boston University.
Natalia is a research director and practice lead for AI at Omdia where she leads a global team of industry analysts helping vendors and enterprises to scale and monetize AI, balancing profit, people and responsible technology use. Natalia has led her own development and strategy teams and helped 200+ clients globally to innovate through data and AI. She holds a PhD in AI (NLP) from the University of Edinburgh and an Executive MBA from Western University.
A passionate leader and technical expert with over 15 years of experience building and scaling high-performing teams to deliver innovative and reliable solutions. My career spans both the dynamic startup world and large-scale enterprise environments, providing me with a unique perspective on software development and leadership.
Half of my career was spent in startups, including one that grew into a multi-billion dollar, publicly traded (NASDAQ) company. This experience instilled in me a deep understanding of rapid innovation, agile development, and the challenges of scaling a business from the ground up. The other half of my career has been dedicated to large companies like Amazon, where I honed my skills in building and managing high-scale systems capable of handling extreme loads. I gained invaluable experience in designing and implementing robust software solutions for complex business problems.
For the past year, I’ve been leading the GenAI initiatives at Loblaw Digital, driving innovation and exploring new possibilities across the entire Loblaw organization.
At the heart of my leadership style is a focus on fostering a culture of innovation, where every team member is empowered to think outside the box and push boundaries. I believe that the key to building successful products is to foster a culture of experimentation, where it’s safe to take risks, learn from failures, and pivot quickly. By creating a collaborative and inclusive environment, I ensure that everyone on my team is aligned around a common goal, and has a shared sense of ownership and responsibility for the success of the project.
As a leader, I am known for my ability to inspire and motivate my team, driving them to exceed expectations and consistently deliver high-quality work. I believe that the key to building a high-performing team is to hire exceptional talent and provide them with the support and guidance they need to thrive. By creating an environment where everyone can grow and develop, I have built teams that are capable of taking on the most challenging projects and delivering exceptional results.
Overall, my passion for engineering, leadership, and innovation is what drives me every day. Whether I am managing a team or working on a project myself, I am always striving to build better solutions, learn new things, and make a positive impact on the world.
Valence Howden is an International Speaker, Advisory Fellow and Distinguished Analyst at Infotech Research Group with over 30 years of experience in optimizing organizations through improving their governance, strategies, operating models and risk management practices.
Valence is a 2023 and 2024 HDI Top 25 thought leader in service management and is the Standards Council of Canada chair – and ISO voting member and delegate for Canada – for IT Governance, Service Management and Business Process Management.
Track: AI Ethics And Governance Within The Organization
Technical Level: 200 – Intermediate
Abstract:
AI governance is often seen as a bureaucratic hurdle—tedious compliance paperwork that slows down innovation. But with AI regulations already here, what if governance could be engineered just like the systems we build? This talk is designed by developers, for developers who want to deploy AI without drowning in red tape. We’ll break down how to transform compliance checklists into a practical, developer-friendly workflow that ensures AI systems are not just compliant on paper but actually functional and reliable in production. You’ll learn how other organizations have automated key governance tasks, integrating risk assessments into their CI/CD pipeline, and making AI governance a first-class citizen in your development process—without losing momentum.
What You’ll Learn:
Instead of just talking about responsible AI and ethics frameworks, learn from companies that have already operationalized AI governance.
Presenters:
Atinder Saini, Director, Advanced Analytics & AI, CIBC | Ankit Misra, Director AI Governance, CIBC
About the Speakers:
TBA
Track: AI Ethics And Governance Within The Organization
Technical Level: 200 – Intermediate
Abstract:
The rapid advancement of Agentic AI systems – capable of autonomous goal setting, planning, and execution in complex environments – presents unprecedented opportunities alongside significant governance challenges. Traditional approaches focusing mostly on pre-deployment testing and post-deployment monitoring often fall short in managing the emergent behaviors and potential risks of these highly autonomous systems. This talk emphasizes the need for a paradigm shift towards “Governance by Design,” integrating safety, alignment, and control mechanisms directly into the architecture, development, and operations of Agentic AI.
We will explore the unique characteristics of Agentic AI that necessitate this proactive approach, contrasting it with governance strategies for simpler ML models. This session will provide attendees with insights and techniques for embedding governance principles into their own Agentic AI projects, fostering the development of more trustworthy, reliable, and beneficial autonomous systems.
What You’ll Learn:
A clear understanding of the unique governance challenges posed by Agentic AI systems.
An appreciation for the limitations of traditional governance methods when applied to AI Agents.
Knowledge of the core principles behind the “Governance by Design” paradigm for AI.
Familiarity with a range of specific techniques and architectural patterns for embedding safety, alignment, and control into Agentic AI.
Insights into the practical challenges, trade-offs, and open research questions in governing Agentic AI by design.
Motivation to proactively incorporate governance considerations early in the development lifecycle of Agentic AI systems.
Presenters:
Ozge Yeloglu, VP of Advanced Analytics and AI, CIBC | Yannick Lallement, Chief AI Officer, Scotiabank | Eric Morrow, Managing Director, Data Science & AI, AI+ Research and Commercialization, BMO | Saba Zuberi, Associate Vice President Applied AI, Layer6 AI | Patricia Arocena, RBC Distinguished Engineer, Head, RBC Innovation Labs, Royal Bank of Canada (RBC)
About the Speakers:
Yannick Lallement serves as the Chief AI Officer at Scotiabank, spearheading the integration and advancement of AI/ML technologies across the institution. He is passionate about making AI practical, safe and scalable. He earned his PhD in artificial intelligence from the esteemed French National Institute of Computer Science. Before his tenure at Scotiabank, Yannick contributed his expertise to a variety of AI/ML initiatives for diverse public and private entities.
Eric Morrow is a Managing Director within BMO’s AI+ Research and Commercialization group, leading a team of data scientists, AI developers and ML engineers that engage with groups and services across the organization to develop anything from data-driven insights to production ML/AI-based solutions on topics ranging from climate hazard assessment to cybersecurity-threat detection and beyond. Prior to BMO, he worked in the aerospace industry on spacecraft development and robotic assembly operations on the International Space Station. He has also worked in the exploration geophysics field on innovative superconducting gravity measurement systems. He holds a PhD in geophysics from Harvard University and Masters degrees in physics and aerospace engineering from the University of Toronto.
Saba Zuberi is the AVP of Applied Artificial Intelligence and Machine Learning (AI/ML) in the AI/ML Center of Excellence and Layer 6 at TD. She leads a team of machine learning scientists to build AI solutions across the enterprise and drives applied AI research to support cutting-edge AI applications at TD. She has 10 years industry experience in AI/ML, from startup to enterprise. She holds a PhD in theoretical particle physics from the University of Toronto and completed her post-doctoral research in particle physics at the Lawrence Berkeley National Laboratory in California.
Patricia Arocena is a Senior Director and Head, RBC Innovation Labs, at Royal Bank of Canada (RBC) where she is responsible for identifying emerging technologies and accelerating their adoption across the bank. With over 20 years of experience in technology and innovation, Patricia has held leadership innovation roles at Tier-1 research institutions in Canada, PwC, and other financial institutions. She was recently appointed an RBC Distinguished Engineer in recognition of her exceptional technology achievements and technical community leadership. Today, Patricia helps organizations create next-gen solutions powered by Artificial Intelligence and other emerging technologies, delivering innovation and measurable value across the industry.
Track: Executive Track
Technical Level: 400 – Expert
Abstract:
This talk will be a panel moderated by Ozge with other AI leaders from Canadian banks. The panelists will speak to:
1) new, exciting, and innovative AI work and initiatives going on at banks;
2) talent and career opportunities, existing or upcoming, and how to get engaged;
3) how banks solve complex problems and find unique opportunities in Canada's regulatory environment.
What You’ll Learn:
Canadian banks are innovating and driving AI adoption in an evolving regulatory environment.
Presenter:
Helena Yu, Principal Consultant, Data & AI, Calian ITCS
About the Speaker:
Helena Yu is a dedicated data and AI leader with over 5 years of experience driving innovation through machine learning technologies. She specializes in bridging the gap between cutting-edge research and practical applications that create meaningful impact across diverse industries.
As an experienced team leader, Helena has successfully guided cross-functional teams in developing and implementing AI/ML solutions for multiple sectors including technology, finance, insurance, retail, telecommunications, and non-profit organizations.
Helena’s approach is fundamentally collaborative. She builds strategic partnerships with academic institutions, global technology companies, government agencies, and nonprofit organizations to ensure that advanced research translates into solutions that address real-world challenges.
What sets Helena apart is her commitment to human-centered AI development. She champions data-driven strategies that remain deeply aligned with the needs and experiences of the people they ultimately serve, ensuring technology advances genuine human progress.
With her technical expertise and collaborative leadership style, Helena continues to push boundaries in how organizations can leverage data, AI, and machine learning to create positive change in the world.
Track: Executive Track
Technical Level: 300 – Advanced
Abstract:
In today’s AI-driven landscape, organizations are eager to implement both traditional and generative AI solutions, yet many underestimate the critical foundation required: data readiness. This talk explores the comprehensive data readiness journey that enables successful AI adoption at scale.
While 90% of business leaders believe their data ecosystems are ready for AI, the reality is starkly different – with 84% of IT practitioners spending at least an hour daily fixing data problems. We’ll examine the key pillars of AI data readiness: infrastructure, integration, quality, governance, accessibility, and literacy, demonstrating how each contributes to AI success through practical assessment frameworks and implementation strategies.
This presentation will outline concrete steps along the data readiness journey, from data collection to data curation and governance. We'll contrast efficiency-focused AI applications with higher-value, growth-oriented use cases that require more sophisticated data foundations. Attendees will learn practical techniques for data readiness assessment, gap remediation, and quantitative measurement approaches.
Whether you’re beginning your AI journey or seeking to scale existing initiatives, this talk will provide actionable strategies to turn your organization’s data from a bottleneck into your greatest AI asset, ensuring both traditional and generative AI applications deliver meaningful business value.
What You’ll Learn:
1. Recognize the significant gap between perceived and actual data readiness
2. Understand the six pillars of AI data readiness and how to assess each
3. Learn practical techniques to systematically improve data quality, accessibility, and governance
4. Develop a roadmap that aligns technical data preparation with business objectives and use cases
5. Distinguish between requirements for efficiency-focused AI versus growth-oriented AI applications
Presenter:
Anurag Arora, Director of Business Strategy & Operations for Data & AI, TELUS
About the Speaker:
Anurag is the Director of Business Strategy & Operations for Data & AI at TELUS, where he leads enterprise-wide artificial intelligence strategy and transformation initiatives.
In his role at TELUS, he oversees AI literacy transformation, governance frameworks, and value creation through strategic data and AI implementations. He specializes in translating complex AI initiatives into actionable growth strategies and contributes to TELUS’ AI advancement through strategic partnerships and thought leadership initiatives.
With over 16 years of experience in the telecom, media and technology sectors across international markets, Anurag has helped organizations drive business growth, innovation, and digital evolution through strategic initiatives.
Anurag holds an MBA from the University of Toronto and an undergraduate degree from York University.
Track: Executive Track
Technical Level: 200 – Intermediate
Abstract:
In this talk, we'll explore our tested approach to identifying, evaluating, and solving complex business challenges in today's rapidly changing AI-driven landscape. We'll discuss our approach to use case prioritization and definition, and an AI engagement model that emphasizes rapid iteration and measurable outcomes. We'll also touch on a few use cases where this cross-functional approach enabled fast learning and course correction. The session concludes with a discussion of how we measure ROI and AI effectiveness. This presentation offers practical insights for leaders seeking to transform their organization's problem-solving capabilities while ensuring tangible returns on AI investments.
What You’ll Learn:
Key steps of successful AI application include identifying and prioritizing use cases; AI engagement models that emphasize rapid iteration; and measuring the ROI of AI.
Presenter:
Ikhtear Bhuyan, Data Security, IBM
About the Speaker:
Technical Executive and Security Architect with more than 16 years of experience in the IT field working in software development, solution architecture, and presales technical support. I work directly with clients, business partners, and consulting firms across Canada and the Caribbean to assess, plan, design, and build cybersecurity solutions focused on SIEM, SOAR, EDR, Data Security (Governance, Risk, Privacy and Compliance), Identity and Access Management, and SOC technologies for different industries and standards (ISO, CIS, PCI, HIPAA, NERC-CIP, NIST, FFIEC, SOC 2, etc.). With a deep understanding of both business and technical security challenges, I provide strategic, thought-leadership advice to organizations. Proven track record of driving technical wins and serving as a trusted advisor to Fortune 500 companies.
Track: Executive Track
Technical Level: 300 – Advanced
Abstract:
As organizations rapidly adopt AI models to enhance business processes and accelerate time-to-market for products and services, cybersecurity and business leaders face new challenges. The rise of generative AI offers resilience and innovation, but it also introduces emerging risks that must be managed effectively.
In this session, we will explore strategies for securing the AI pipeline, with a focus on protecting data from the preparation stage through model development and final deployment into production applications and services. Attendees will gain insights into how comprehensive data security can be achieved by leveraging industry best practices and advanced technical approaches.
Key focus areas:
– Protecting sensitive data in AI models across databases (Db2, Big Data, NoSQL, Vector DBs).
– Preventing unintentional exposure by monitoring data copies across applications and cloud environments.
– Detecting data leakage to safeguard PII, PCI, and intellectual property.
– Ensuring compliance with regulations like LGPD by monitoring data transfers.
– Reducing compliance challenges to meet evolving AI and data regulations.
What You’ll Learn:
– Understand the adoption of GenAI in the market and how to defend against new types of attacks and vulnerabilities introduced by GenAI models in a secure and trustworthy manner
– Understanding of how to secure the data, secure the AI models and secure the usage of the models across the AI pipeline
– Industry best practices and technical approaches to secure the AI pipeline by leveraging the right data protection principles
Presenter:
Tanushree Nori, Principal Data Scientist, Vimeo
About the Speaker:
Tanushree Nori is a Principal Data Scientist at Vimeo, where for the past 4½ years she’s built LLM-powered features—Video Insights, Chapters, Summaries, Highlights, and multilingual Dubbing—that help viewers unlock more value from every upload. Her earlier work on cloud-storage optimization now saves the platform about $1 million annually and was showcased at Demuxed 2024 in San Francisco. When she isn’t dissecting LLM evals for fun, Tanushree is dancing—bringing her Indian classical foundation into hip-hop and house for a trippy, vibrant fusion.
Track: AI for Productivity Enhancements
Technical Level: 300 – Advanced
Abstract:
Imagine every video greeting viewers in their own language—no studio booth, no red-eye caption sprints. Vimeo’s new pipeline turns a single upload into time-locked captions and natural-sounding dubs, almost as fast as the video plays.
1. Gemini Flash 2.0 handles translating transcripts fast enough that you can watch progress in real-time.
2. Careful chain-of-thought prompting coaxes phoneme details of translations, so we can contract roomy German syllables or subtly expand packed Mandarin ones before the subtitles and dubs wander off-beat.
3. A chunking strategy that pins every subtitle segment to its timestamp, keeping drift under 10 ms (see the sketch at the end of this section).
4. Spot a rare error in the subs? Segment-level re-edit lets you fix a single line; only that slice gets re-translated (and re-dubbed if so wished).
5. A creative and thorough eval framework to run translation experiments.
We’ll share the system design involved, prompting tricks, the timing math, and a few war stories when subs and dubs went rogue—plus the metrics and eval methodology that convinced us the system was ready for production.
What You’ll Learn:
A reasonably detailed how-to on turning a single video into accurate subtitles and dubs in many languages—using LLM prompt tricks, phoneme-aware timing, and a chunk-based pipeline that stays fast, editable, and production-ready.
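A minimal sketch of the chunk-and-pin idea from point 3 above, assuming a hypothetical translate_fn and segment format; it illustrates the general approach of translating text per chunk while leaving timestamps untouched, not Vimeo's actual pipeline.

```python
# Sketch of chunked translation with pinned timestamps: subtitle segments keep
# their original start/end times, only the text is sent for translation, and
# the timestamps are reattached afterwards so timing cannot drift.
from typing import Callable, Dict, List

Segment = Dict[str, object]   # e.g. {"start": 12.4, "end": 15.1, "text": "..."}

def translate_chunked(
    segments: List[Segment],
    translate_fn: Callable[[List[str]], List[str]],   # e.g. an LLM translation call
    chunk_size: int = 20,
) -> List[Segment]:
    out: List[Segment] = []
    for i in range(0, len(segments), chunk_size):
        chunk = segments[i:i + chunk_size]
        translated = translate_fn([s["text"] for s in chunk])   # text only
        # Reattach each translation to its original, untouched timestamps.
        out.extend(
            {"start": s["start"], "end": s["end"], "text": t}
            for s, t in zip(chunk, translated)
        )
    return out

# Segment-level re-edit (point 4): only the edited line is re-translated.
def reedit_segment(segments: List[Segment], idx: int,
                   translate_fn: Callable[[List[str]], List[str]]) -> None:
    segments[idx]["text"] = translate_fn([segments[idx]["text"]])[0]
```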
Presenter:
Arash Salehzadeh, Data Scientist, Dropbox
About the Speaker:
I am a Data Scientist at Dropbox. My work focuses mainly on building NLP applications to analyze text data at scale. Prior to Dropbox, I worked as a data scientist at Bayer Pharmaceuticals and TD Bank.
Track: AI for Productivity Enhancements
Technical Level: 200 – Intermediate
Abstract:
Generative Large Language Models (LLMs) have transformed the landscape of Natural Language Processing (NLP). However, smaller transformer-based models such as BERT and its variants remain crucial in both supervised and unsupervised tasks. This talk explores how LLMs and traditional models can work together to create more effective NLP workflows. We will discuss how LLMs can significantly reduce the cost and effort of data labeling, enabling faster and more efficient training of classification models. Additionally, we examine unsupervised problems and discuss how LLMs can be integrated with traditional text clustering methods to extract meaningful insights from unstructured data at scale.
What You’ll Learn:
– How to efficiently create labeled datasets via Active Learning and enhance the process with LLMs
– How LLMs can simplify and scale text clustering frameworks by summarizing and extracting key insights from clusters. In addition, we explore techniques to improve the accuracy of text clustering using LLMs.
While GenAI applications are increasing rapidly, they are not replacing traditional NLP frameworks. Instead, the two can complement each other to build more powerful, scalable, and efficient NLP systems.
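A minimal sketch of the clustering-plus-LLM workflow described above, assuming a hypothetical call_llm() helper: cluster documents with a traditional method (TF-IDF plus k-means here), then ask an LLM to name each cluster from a few sample documents. The same pattern works with dense embeddings, or with LLM-generated labels feeding a supervised classifier.

```python
# Cluster documents with a traditional method, then let an LLM label the clusters.
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer

def call_llm(prompt: str) -> str:
    # Hypothetical stand-in for your LLM client.
    return "<short cluster label>"

docs = ["reset my password", "billing overcharge", "forgot login", "refund request"]
X = TfidfVectorizer().fit_transform(docs)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

for cluster_id in sorted(set(labels)):
    samples = [d for d, l in zip(docs, labels) if l == cluster_id][:5]
    name = call_llm("Give a 3-word topic label for these support tickets:\n- "
                    + "\n- ".join(samples))
    print(cluster_id, name, samples)
```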
Presenter:
Phani Dathar, Ph.D, Director, Graph Data Science, Neo4j
About the Speaker:
Phani is a Director of Graph Data Science at Neo4j. He is a computational scientist and holds a PhD in Nanotechnology and Computational Materials Science. After a decade of experience in computational sciences research in both industry and academia, he transitioned to a career in data science and machine learning, and over the past ten years he has worked as a consultant, architect, and advisor in the AI/ML space. Currently, he is with Neo4j, helping prospects and customers design, architect, and develop applications using graph technology, graph analytics, and GraphRAG.
Track: AI for Productivity Enhancements
Technical Level: 200 – Intermediate
Abstract:
Graph-based Retrieval-Augmented Generation (GraphRAG) combines the rich knowledge representation of knowledge graphs with the advanced natural language generation capabilities of LLMs, making it an essential component in building contextual, accurate, and reliable GenAI applications. By integrating multiple retrieval mechanisms and autonomous agents, we can build more advanced enterprise-grade search and decision-making AI applications. In this presentation, we will demonstrate the application of GraphRAG using knowledge graphs built on Neo4j, LLM APIs, and the agentic approach. Attendees will gain insights into the practical applications of GraphRAG and learn how to build GenAI applications that are smarter, adaptive, and more efficient.
What You’ll Learn:
Discover how GraphRAG leverages knowledge graphs, LLMs, and autonomous agents to build smarter, more reliable GenAI applications. Learn practical techniques for enterprise-grade search and decision-making using Neo4j and the agentic approach.
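A minimal GraphRAG-style sketch using the official Neo4j Python driver: pull a small subgraph of facts about an entity with Cypher, then hand those facts to an LLM as grounding context. The connection details, the (:Company) schema, and call_llm() are hypothetical and would be adapted to your own graph and LLM client.

```python
# Retrieve graph facts about an entity and use them as grounding context.
from neo4j import GraphDatabase

def call_llm(prompt: str) -> str:
    # Hypothetical stand-in for your LLM client.
    return "<grounded answer>"

driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

def graph_facts(entity_name: str, limit: int = 25) -> list[str]:
    query = (
        "MATCH (c:Company {name: $name})-[r]->(n) "
        "RETURN c.name AS s, type(r) AS p, coalesce(n.name, n.title) AS o LIMIT $limit"
    )
    with driver.session() as session:
        records = session.run(query, name=entity_name, limit=limit)
        return [f"{rec['s']} {rec['p']} {rec['o']}" for rec in records]

def graph_rag_answer(question: str, entity_name: str) -> str:
    facts = "\n".join(graph_facts(entity_name))
    return call_llm(f"Answer using only these graph facts:\n{facts}\n\nQuestion: {question}")

print(graph_rag_answer("Who supplies this company?", "Acme Corp"))
```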
Presenter:
Gautham Anil, Senior Data Scientist, Vimeo
This entry's track, technical level, abstract, and What You'll Learn are identical to the Vimeo multilingual captions and dubbing session listed above.
Presenter:
Paul Beaton, Senior Manager, Data & Analytics, Rogers Communications Inc.
About the Speaker:
Paul Beaton is an accomplished data and analytics leader with a passion for transforming data into actionable insights that drive business impact. As Senior Manager of Data and Analytics at Rogers Communications, Paul leads a high-performing team of analysts and engineers, spearheading innovative initiatives in pricing strategy, operational efficiency, and enterprise data transformation.
Track: AI for Data Preparation and Processing
Technical Level: 300 – Advanced
Abstract:
This research explores the efficacy of employing Large Language Models (LLMs) for imputing missing data, a prevalent challenge in recommender systems. I evaluate the impact of LLM-based imputation on Mean Absolute Error (MAE), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE) across a regression model, as outlined by Zhicheng et al. 2024. The performance of LLM imputation is rigorously compared against traditional methods, including case-wise deletion, zero imputation, mean imputation, KNN, and multivariate methods. Expanding upon the aforementioned study, I further investigate the lift in predictive accuracy achieved by LLM imputation under varying degrees of missing data: 10%, 15%, and 20%. My findings demonstrate the potential of LLMs to enhance data imputation, offering improved accuracy and robustness in recommender systems, particularly as the sparsity of data increases.
What You’ll Learn:
LLMs might be more effective at imputing data than traditional methods, particularly when a dataset has a lot of missing values
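A minimal sketch of the evaluation set-up described in the abstract, run on synthetic data: hide a fraction of known ratings, impute them with baseline methods, and score the imputations with MAE and RMSE against the held-back truth. An LLM-based imputer would slot in as just another imputer; it is omitted here because it depends on your LLM client.

```python
# Compare baseline imputers (mean, KNN) on masked synthetic ratings with MAE/RMSE.
import numpy as np
from sklearn.impute import KNNImputer, SimpleImputer

rng = np.random.default_rng(0)
ratings = rng.integers(1, 6, size=(200, 20)).astype(float)   # users x items

def mask(data: np.ndarray, frac: float) -> np.ndarray:
    masked = data.copy()
    holes = rng.random(data.shape) < frac
    masked[holes] = np.nan
    return masked

def score(true: np.ndarray, observed: np.ndarray, filled: np.ndarray) -> tuple[float, float]:
    holes = np.isnan(observed)
    err = filled[holes] - true[holes]
    return float(np.mean(np.abs(err))), float(np.sqrt(np.mean(err ** 2)))   # MAE, RMSE

for frac in (0.10, 0.15, 0.20):                    # degrees of missingness from the study
    observed = mask(ratings, frac)
    for name, imputer in [("mean", SimpleImputer(strategy="mean")),
                          ("knn", KNNImputer(n_neighbors=5))]:
        mae, rmse = score(ratings, observed, imputer.fit_transform(observed))
        print(f"{frac:.0%} missing | {name}: MAE={mae:.3f} RMSE={rmse:.3f}")
```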
Presenters:
Annie En-Shiun Lee, Assistant Professor, Ontario Tech University & University of Toronto (Lee Language Lab) | Mason Shipton, Programming Analyst, Ontario Teachers' Pension Plan | Alice Luong, Programmer, Ontario Government | Labib Rahman, Senior Product Manager – Interoperability, Zocdoc | York Hay Ng, Research Project Lead, Department of Computer Science, University of Toronto | Khasir Hean, Founder in Residence, Antler
About the Speakers:
Annie En-Shiun Lee is an Assistant Professor at OntarioTech University and status-only at the University of Toronto. She leads the Lee Language Lab (L³), where her research focuses on multilinguality and language diversity. Her work has appeared in venues such as Nature Digital Medicine, ACL, and ACM Computing Surveys. She has experience bridging academia and industry, previously holding roles at VerticalScope and Stradigi AI, and serves as demo co-chair for NAACL.
Mason Shipton led the development of the URIEL+ knowledge base and spearheaded the creation of a new typological resource for writing systems. This work has improved NLP performance across languages and contributed to two ACL conference papers, including one that won the Audience Award in 2024. Mason's research interests lie at the intersection of multilingual NLP, linguistic diversity, and inclusive language technology.
Alice Luong is a Computer Science and Concurrent Education student at York University and a member of the Lee Language Lab. She has expressed interest in education, language learning, and the use of tools like Google Translate to support multilingual education.
Labib Rahman is a researcher with interests in multilingual NLP and inclusivity. He authored the “Disability Language Guide,” approved by the Stanford Disability Initiative, and has participated in design thinking and innovation workshops.
York Hay Ng is a Project Developer at UofT Blueprint and a contributor to the URIEL+ project, which enhances linguistic inclusion and usability in typological and multilingual knowledge bases.
Khasir Hean is a machine learning engineer with a background in computer science and linguistics, and a founder in residence at Antler.
Track: Future Trends
Technical Level: 300 – Advanced
Abstract:
URIEL+ is an enhanced language knowledge base offering typological, phylogenetic, and geographical features for over 4,000 languages. This talk introduces URIEL+’s capabilities, including customizable distance calculations and better performance in downstream tasks. We also present ExploRIEL, an intuitive interface for easy access to URIEL+, making linguistic analysis more accessible. We explore how feature-importance techniques like PCA optimize URIEL+’s data to enhance task performance, providing a powerful tool for building more inclusive language technologies.
What You’ll Learn:
Attendees will learn how URIEL+ and its tools can be used to efficiently integrate rich linguistic features into NLP pipelines, enabling more inclusive, scalable, and high-performing multilingual systems – without needing deep linguistic expertise or manual tuning.
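As a generic illustration of the idea behind customizable, PCA-optimized language distances (not the URIEL+ or ExploRIEL API), the sketch below reduces a toy matrix of made-up typological features and compares languages by cosine distance in the reduced space.

```python
# Toy illustration: PCA-reduce typological feature vectors, then compare languages.
# The feature matrix is invented for illustration; it is not URIEL+ data.
import numpy as np
from sklearn.decomposition import PCA

languages = ["eng", "deu", "cmn", "swa"]
features = np.array([   # rows: languages, columns: hypothetical typological features
    [1, 0, 1, 1, 0, 1],
    [1, 0, 1, 0, 0, 1],
    [0, 1, 0, 1, 1, 0],
    [0, 1, 1, 1, 1, 0],
], dtype=float)

reduced = PCA(n_components=2).fit_transform(features)   # keep the most informative axes

def cosine_distance(a: np.ndarray, b: np.ndarray) -> float:
    return 1.0 - float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

for i, lang_a in enumerate(languages):
    for j in range(i + 1, len(languages)):
        print(lang_a, languages[j], round(cosine_distance(reduced[i], reduced[j]), 3))
```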
Presenter:
Bonnie Li, Research Engineer, Google DeepMind
About the Speaker:
Bonnie is a researcher at Google DeepMind working on frontier AI models and building generally intelligent agents. She worked on the Gemini Thinking models and Genie 2, and is interested in how RL can unlock new capabilities in large-scale models. Previously she worked at Nvidia and co-founded a deep tech startup backed by Khosla Ventures.
Track: Future Trends
Technical Level: 100 – Beginner
Abstract:
The dream of general AI agents—capable of learning, adapting, and acting across any task or environment—is within reach. This talk traces a path toward this vision through world models and reinforcement learning. Genie 2 introduces a new paradigm of foundation world models – generating 3D interactive worlds from any text or image. LMAct extends large language models into interactive agents by grounding them in environments. Reinforcement Learning unlocks new capabilities in LLMs by enabling LLMs to autonomously optimize rewards. Together, these developments set the stage for a new generation of agents—capable of reasoning, acting, and self-improving across diverse domains.
What You’ll Learn:
– How world models work
– How RL improves LLMs in specific tasks
Presenter:
Melica Mirsafian, Senior Applied Scientist, Thomson Reuters
About the Speaker:
Melica Mirsafian is a Senior Applied Scientist at Thomson Reuters Labs, where she leads research and development of cutting-edge Machine Learning and Generative AI solutions for highly regulated industries. Her expertise spans deploying compliant GenAI systems for U.S. Public Records and pioneering ML innovations for Thomson Reuters’ flagship legal research platform, Westlaw. With a focus on balancing technical innovation and regulatory requirements, Melica specializes in creating trusted AI systems that transform how legal and financial professionals access, analyze, and leverage critical information.
Track: GenAI Deployments In Regulated Industries
Technical Level: 300 – Advanced
Abstract:
In the high-stakes world of financial due diligence and public records analysis, there’s no room for error. Yet the promise of GenAI to transform massive 500+ page reports into actionable 3-5 page summaries was too compelling to ignore. This presentation chronicles Thomson Reuters Labs’ journey deploying the CLEAR Risk Analysis Summary tool—a GenAI solution that navigates the treacherous waters of regulatory compliance, data licensing restrictions, and the inherent challenges of language models in environments where accuracy isn’t just preferred, but legally mandated.
I’ll share our battle-tested approaches for taming hallucinations when business reputations and financial decisions hang in the balance, implementing precise citation protocols for every claim, and developing specialized prompting techniques that maintain neutrality while still highlighting genuine risks. You’ll learn how we balanced the integration of licensed data with real-time web information while ensuring FedRAMP compliance, and how our user-centric development approach led to a successfully deployed product that’s already transforming workflows for financial institutions and due diligence professionals.
Whether you’re considering GenAI deployment in financial services, healthcare, legal, or any regulated industry where precision matters, this case study offers practical strategies for navigating the technical, ethical, and regulatory minefields that stand between concept and successful deployment.
What You’ll Learn:
1. Hallucination Management in High-Stakes Environments: Practical techniques for preventing multiple types of AI hallucinations in regulated industries – from factual inaccuracies to entity confusion between similar businesses – where errors carry significant legal, financial, and reputational consequences.
2. Navigating Data Licensing and Compliance in the GenAI Era: Strategies for deploying effective GenAI solutions while respecting restrictive data licensing frameworks and regulatory requirements, including approaches when vendor restrictions limit AI use of their data.
3. Balancing Neutrality with Effective Risk Communication: Techniques for controlling LLM tone and preventing unwanted advice-giving while still effectively communicating business risks, ensuring outputs maintain the precise language standards required in due diligence contexts.
4. Multi-Source Integration with User-Centric Development: How we blend licensed data, internal records, and real-time web sources while addressing challenges of verification and web manipulations, all guided by early user engagement that accelerates adoption in conservative sectors and builds trust through responsive development and demonstrable results.
Presenter:
Amanda Milberg, Principal Solutions Engineer, DoubleWord (formerly TitanML)
About the Speaker:
Amanda is a Principal Solutions Engineer at DoubleWord with a strong interest in Generative AI business solutions. Amanda has a proven track record of assisting large institutions in business transformation efforts in the advanced analytics space and an innate ability to explain deep technical concepts to a broad audience. Amanda has a bachelor’s degree in Computer Science and Mathematics from Colgate University.
Track: GenAI Deployments In Regulated Industries
Technical Level: 200 – Intermediate
Abstract:
Generative AI and Agentic AI hold the potential to revolutionize everyday business operations. However, for highly regulated enterprises, security and privacy are non-negotiable and shared LLM API services do not provide a transparent solution. In this session, we will explore the open-source landscape and identify various applications where owning your own stack can lead to enhanced data privacy and security, greater customization, and cost savings in the long run.
Our talk will take you through the entire process, from idea to implementation, guiding you through selecting the right model, deploying it on suitable infrastructure, and ultimately building a robust AI agent. By the end of this session, attendees will gain practical insights to enhance their ability to develop high-value Generative AI applications. You will leave with a deeper understanding of how to empower your organization with self-hosted solutions that prioritize control, customization, and compliance.
What You’ll Learn:
Participants will gain a clear understanding of the options available for using large language models in highly regulated industries, particularly through self-hosting solutions. Additionally, attendees will have the opportunity to witness a live demo, transforming the session from a purely conceptual experience into a truly hands-on talk.
Presenter:
Matthieu Lemay, Co-Founder & AI Strategist, Lemay.ai
About the Speaker:
Matt Lemay is a leading expert in AI governance, compliance, and machine learning deployment for highly regulated industries. As the Co-Founder of Lemay.ai, he specializes in designing AI systems that align with ISO 42001, the EU AI Act, and other global regulatory frameworks. With deep expertise in finance, healthcare, aerospace, and defence, Matt helps organizations navigate the challenges of safe, transparent, and ethical AI implementation.
Areas of Expertise:
AI Regulation & Governance – Ensuring compliance with ISO 42001, the EU AI Act, and global AI policies.
AI in Regulated Industries – Practical experience deploying AI in high-stakes sectors like MedTech, finance, and defence.
Machine Learning Risk Management – Strategies for bias mitigation, explainability, and security in AI systems.
AI Strategy & Policy – Helping organizations adapt to emerging AI regulations and compliance challenges.
Matt is a certified ISO 42001 lead auditor advocating for responsible AI development. He speaks at global industry events, contributing to discussions on AI ethics, policy, and the future of AI governance. His work has influenced AI adoption policies in North America and Europe, making him a sought-after speaker for leaders, policymakers, and AI practitioners.
Upcoming Engagements:
Swiss Biotech Day 2025 – Speaking on AI compliance in MedTech.
Aeromart Montréal – Discussing AI risk management in aerospace and defence.
Toronto Machine Learning Summit – Presenting on ISO 42001 & the EU AI Act.
Halifax Energy Summit – Exploring AI’s role in energy efficiency and sustainability.
Dubai MedTech Conference – Addressing AI in healthcare and regulatory compliance.
Track: GenAI Deployments In Regulated Industries
Technical Level: 100 – Beginner
Abstract:
Navigating AI Compliance – ISO 42001, the EU AI Act, and the Future of Regulated AI
As artificial intelligence becomes increasingly integrated into critical sectors such as healthcare, finance, aerospace, and defence, the demand for standardized AI governance has never been higher. ISO 42001, the first international standard for AI management systems, alongside the EU AI Act, represents a shift towards regulatory oversight that prioritizes safety, transparency, and accountability in AI systems.
In this session, Matt Lemay, Co-Founder of Lemay.ai, will explore the intersection of machine learning and regulatory compliance, offering insights into how businesses can align their AI innovations with evolving global standards. The talk will cover:
– Key provisions of ISO 42001 and the EU AI Act and their implications for AI practitioners.
– Challenges in implementing compliant AI systems, including bias mitigation, security, and ethical considerations.
– Lessons from highly regulated industries, such as aerospace, MedTech, and finance, on deploying AI safely.
– Best practices for AI risk management and governance, ensuring models remain explainable, auditable, and compliant.
– Future trends in AI policy and compliance, helping businesses prepare for the next wave of AI regulation.
What You’ll Learn:
– Understanding ISO 42001 & the EU AI Act – How these frameworks shape AI governance and compliance.
– Risk & Compliance in AI Development – Addressing bias, transparency, and accountability in machine learning models.
– AI in Regulated Industries – Lessons from deploying AI in healthcare, finance, defence, and aerospace under strict regulations.
– Building Trustworthy AI – Strategies for ensuring safety, security, and ethical AI deployment.
– The Future of AI Regulation – How global policies evolve and what companies must prepare for.
Presenters:
Hoora Fakhrmoosavy, Senior Manager Research Data Science & Gen AI, BMO | Preetinder Singh, Manager, Generative AI, BMO
About the Speakers:
Dr. Hoora Fakhrmoosavy is a distinguished AI researcher and Senior Manager of Research and Data Science – Gen AI at BMO Financial Group. With a deep expertise in Natural Language Processing (NLP), Generative AI, and Data Science, she has led many projects focused on AI-driven innovation in financial services. Her work includes NLP projects, retrieval-augmented generation (RAG) systems, AI-powered chatbots, automated decision-making, and customer analytics.
Beyond her corporate role, Dr. Fakhrmoosavy is a professor, mentoring the next generation of AI professionals. She is passionate about bridging the gap between cutting-edge AI research and real-world applications, ensuring AI technologies drive meaningful business and societal impact.
Dr. Fakhrmoosavy has been recognized with multiple Spotlight Awards for her outstanding leadership and contributions to AI-driven transformation. She is also actively working in AI governance for NLP projects, ensuring the ethical and responsible deployment of AI systems in enterprise environments.
Holding a Ph.D. in AI and Data Science, she has expertise in large-scale machine learning, foundation models, NLP, and AI-driven automation. As an advocate for AI innovation, she continues to push the boundaries of NLP, Generative AI, and Data Science research to solve complex challenges and shape the future of AI-powered solutions.
Preetinder Singh is a results-driven Data Science Specialist at BMO U.S., with a strong background in machine learning, artificial intelligence, and enterprise analytics. He specializes in leveraging cutting-edge AI technologies to develop scalable solutions that enhance business decision-making and operational efficiency.
At BMO, Preetinder has played a key role in designing and implementing advanced AI-driven applications, including:
– Retrieval-Augmented Generation (RAG) Systems: Developed RAG models that enhance traditional AI-generated responses by integrating real-time, context-specific data retrieval, improving accuracy and reliability.
– Multimodal AI Chatbots: Built intelligent chatbots capable of processing both text and image-based inputs, enabling a more comprehensive and interactive customer service experience.
– Fine-Tuning Large Language Models (LLMs): Customized and fine-tuned LLMs to optimize their performance for specific financial and business applications, ensuring efficiency and relevance in enterprise AI solutions.
With a strong foundation in data engineering, model development, and cloud computing, Preetinder is committed to pushing the boundaries of AI innovation. His expertise lies in bridging the gap between research and practical implementation, ensuring that data-driven strategies translate into tangible business impact.
Outside of work, he actively explores advancements in generative AI, deep learning, and cloud-based AI solutions, continuously expanding his knowledge to stay at the forefront of technological evolution.
Track: AI for Productivity Enhancements / GenAI Deployments In Regulated Industries
Technical Level: 300 – Advanced
Abstract:
Introduction: In the evolving landscape of banking, operational efficiency and accurate information retrieval are critical for our employees. To address this, we led the development of a proof-of-concept (POC) Agentic AI chatbot, leveraging generative AI and retrieval-augmented generation (RAG) to enhance productivity of our employees. This chatbot is highly versatile, adapting its functionality based on user queries to provide comprehensive support.
Innovation and Approach: Unlike traditional chatbots, this AI-powered system dynamically adjusts to user needs, acting as a multi-functional assistant. Built using Amazon Bedrock, it employs cutting-edge foundation models to ensure accurate and context-aware responses, and it incorporates validation mechanisms to maintain high response quality.
Key Features
– Multifunctional Capabilities: Depending on user queries, the chatbot can function as:
– Code Generator: Assisting with script generation and automation support.
– KPI Chatbot: Extracting and analyzing key performance indicators for decision-making.
– Translator: Offering multilingual translation for internal communication and document processing.
– Topic Modeling & Summarization: Identifying key themes in documents and generating concise summaries.
– Insights Generator: Extracting trends and insights from structured and unstructured data.
Impact and Results: As a proof of concept, the chatbot is demonstrating significant potential to enhance operational efficiency by reducing response times, improving accuracy, and enabling banking professionals to handle complex tasks with greater confidence, increasing their productivity.
We are eager to share the journey of building and validating this innovative chatbot POC at this AI summit. Our talk will offer valuable insights into:
– Architecture & Functionality of an Agentic multifunctional AI chatbot
– Overcoming challenges in AI adoption within regulated environments
– Lessons learned from the POC and pathways for scaling AI-driven solutions
By showcasing this transformative initiative, we aim to inspire and educate professionals on leveraging AI to drive operational efficiency in banking.
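To make the query-dependent behaviour described above concrete, here is a minimal, hypothetical sketch of routing a user query to a task-specific system prompt before calling Amazon Bedrock's Converse API. The routing rules, prompts, and model ID are illustrative assumptions, not the presenters' production design.

```python
# Hedged sketch: route a user query to a task-specific prompt, then call Bedrock.
# Routing rules, prompts, and model ID are illustrative placeholders only.
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

TASK_PROMPTS = {
    "code": "You are a code generator. Return only runnable code with brief comments.",
    "kpi": "You are a KPI analyst. Extract metrics and explain their business meaning.",
    "translate": "You are a translator. Translate the text, preserving formatting.",
    "summarize": "You are a summarizer. Produce a concise, faithful summary.",
}

def route(query: str) -> str:
    """Pick a task persona from simple keyword rules (a real system might use a classifier)."""
    q = query.lower()
    if "translate" in q:
        return "translate"
    if any(k in q for k in ("kpi", "metric", "performance indicator")):
        return "kpi"
    if any(k in q for k in ("code", "script", "function")):
        return "code"
    return "summarize"

def ask(query: str) -> str:
    task = route(query)
    response = bedrock.converse(
        modelId="anthropic.claude-3-sonnet-20240229-v1:0",  # placeholder model ID
        system=[{"text": TASK_PROMPTS[task]}],
        messages=[{"role": "user", "content": [{"text": query}]}],
        inferenceConfig={"temperature": 0.2, "maxTokens": 512},
    )
    return response["output"]["message"]["content"][0]["text"]

print(ask("Write a Python script that loads a CSV and reports null counts."))
```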
What You’ll Learn:
We would like to show how an Agentic AI-powered chatbot can significantly boost employee productivity, and why it matters to design AI systems that adapt to diverse business needs, from code generation to KPI analysis. We will also share strategies for successfully implementing AI within a regulated industry like banking, along with the lessons we learned during development.
Presenter:
Amin Atashi, Senior Machine Learning Engineer, The Globe and Mail
About the Speaker:
Amin Atashi is a dedicated AI researcher and engineer specializing in generative AI. He focuses on hosting and scaling generative AI solutions that drive innovation in digital media, with a particular emphasis on transforming the news industry. Drawing on his background in optimization models and sensor fusion, Amin develops practical, scalable approaches that address real-world challenges. He has shared his insights at events like the Enterprise AI Summit Canada and the Generative AI Summit Toronto, always striving to make advanced AI solutions more accessible and impactful.
Track: GenAI Deployments In Regulated Industries
Technical Level: 300 – Advanced
Abstract:
Deploying generative AI bots in the real world is an exciting yet complex journey. In this talk, I’ll walk through the practical challenges of building, scaling, and maintaining a production-grade GenAI bot—covering everything from prompt engineering and hallucination control to infrastructure, cost management, and monitoring. Drawing from firsthand experience deploying a GenAI bot for a national media platform, I’ll share lessons learned, pitfalls to avoid, and strategies that helped turn a prototype into a reliable, user-facing product. Whether you’re just exploring GenAI or working toward deployment, this talk will offer actionable insights and hard-earned takeaways.
What You’ll Learn:
– Practical strategies for transitioning generative AI bots from prototypes to scalable, production-ready systems.
– An understanding of the key infrastructure challenges—such as hosting, latency, and reliability—and how to overcome them.
– Real-world case studies and lessons learned that illustrate how to manage and optimize AI deployments.
– Insights into the iterative process of refining AI models for robust, user-focused applications.
– How these strategies can be applied across industries to unlock the full potential of generative AI.
Presenters:
Mehdi Rezagholizadeh, Principal Member of Technical Staff, AMD | Vikram Appia, Director Software Development, AMD
About the Speakers:
Mehdi is a Principal Member of Technical Staff at AMD. Before joining AMD, he was a Principal Research Scientist at Huawei Noah’s Ark Lab Canada, where he had worked since 2017 and led the Canada NLP team for over six years. He focuses on efficient deep learning for NLP, computer vision, and speech, developing streamlined solutions for training, architecture, and inference.
Mehdi holds about 20 patents and has authored over 50 publications in leading conferences and journals, including TACL, NeurIPS, AAAI, ACL, NAACL, EMNLP, EACL, Interspeech, and ICASSP. Additionally, he has actively contributed to the academic and industrial communities by organizing prominent workshops, such as the NeurIPS Efficient Natural Language and Speech Processing (ENLSP) workshops (2021-2024) and by serving on technical committees for ACL, EMNLP, NAACL, and EACL, including as Area Chair and Senior Area Chair for NAACL 2024.
Vikram currently leads the Efficient GenAI team within the AI group at AMD. The team’s charter is to enable efficient inference and training at scale and to release open-source models and recipes that help the community maximize performance on AMD GPUs. His team focuses on training and inference for various GenAI applications across LLMs, image/video generation, and multimodal models.
Prior to joining AMD, Vikram spent about seven years at Rivian Automotive, where his team was responsible for the development and execution of both the on-board and off-board (auto-labeling) perception stack for all Rivian vehicle programs. Before that, at Texas Instruments in Dallas, he was a technical lead in the Perception and Analytics R&D labs.
He received his MS and PhD in ECE from the Georgia Institute of Technology, Atlanta. He has authored over 40 US patents and over 20 articles in refereed conferences and IEEE journals.
Track: Hardware Platforms
Technical Level: 400 – Expert
Abstract:
Large foundation models—spanning large language models (LLMs), vision models, and multi-modal models—have revolutionized both academic research and industrial applications in AI. The computational power of GPUs has played a significant role in the success of these models, impacting their development, training, and inference. As the scope of these foundation models continues to expand, the choice of model architecture, training methods, training data, and hardware computational resources becomes increasingly vital. This presentation explores various training efforts, such as pre-training, fine-tuning, and post-training, using AMD Instinct GPUs. We will delve into our public training dockers, highlighting key features designed to enhance user experience and improve accessibility. The journey begins with pre-training methodologies, where we will review a snapshot of model performance metrics and demonstrate the benefits of leveraging Multi-GPU setups. Next, we will cover fine-tuning solutions, including Full Weight Fine-tuning and Parameter Efficient Tuning (PEFT) using Megatron-LM and HF-PEFT, showcasing how the larger HBM memory of the MI300X can lead to improved training performance and accuracy. Finally, we will address post-training strategies, including the innovative process of distilling multi-head attention (MHA) into more efficient solutions such as Mamba and Multi-Head Latent Attention (MLA) layers, aimed at optimizing model efficiency and deployment readiness. Our talk will provide practical insights and frameworks for implementing these advanced techniques alongside AMD Instinct GPUs.
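As a small, hedged illustration of the PEFT portion of this workflow, the sketch below attaches LoRA adapters to a causal LM with Hugging Face peft; the model name, target modules, and hyperparameters are placeholders, and the same code is hardware-agnostic (it runs on ROCm builds of PyTorch for AMD Instinct GPUs as well as on CUDA).

```python
# Minimal PEFT (LoRA) sketch with Hugging Face Transformers + peft.
# Hardware-agnostic: the same code runs on ROCm (AMD Instinct) or CUDA builds
# of PyTorch. Model name and hyperparameters are illustrative placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "meta-llama/Llama-3.1-8B"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)

lora_cfg = LoraConfig(
    r=16,                                  # low-rank adapter dimension
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()         # only the adapter weights are trainable
# From here, train with the Trainer/TRL setup of your choice on your fine-tuning data.
```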
What You’ll Learn:
Attendees will learn about the entire deep learning workflow from pre-training to post-training while leveraging the capabilities of AMD Instinct GPUs, particularly the MI300X. They will gain insights into effective training strategies, including Full Weight Fine-tuning and Parameter Efficient Tuning (PEFT), and understand the benefits of Multi-GPU setups for enhancing model performance. Additionally, the presentation will cover innovative post-training optimization techniques, such as distilling multi-head attention into more efficient structures, equipping attendees with practical frameworks to apply in their own AI projects.
Presenter:
Soumye Singhal, Research Scientist, NVIDIA
About the Speaker:
Soumye Singhal is a Research Scientist at NVIDIA, focusing on LLM post-training and alignment for Nemotron models. Recently he has contributed to the development of Llama-Nemotron reasoning models and Nemotron-Hybrid models. His research focuses on enhancing LLM performance through inference-time compute scaling and preference optimization techniques. Prior to joining NVIDIA, he completed his Master’s at Mila under Aaron Courville and his undergraduate studies at IIT Kanpur.
Track: Inference Scaling
Technical Level: 300 – Advanced
Abstract:
This talk introduces Llama-Nemotron, an open-source family of reasoning models delivering state-of-the-art reasoning capabilities with industry-leading inference efficiency. Available in three sizes—Nano (8B), Super (49B), and Ultra (253B)—these models surpass existing open reasoning models such as DeepSeek-R1, offering substantial improvements in inference throughput and memory efficiency.
The presentation will focus primarily on the specialized training methodology underlying these models. This includes a two-stage post-training pipeline: supervised fine-tuning (SFT) using carefully curated synthetic datasets to effectively distill advanced reasoning behaviors, and large-scale reinforcement learning (RL) with curriculum-driven self-learning to enable models to exceed teacher performance.
Additionally, the talk will briefly highlight innovations such as neural architecture search (NAS) for enhanced model efficiency, targeted inference-time optimizations, and a dynamic toggle for switching reasoning on or off, emphasizing their practical importance in real-world enterprise deployments.
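As a hedged sketch of what an inference-time reasoning toggle can look like from the caller's side, the snippet below switches a system-prompt flag on an OpenAI-compatible endpoint. The exact toggle strings and model ID are assumptions for illustration; consult the model card for the released checkpoints.

```python
# Hedged sketch of an inference-time "reasoning toggle": the same model is
# steered into long chain-of-thought or terse answers via the system prompt.
# The toggle strings and model ID below are assumptions; check the model card.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="local")  # e.g. a local vLLM server

def ask(question: str, reasoning: bool) -> str:
    system = "detailed thinking on" if reasoning else "detailed thinking off"
    out = client.chat.completions.create(
        model="nvidia/Llama-3.1-Nemotron-Nano-8B-v1",  # placeholder model ID
        messages=[
            {"role": "system", "content": system},
            {"role": "user", "content": question},
        ],
        temperature=0.6 if reasoning else 0.0,
    )
    return out.choices[0].message.content

print(ask("A train travels 120 km in 1.5 hours. What is its average speed?", reasoning=True))
```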
What You’ll Learn:
Attendees will learn about effective methods for training powerful reasoning models using reasoning-focused supervised fine-tuning (SFT) and large-scale reinforcement learning (RL), enabling inference-time scaling and efficiency.
Presenter:
Alex Grbic, Vice President of Software Engineering, Untether AI
About the Speaker:
Alex Grbic is an experienced software and semiconductor executive who brings valuable software development experience for complex software products to the Untether AI team. Prior to Untether AI, Alex enjoyed a storied career at Altera (and later Intel), delivering software development flows for heterogeneous computing and AI acceleration, and served as CTO of Deloitte’s Artificial Intelligence practice in Canada.
Track: Inference Scaling
Technical Level: 300 – Advanced
Abstract:
Optimizing AI inference requires more than just high-performance hardware—it demands a tightly integrated software stack that maximizes efficiency at every level. This talk will dive into the critical role of model optimization techniques, software-driven performance tuning, and compiler advancements in accelerating AI workloads. We will explore how quantization and workload partitioning impact performance-per-watt-per-dollar, and how modern inference compilers streamline deployment on purpose-built AI accelerators.
What You’ll Learn:
Attendees will gain a deeper understanding of the trade-offs in AI inference optimization and walk away with practical insights into maximizing efficiency on specialized silicon.
Presenter:
Haytham Abuelfutuh, Union.ai
About the Speaker:
Haytham Abuelfutuh is a co-founder and CTO of Union.ai. He co-authored the popular Flyte.org ML workflow orchestration system. Haytham has been designing distributed systems and cloud applications throughout his 15-year tenure at Microsoft, Google, and Lyft.
Track: Inference Scaling
Technical Level: 300 – Advanced
Abstract:
In this talk, we will discuss the challenges of running ultra-low latency Large Language Model (LLM) inference at scale, and approaches to overcoming these challenges. We will cover the unique challenges of LLM inference, such as large model sizes and key-value caching (KV cache). We will also discuss the obstacles teams may face when scaling LLM inference to handle large volumes of requests, including: the need for specialized hardware like GPUs and TPUs; efficiently scaling up; and the need for new routing architectures. Finally, we will share some of the solutions that we’ve adopted at Union to optimize the performance of inference workloads and present it in a cloud- and platform-agnostic way, so that you can implement it in your own infrastructure.
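One way to see why these challenges arise is a back-of-envelope estimate of KV-cache memory as concurrency grows; the sketch below uses illustrative, roughly 70B-class model dimensions (an assumption, not figures from the talk).

```python
# Back-of-envelope helper: why concurrent LLM requests strain GPU memory.
# Numbers below (layers, heads, dims) are illustrative for a ~70B-class model;
# substitute your model's real config.
def kv_cache_bytes(batch, seq_len, n_layers=80, n_kv_heads=8, head_dim=128, bytes_per=2):
    # 2x for keys and values, stored per layer, per token, per KV head.
    return 2 * batch * seq_len * n_layers * n_kv_heads * head_dim * bytes_per

for batch in (1, 8, 64):
    gb = kv_cache_bytes(batch, seq_len=8192) / 1e9
    print(f"batch={batch:3d}  KV cache ~= {gb:6.1f} GB at 8k context")
# The cache alone can dwarf the model weights as concurrency grows, which is why
# paged KV caches, smart routing, and scale-out strategies matter in production.
```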
What You’ll Learn:
-Scaling LLMs, which almost always require low latency, is different than scaling other AI processes like offline workflows, and requires different considerations
-Without strategic technical approaches, scaling LLMs can quickly spiral into uncontrolled infrastructure costs and people hours
-To avoid these pitfalls, teams should consider:
–Supporting scale-to-zero inference endpoints, to avoid incurring expensive GPU costs when no one is interacting with your AI app.
–Solving for the first pitfall (cost) leads to a trade-off of cost for longer cold-start latency times, which will require further optimizations, such as container streaming, model sharding, and loading models directly to GPU memory.
–Using open source state-of-the-art inference frameworks like SGLang and vLLM will get you free performance boosts, versus investing time in rolling your own inference engine or using multi-purpose training and inference frameworks.
Presenter:
Shashank Shekhar, Independent Researcher
About the Speaker:
Shashank Shekhar is an independent researcher and consultant who has worked with startups and companies to help them build and scale data pipelines, machine learning models, and evaluation systems. Some of the companies he has consulted for include Vector Institute, Cohere, Erode AI, NextAI, and Shell. Prior to this, he was the founder of Dice Health, where he built real-time speech and language AI solutions for healthcare providers, steering the company from inception to profitability. Before that, he was a researcher on scaling laws, reasoning, and interpretability at Meta AI, Vector Institute, and the Indian Institute of Science. His research has been cited over 1,800 times and has won various awards, including the Best Paper award at NeurIPS 2022.
Track: Inference Scaling
Technical Level: 400 – Expert
Abstract:
DeepSeek has revolutionized the AI landscape with their groundbreaking DeepSeek V-3 and R-1 models. Behind the impressive performance of these models are several ingenious optimizations in both the algorithmic and computational aspects of the attention mechanism. We will set the stage for FlashMLA with an analysis of attention mechanisms in large language models. We’ll examine the algorithmic bottlenecks inherent in traditional attention implementations and introduce DeepSeek’s Multi-Head Latent Attention (MLA) as an algorithmic solution to these scaling challenges.
Building on this algorithmic foundation, we’ll pivot to compute-specific performance constraints that limit attention implementations and consequently, inference speed. We will discuss FlashAttention, a GPU aware algorithm that addresses these limitations through innovative memory access patterns. The presentation culminates in an in-depth look at how DeepSeek ingeniously combines these complementary concepts in their FlashMLA implementation, resulting in dramatically accelerated LLM inference without sacrificing model quality.
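To ground the discussion, here is a didactic NumPy sketch contrasting naive attention for a single query with a blocked "online softmax" pass over keys and values, the core trick FlashAttention builds on; the block size and shapes are toy values, and real kernels also tile queries and fuse memory I/O.

```python
# Didactic sketch (NumPy): naive attention vs. a blocked "online softmax"
# pass over keys/values -- the core idea FlashAttention builds on.
import numpy as np

rng = np.random.default_rng(0)
d, n = 64, 1024                      # head dim, sequence length
q = rng.standard_normal(d)           # a single query vector
K = rng.standard_normal((n, d))
V = rng.standard_normal((n, d))
scale = 1.0 / np.sqrt(d)

# Naive: materialize all n scores, softmax, then weight V.
scores = K @ q * scale
weights = np.exp(scores - scores.max())
naive = (weights / weights.sum()) @ V

# Online: stream over K/V blocks, keeping only a running max, normalizer, and accumulator.
m, l, acc = -np.inf, 0.0, np.zeros(d)
for start in range(0, n, 128):                      # block size 128
    s = K[start:start + 128] @ q * scale
    m_new = max(m, s.max())
    correction = np.exp(m - m_new)                  # rescale previous partial sums
    p = np.exp(s - m_new)
    l = l * correction + p.sum()
    acc = acc * correction + p @ V[start:start + 128]
    m = m_new
online = acc / l

assert np.allclose(naive, online)                   # identical math, no n x n materialization
```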
What You’ll Learn:
After this talk, attendees will be able to answer the following questions:
1. How does the complexity of attention mechanisms create a fundamental scaling bottleneck as context length increases, and what are the practical implications for training and deployment?
2. What are the tradeoffs between memory footprint and computational efficiency when implementing KV caching, and how do these tradeoffs influence system design decisions?
3. In what ways do various attention variants like MLA fundamentally transform the attention computation paradigm?
4. Why is the distinction between compute-bound versus memory-bound algorithms crucial for optimizing performance on modern GPU architectures, and how does this reframe our approach to attention implementations?
5. How can hardware-aware algorithm design (e.g. for attention) dramatically outperform naive implementations even without changing the mathematical operation being performed?
6. What memory access pattern inefficiencies does online softmax computation elegantly solve that traditional implementations struggle with?
7. How does Flash Attention’s approach to memory management and I/O optimization speed up attention computation while maintaining mathematical equivalence?
8. How does FlashMLA combine the algorithmic benefits of Multi-head Latent Attention with the hardware-optimized implementation techniques of Flash Attention?
Presenter:
Xin Wang, Director of Machine Learning, d-Matrix Corporation
About the Speaker:
Dr. Xin Wang is the Director of Machine Learning at d-Matrix, an AI accelerator company working to push generative AI inference to record-breaking efficiency. Prior to his current role, he built a career with AI semiconductor industry leaders Qualcomm, Intel, and Cerebras. He is active in the fields of efficient machine learning, algorithm-hardware co-design, and brain-inspired computation.
Track: Inference Scaling
Technical Level: 300 – Advanced
Abstract:
In this talk, we will try to view the decade-old theories and practices of network quantization from a renewed perspective in the age of large language models (LLMs). As scaling laws can reliably predict the return of model quality on the investment of training-time computation, high uncertainty lingers after post-training quantization at inference-time for deployment. What extra factors govern the scaling of LLMs surviving quantization? Can we devise empirical scaling laws that shed light on the effectiveness of LLM quantization? Further, can we gain new theoretical insights into the ease or difficulty in the practice of network compression? We shall attempt some answers through the lens of recent work from my group.
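For readers new to the topic, the toy sketch below shows what post-training quantization of a weight matrix looks like in its simplest form (per-channel symmetric int8); it is illustrative only, and the scaling-law questions raised in the talk go well beyond it.

```python
# Toy post-training quantization sketch: per-channel symmetric int8 of a weight
# matrix. Real LLM PTQ methods (GPTQ, AWQ, SmoothQuant, ...) add calibration
# data and error compensation on top of this basic idea.
import numpy as np

def quantize_int8(W: np.ndarray):
    # One scale per output channel (row), symmetric around zero.
    scale = np.abs(W).max(axis=1, keepdims=True) / 127.0
    q = np.clip(np.round(W / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
W = rng.standard_normal((4096, 4096)).astype(np.float32) * 0.02
q, scale = quantize_int8(W)
err = np.abs(W - dequantize(q, scale)).mean()
print(f"mean absolute quantization error: {err:.2e}")
```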
What You’ll Learn:
– Quantization is essential in achieving high inference-time efficiency in practice.
– Training-time scaling laws do not fully predict inference-time effectiveness of quantized LLMs.
– We provide new insights into the scaling of quantized LLMs, also achieving a deeper understanding of network quantization.
Presenter:
N. Taylor Mullen, Senior Staff Engineer Cloud AI, Google
About the Speaker:
N. Taylor Mullen is a Senior Staff Software Engineer at Google building tools for the future AI developer. Before joining Google, Taylor was a key driver in GitHub Copilot’s strategic vision and served as the tech lead for GitHub Copilot in Visual Studio. He brings a deep background in developer tooling, OSS and all things generative AI. A self-proclaimed serial side-projectist, Taylor is passionate about developing products, teams and ideas to build AI that’s more R2-D2 and less Terminator.
Track: Negative Results
Technical Level: 100 – Beginner
Abstract:
The directive is clear: integrate AI. Organizations globally are pushing teams to explore AI’s potential, leading to an unprecedented wave of experimentation. But this enthusiasm often crashes against unseen organizational barriers, creating significant growing pains, often leading to hindered progress and heightened employee burnout. This talk delves into the critical, yet often overlooked, organizational challenges that arise when moving from top-down AI mandates to sustainable, impactful implementation. By illustrating several common organizational pitfalls, the session seeks to raise awareness, empowering attendees with the foresight to recognize and smooth their own path toward impactful AI integration.
What You’ll Learn:
By illustrating several common organizational pitfalls, the session seeks to raise awareness, empowering attendees with the foresight to recognize and smooth their own path toward impactful AI integration.
Presenter:
Leland McInnes, Researcher, Tutte Institute
About the Speaker:
Leland McInnes is a researcher at the Tutte Institute for Mathematics and Computing in Ottawa, Canada. He works in unsupervised learning, and topological techniques for machine learning. Among his contributions are the UMAP algorithm for dimension reduction, and the accelerated HDBSCAN algorithm for clustering. He maintains many open source data science tools, including UMAP, HDBSCAN, PyNNDescent, DataMapPlot and Toponymy.
Track: Traditional ML
Technical Level: 400 – Expert
Abstract:
Unsupervised learning is a diverse field that includes clustering, dimension reduction, anomaly detection, and density estimation. Many of the algorithms in the field are decades old and designed for low-dimensional tabular data. Now, with neural embeddings unlocking unstructured data, we face a world of high-dimensional data where old assumptions and intuitions do not hold. We’ll look at why classical unsupervised learning problems are still incredibly relevant today, why high-dimensional data breaks many of the standard algorithms, and how we can start to move forward and build new algorithms designed from the ground up for the high-dimensional data of the modern world.
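As a concrete starting point, the classic pipeline for clustering high-dimensional neural embeddings combines dimension reduction with UMAP and clustering with HDBSCAN, as sketched below; the parameter values and input file are illustrative assumptions, not recommendations from the talk.

```python
# Hedged sketch: reduce high-dimensional neural embeddings with UMAP, then
# cluster with HDBSCAN. Parameter values are illustrative starting points.
import numpy as np
import umap
import hdbscan

embeddings = np.load("document_embeddings.npy")   # e.g. (n_docs, 768) sentence embeddings

reducer = umap.UMAP(n_neighbors=15, n_components=5, metric="cosine", random_state=42)
reduced = reducer.fit_transform(embeddings)

clusterer = hdbscan.HDBSCAN(min_cluster_size=25, min_samples=5)
labels = clusterer.fit_predict(reduced)           # -1 marks points left as noise

print(f"{labels.max() + 1} clusters found, {(labels == -1).sum()} points labelled noise")
```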
What You’ll Learn:
We need to rethink classical unsupervised learning (clustering, anomaly detection, etc.) in light of high-dimensional data representations from neural embedding methods.
Presenter:
Nima Safaei, Senior Data Scientist, Scotiabank
About the Speaker:
Nima has a Ph.D. in systems and industrial engineering with a background in applied mathematics. He held a postdoctoral position at the C-MORE Lab (Center for Maintenance Optimization & Reliability Engineering), University of Toronto, Canada, working on machine learning and operations research (ML/OR) projects in collaboration with various industry and service sectors. He was with the Department of Maintenance Support and Planning at Bombardier Aerospace, focusing on ML/OR methods for reliability/survival analysis, maintenance, and airline operations optimization. Nima is currently a senior data scientist with the Data Science & Analytics (DSA) lab at Scotiabank, Toronto, Canada. He has more than 40 peer-reviewed articles and book chapters published in top-tier journals, as well as one published patent. He has also been invited to present his findings at top ML conferences such as GRAPH+AI, NVIDIA GTC, TMLS, NeurIPS, and ICML.
Track: Traditional ML
Technical Level: 300 – Advanced
Abstract:
In the current literature on multivariate time series forecasting, causal inference is predominantly utilized for feature selection. In this talk, a novel causal model is introduced that can be directly employed for prediction without requiring supplementary forecasting models. The proposed model detects the causal relationships using Xi-Correlation (XiCorr) method, a new nonparametric statistic based on the rank of the data, in conjunction with Dynamic Conditional Correlation (DCC) method to study time varying causality among the variables. The target variable is forecasted via polynomial regression using the immediate causal predictors detected by XiCorr and verified by DCC method. The causal scores explain the effect of each predictor. The model was tested on a Canadian macroeconomic indexes dataset, showing promising backtest results.
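For reference, a minimal sketch of the rank-based Xi correlation (Chatterjee's coefficient, the statistic referred to as XiCorr) is shown below; this is the no-ties version only, and the talk's full model additionally uses DCC and polynomial regression, which are not reproduced here.

```python
# Minimal sketch of the rank-based Xi correlation (Chatterjee, 2020), the
# dependence measure referred to as XiCorr. No-ties version for illustration.
import numpy as np

def xi_correlation(x: np.ndarray, y: np.ndarray) -> float:
    n = len(x)
    order = np.argsort(x)                            # sort pairs by x
    ranks = np.argsort(np.argsort(y[order])) + 1     # ranks of y, taken in x-order
    return 1.0 - 3.0 * np.abs(np.diff(ranks)).sum() / (n**2 - 1)

rng = np.random.default_rng(0)
t = rng.uniform(-3, 3, 2000)
print(round(xi_correlation(t, np.sin(t) + 0.05 * rng.standard_normal(2000)), 3))  # high: y is (nearly) a function of x
print(round(xi_correlation(t, rng.standard_normal(2000)), 3))                     # near zero: independent
```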
What You’ll Learn:
While causal inference is an essential tool for explainability, it cannot be exclusively relied upon for prediction tasks. Instead, in the current context, it serves primarily for feature selection. The objective of this talk is to show how two capabilities, causal explainability and forecasting, can be combined as a single product.
Presenter:
Javad Farshchi, Data Scientist and Software Quality, COLTENE SciCan
About the Speaker:
Javad Farshchi is a Data Scientist specializing in AI-driven predictive maintenance, machine learning in regulated industries, and real-world AI applications in the healthcare and medical equipment industry. With expertise in scalable AI solutions and cloud-based ML deployments, he has played a pivotal role in driving enterprise-wide AI initiatives to enhance productivity and operational efficiency.
A published author of multiple peer-reviewed scientific articles, Javad has contributed to advancements in biosensors, microfluidics, and AI-driven diagnostics. His research has been presented at international conferences and published in leading scientific journals, bridging academia and industry to bring AI innovations into real-world applications.
At COLTENE SciCan, he has successfully led AI initiatives for predictive maintenance, working toward significantly reducing downtime in medical devices and improving reliability in industrial settings. Additionally, he has advocated for and led the integration of generative AI to streamline internal workflows, optimize information retrieval, and enhance data-driven decision-making across the organization.
Track: Traditional ML
Technical Level: 300 – Advanced
Abstract:
In medical devices and other regulated industries, unplanned downtime isn’t just costly—it can compromise patient safety, disrupt clinical workflows, and delay critical procedures. AI-driven predictive maintenance is transforming how organizations detect faults, prevent failures, and ensure reliability in high-stakes environments.
This talk explores how machine learning models are being used to predict failures in medical devices, highlighting the real-world impact of AI on device uptime, compliance, and operational efficiency. Topics covered include:
– Time-series modeling & anomaly detection for early fault detection in medical equipment
– ML techniques for predictive maintenance, including supervised and unsupervised learning
– Challenges and solutions for deploying AI in regulated environments
– Latest FDA guidelines and draft guidance for AI-enabled medical devices
– Lessons learned from real-world AI applications in healthcare technology
Attendees will walk away with practical insights into how AI is reshaping reliability, reducing downtime, and improving safety in regulated industries like healthcare—all while navigating the evolving regulatory landscape.
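As a generic, hedged illustration of the kind of unsupervised anomaly detection mentioned above (not the presenter's system), the sketch below scores daily device telemetry with an Isolation Forest; the column names, file name, and aggregation choices are assumptions for the example.

```python
# Illustrative sketch: unsupervised anomaly scoring of device telemetry with an
# Isolation Forest. Column names and the windowing are assumptions for the example.
import pandas as pd
from sklearn.ensemble import IsolationForest

telemetry = pd.read_csv("sterilizer_telemetry.csv", parse_dates=["timestamp"])

# Summarize each device-day into simple features; real systems use richer windows.
telemetry["day"] = telemetry["timestamp"].dt.floor("D")
features = telemetry.groupby(["device_id", "day"]).agg(
    temp_mean=("chamber_temp", "mean"),
    temp_std=("chamber_temp", "std"),
    pressure_max=("pressure", "max"),
).dropna()

model = IsolationForest(n_estimators=200, contamination=0.01, random_state=0)
model.fit(features)
features["anomaly"] = model.predict(features) == -1   # True = candidate early fault
print(features[features["anomaly"]].sort_index().tail())
```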
What You’ll Learn:
– AI-driven predictive maintenance enhances reliability: Machine learning models help detect faults early, reducing downtime and improving equipment performance.
– Regulatory challenges must be addressed: AI in medical devices requires compliance with evolving FDA regulations and careful risk management.
– Understanding FDA guidance for AI-enabled medical devices: Insights into the latest FDA draft guidelines and their impact on AI adoption in regulated industries.
– Best practices for AI in predictive maintenance: Effective model selection, validation, and monitoring strategies for real-world applications.
– The future of AI in reliability & safety: How AI is shaping next-gen predictive analytics, maintenance strategies, and compliance frameworks.
Presenter:
Erfan Pirmorad, Senior AI/ML Scientist, Vanguard
About the Speaker:
Erfan Pirmorad is a Senior AI/ML Scientist at Vanguard, where he leads the development of AI/ML solutions to detect and prevent financial fraud and enhance security across the enterprise. With a strong background in graph-based machine learning, large-scale data systems, and fraud detection, Erfan designs and deploys advanced systems that blend statistical modeling with graph intelligence. His work bridges theory and practice to deliver scalable, explainable, and high-impact solutions in financial risk and security domains.
Track: Traditional ML
Technical Level: 200 – Intermediate
Abstract:
Financial fraud is increasingly complex, evolving beyond isolated events into networks of coordinated activity. Detecting such threats demands models that not only assess individual risk, but also reason over the relationships between entities. At Vanguard, we’ve reimagined fraud detection as a structured reasoning problem, using graph-based machine learning to capture the complex relationships that underlie suspicious behavior.
This talk explores how graph-ML is deployed at scale to uncover hidden connections, detect emerging fraud patterns, and surface relational signals often missed by traditional approaches. We’ll walk through the architecture of a production-ready system that transforms raw signals into a dynamic, evolving graph, enabling proactive discovery and stronger risk mitigation.
Along the way, we’ll highlight how structured reasoning enhances model transparency, supports investigative workflows, and aligns machine learning outputs with business and regulatory needs. Whether you’re building fraud models or exploring graph learning for other domains, this session will offer practical insights into deploying graph-based intelligence in high-stakes environments.
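As an illustrative, simplified sketch of the graph framing (not Vanguard's production system), the snippet below links accounts and devices in a graph and flags components where many accounts share few devices; the entity names and the rule threshold are assumptions.

```python
# Illustrative sketch: represent accounts, devices, and payments as a graph and
# surface suspicious connected components. Names and thresholds are assumptions.
import networkx as nx

edges = [
    # (entity_a, entity_b, relationship)
    ("acct_1", "device_9", "logged_in_from"),
    ("acct_2", "device_9", "logged_in_from"),   # shared device links two accounts
    ("acct_2", "acct_7", "transferred_to"),
    ("acct_3", "device_4", "logged_in_from"),
]

G = nx.Graph()
for a, b, rel in edges:
    G.add_edge(a, b, relationship=rel)

# Flag components where many accounts share few devices, a common collusion signal.
for component in nx.connected_components(G):
    accounts = {n for n in component if n.startswith("acct")}
    devices = {n for n in component if n.startswith("device")}
    if devices and len(accounts) / len(devices) >= 2:
        print("review:", sorted(component))
```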
What You’ll Learn:
– A practical framework for modeling fraud as a graph problem, capturing the complex relationships between accounts, devices, and behaviors in a way that surfaces hidden patterns.
– Key considerations for deploying graph machine learning at scale, including architectural choices that enable real-time updates and operational integration within high-stakes environments.
– Insights into how graph intelligence strengthens fraud detection, allowing teams to move from reactive investigations to proactive discovery of emerging fraud networks.
– How structured reasoning over graph data supports transparency and explainability, helping align machine learning insights with investigative workflows and regulatory expectations.
Presenter:
Luis Serrano, Founder & CEO, Serrano.Academy
About the Speaker:
Luis Serrano is the author of Grokking Machine Learning, and the creator of the popular educational YouTube channel Serrano.Academy, with over 175K followers. Luis has worked in artificial intelligence and language models at Cohere, Google, and Apple, and as a quantum AI research scientist at Zapata Computing. He has popular machine learning courses on platforms such as Udacity and Coursera. Luis has a PhD in mathematics from the University of Michigan, a masters and bachelors from the University of Waterloo, and did a postdoctoral fellowship at the University of Quebec at Montreal.
Track: Traditional ML
Technical Level: 100 – Beginner
Abstract:
In this talk, we’ll learn how embeddings and the attention mechanism work in transformers, using a physical analogy of how objects gravitate towards each other in space. This is intended for all audiences, from beginner to expert.
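In the same spirit as the talk's analogy, the tiny NumPy sketch below shows each token embedding being pulled toward the others with softmax-weighted strength, which is one step of simplified, single-head attention without learned projections; the embeddings are toy values.

```python
# Tiny NumPy sketch: each token embedding is pulled toward the others, with pull
# strength given by softmax similarity -- one step of simplified attention.
import numpy as np

tokens = ["bank", "river", "money"]
E = np.array([[0.9, 0.1],    # toy 2-D embeddings
              [0.1, 0.9],
              [0.8, 0.2]])

scores = E @ E.T                                                        # pairwise similarity
weights = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)    # softmax over rows
attended = weights @ E                                                  # each token moves toward its neighbours

for tok, before, after in zip(tokens, E, attended):
    print(f"{tok:6s} {before} -> {np.round(after, 2)}")
```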
What You’ll Learn:
Embeddings and the attention mechanism are some of the core concepts in large language models. In this talk, we lift the curtain on them, and show that they are actually very simple and intuitive.
Presenter:
Josh Goldstein, Solutions Architect, Weaviate
About the Speaker:
Josh Goldstein is a seasoned search engineer who specializes in building intelligent retrieval systems that bridge the gap between machine learning and meaning. With over a decade of experience across enterprise search, MLOps, and production-grade infrastructure, Josh has architected large-scale solutions that help people find what matters. When he’s not coding, you can find him playing racket sports, running a marathon, or regretting mechanical bulls.
Track: Vertical Enterprise AI Agents In Production
Technical Level: 200 – Intermediate
Abstract:
Enterprises are rapidly embracing AI agents to automate workflows, augment decision-making, and transform customer experiences. As these agentic systems grow in complexity, so does the overhead for users and developers. Bespoke data agents can help developers build and move use cases into production faster.
This talk demonstrates how purpose-built data agents solve critical challenges in data discovery, integration, and governance that limit enterprise AI effectiveness. Through real-world case studies, developers and users will see how bespoke agents such as data agents deliver value to customers faster.
What You’ll Learn:
The value of bespoke agents (data agents) that expedite time to production by reducing the need to build out parts of complex agentic workflows, letting teams focus on the value of the overall agentic use case.
Presenter:
Cong Wei, PhD Student, University of Waterloo
About the Speaker:
Cong Wei is a second-year PhD student in Computer Science at the University of Waterloo, supervised by Prof. Wenhu Chen, and a recent research intern at Meta. His research focuses on generative AI and multimodal LLMs, with a particular interest in diffusion models for simulation and digital humans. He has published at top-tier conferences such as ECCV, CVPR, and ICLR.
Track: Multimodal LLMs
Technical Level: 300 – Advanced
Abstract:
Recent advancements in video generation have achieved impressive motion realism, yet they often overlook character-driven storytelling—a crucial component for automated film and animation generation. We introduce Talking Characters, a more realistic task that involves generating full-body character animations directly from speech and text. Unlike traditional talking head generation, Talking Characters aims to produce the full portrait of one or more characters, extending beyond the facial region.
In this work, we propose MoCha, the first model of its kind for generating talking characters. To ensure precise synchronization between video and speech, we introduce a speech-video window attention mechanism that effectively aligns audio and visual tokens. To address the lack of large-scale speech-labeled video datasets, we propose a joint training strategy that leverages both speech-labeled and text-labeled videos, significantly improving generalization across diverse character actions.
We also design structured prompt templates with character tags, enabling—for the first time—multi-character conversations with turn-based dialogue. This allows AI-generated characters to engage in context-aware interactions with cinematic coherence. Extensive qualitative and quantitative evaluations, including human preference studies and benchmark comparisons, show that MoCha sets a new standard in AI-generated cinematic storytelling, achieving superior realism, expressiveness, controllability, and generalization.
What You’ll Learn:
Automated filmmaking and digital humans represent the future of storytelling, and MoCha takes a meaningful step toward making that future a reality.
Presenter:
Weiming Ren, PhD Student, University of Waterloo
About the Speaker:
Weiming Ren is a second year Ph.D. student at the Cheriton School of Computer Science, University of Waterloo, supervised by Prof. Wenhu Chen. His research interests include designing efficient model architectures and data curation pipelines to enhance large multimodal models (LMMs) for image and video understanding, as well as developing novel algorithms for controllable video generation, image and video editing, and image restoration.
Track: Multimodal LLMs
Technical Level: 300 – Advanced
Abstract:
State-of-the-art transformer-based large multimodal models (LMMs) struggle to handle hour-long video inputs due to the quadratic complexity of the causal self-attention operations, leading to high computational costs during training and inference. Existing token compression-based methods reduce the number of video tokens but often incur information loss and remain inefficient for extremely long sequences. In this work, we explore an orthogonal direction to build a hybrid Mamba-Transformer model (VAMBA) that employs Mamba-2 blocks to encode video tokens with linear complexity. Without any token reduction, VAMBA can encode more than 1024 frames (640×360) on a single GPU, while transformer-based models can only encode 256 frames. On long video input, VAMBA achieves at least 50% reduction in GPU memory usage during training and inference, and nearly doubles the speed per training step compared to transformer-based LMMs. Our experimental results demonstrate that VAMBA improves accuracy by 4.6% on the challenging hour-long video understanding benchmark LVBench over prior efficient video LMMs, and maintains strong performance on a broad spectrum of long and short video understanding tasks.
What You’ll Learn:
We develop a novel hybrid Mamba-Transformer model and show that hybrid models can achieve strong results for long video understanding tasks.
Presenter:
Nima Eshraghi, Machine Learning Engineer, Walmart Global Tech
About the Speaker:
Nima Eshraghi is a Senior Machine Learning Engineer at Walmart Global Tech, specializing in Multi-Modal Learning, Generative AI, and Computer Vision for augmented reality, image generation, and personalization in the retail industry. Prior to that, he obtained a PhD from the University of Toronto in Electrical and Computer Engineering and has published in top conferences like ICML.
Track: Multimodal LLMs
Technical Level: 200 – Intermediate
Abstract:
In today’s retail landscape, high-quality product imagery is essential for driving customer engagement. However, traditional photoshoots are expensive and time-consuming, requiring products to be shipped to studios and meticulously staged. This process limits flexibility and delays content production, especially when adapting images for different seasons, styles, or marketing campaigns. This talk explores how Generative AI, powered by diffusion models, is transforming retail personalization by enabling the creation of customized, high-fidelity product images on demand—without the need for physical photoshoots. We will dive into advanced diffusion-based techniques, showcasing how they allow retailers to generate new images of specific products with fine-grained control. These methods empower brands to seamlessly adapt product visuals for different settings, atmospheres, and promotions.
By leveraging these AI-driven techniques, retailers can significantly reduce production costs, accelerate content creation, and enhance personalization, leading to a more dynamic and engaging shopping experience.
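A minimal, hedged sketch of the underlying workflow is shown below: regenerating the backdrop around a fixed product with a Hugging Face diffusers inpainting pipeline. The checkpoint, file names, and prompt are placeholders, and production systems add product-preserving controls such as segmentation masks and ControlNet or LoRA conditioning.

```python
# Hedged sketch: swap the backdrop around a fixed product with a diffusion
# inpainting pipeline (Hugging Face diffusers). Checkpoint, files, and prompt
# are placeholders, not a production retail pipeline.
import torch
from diffusers import AutoPipelineForInpainting
from PIL import Image

pipe = AutoPipelineForInpainting.from_pretrained(
    "stabilityai/stable-diffusion-2-inpainting", torch_dtype=torch.float16
).to("cuda")

product = Image.open("sofa_studio_shot.png").convert("RGB")
mask = Image.open("background_mask.png").convert("L")   # white = regenerate, black = keep product

result = pipe(
    prompt="cozy autumn living room, warm evening light, product photography",
    image=product,
    mask_image=mask,
    num_inference_steps=30,
).images[0]
result.save("sofa_autumn_campaign.png")
```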
What You’ll Learn:
Attendees will learn how Generative AI with diffusion models replaces costly photoshoots by generating personalized, high-quality images, offering flexibility across different seasons and settings. They will discover how Generative AI reduces production costs and accelerates content creation. This approach provides scalability and enables dynamic personalization for retail imagery.
Presenter:
Hanieh Arjmand, PhD, Senior Machine Learning Researcher, Lydia.ai
About the Speaker:
Hanieh Arjmand is a Senior Machine Learning Researcher at Lydia.ai, where she designs and implements advanced machine learning models to tackle complex challenges in healthcare and insurance. She holds a PhD in Biomedical Engineering from the University of Toronto and brings deep expertise in applying AI to biomedical and health data. Throughout her academic and professional career, Hanieh has led diverse, data-driven research initiatives that drive innovation, support better clinical decision-making, and improve health outcomes.
Track: Multimodal LLMs
Technical Level: 300 – Advanced
Abstract:
This talk introduces a novel multimodal framework for disease prediction that integrates structured Electronic Health Records (EHR) and wearable time series data into a unified embedding space optimised for interpretation by Large Language Models (LLMs). While multimodal LLMs have shown promise in vision, audio, and text, applying them to healthcare presents unique challenges, including temporal dynamics, heterogeneous formats, and the need for clinical interpretability.
To address this, the system uses modality-specific encoders to transform each input stream into compact latent representations. These are integrated into a shared embedding space, allowing the LLM to reason jointly across modalities. By training the entire system end to end, including the LLM itself, the model learns rich, context-aware representations that link current behavioural signals to broader clinical trajectories. The architecture also supports auxiliary context, such as demographics or prompt instructions, embedded directly into the LLM’s input space, enabling dynamic adaptation to specific tasks or patient profiles.
Evaluation on UK Biobank data (n ≈ 70K) shows that the system outperforms single-modality baselines and that wearable data meaningfully influence predictions when integrated with EHR (correlation r = 0.771). While demonstrated on two modalities, the framework is inherently modular and can be extended to include additional data sources, such as nutrition or imaging, by introducing corresponding encoders.
This work illustrates how LLMs can evolve into adaptive, multimodal engines for real-time, patient-centric care, capable of synthesising diverse health data to support earlier interventions, continuous monitoring, and personalised clinical decision-making.
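The conceptual PyTorch sketch below illustrates the general pattern of modality-specific encoders projecting EHR codes and wearable time series into a shared embedding space that is prepended to an LLM's input; the dimensions, module choices, and the interface to the LLM are assumptions, not the presenters' architecture.

```python
# Conceptual sketch: modality-specific encoders map EHR codes and wearable time
# series into a shared embedding space to be prepended to an LLM's input tokens.
# Dimensions and module choices are illustrative placeholders.
import torch
import torch.nn as nn

class EHREncoder(nn.Module):
    def __init__(self, n_codes=10_000, d_model=768):
        super().__init__()
        self.embed = nn.EmbeddingBag(n_codes, d_model)      # bag of diagnosis/procedure codes
        self.proj = nn.Linear(d_model, d_model)

    def forward(self, code_ids):                            # (batch, codes_per_patient)
        return self.proj(self.embed(code_ids)).unsqueeze(1) # (batch, 1, d_model)

class WearableEncoder(nn.Module):
    def __init__(self, n_channels=6, d_model=768):
        super().__init__()
        self.conv = nn.Conv1d(n_channels, d_model, kernel_size=9, stride=4)
        self.pool = nn.AdaptiveAvgPool1d(4)                 # 4 summary tokens per window

    def forward(self, series):                              # (batch, channels, time)
        return self.pool(self.conv(series)).transpose(1, 2) # (batch, 4, d_model)

ehr_tokens = EHREncoder()(torch.randint(0, 10_000, (2, 32)))
wear_tokens = WearableEncoder()(torch.randn(2, 6, 4096))
multimodal_prefix = torch.cat([ehr_tokens, wear_tokens], dim=1)  # (batch, 5, d_model)
# This prefix would be concatenated with text-prompt embeddings and fed to the
# LLM, with the whole stack trained end to end on the prediction objective.
print(multimodal_prefix.shape)
```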
What You’ll Learn:
LLMs can move beyond text by reasoning over structured and temporal health data—enabling real-time, personalised clinical decision-making through modular, end-to-end multimodal architectures.
Presenters:
Ahmad Pesaranghader, Applied Research Scientist, CIBC | Jamal Kawach, Applied Research Scientist, CIBC | Yaqi Han, Quantum ML Research Scientist, CIBC
About the Speakers:
Ahmad Pesaranghader is a research scientist at CIBC. He obtained a Ph.D. in Computer Science from Dalhousie University, focusing on Machine Learning and Big Data. With years of experience in academia and industry, he has worked with various AI models and data modalities, including text, images, and biomedical data. In his free time, Ahmad enjoys exploring the city, photographing landmarks, visiting coffee shops, listening to music and science podcasts, playing guitar, reading, cooking, and experimenting with AI art.
Jamal Kawach is an Applied AI Research Scientist at CIBC. He is a mathematician with extensive research experience in mathematics and its applications to large networks, graph theory, and related structures. He obtained his Ph.D. in mathematics from the University of Toronto in 2021 before working as a postdoctoral researcher at the Computer Science Institute at Charles University, Prague. In addition to his work in academic research, he also has substantial teaching experience at the University of Toronto, where he teaches multivariable calculus and collaborates with teams of instructors to make mathematics accessible, relevant, and engaging.
Yaqi Han is a Quantum ML Research Scientist at CIBC. She holds a PhD in Astrophysics from the University of Florida and a BSc in Physics from Peking University. Prior to joining CIBC, Yaqi worked as a computational scientist at York University on large-scale systems like galactic halos and analyzing data for dark matter detection. Previously Yaqi was a Quantum Stream Fellow at Creative Destruction Lab working on Quantum ML algorithms. In her work, Yaqi developed solutions with language models, vision models, and other ML optimization techniques. Yaqi is passionate about untangling complex systems—whether in quantum algorithms, neural networks, or the fabric of the universe—and turning theoretical insights into scalable solutions. Outside of work, Yaqi enjoys reading, baking, and playing Dungeons & Dragons (would 100% nerd out).
Track: GenAI AI Ethics And Governance Within The Organization
Technical Level: 200 – Intermediate
Abstract:
As large language models (LLMs) grow increasingly larger, it is no longer possible to train them effectively with the memory capacity of single or a few GPUs. Practitioners must leverage multiple parallelization strategies simultaneously to achieve efficient training at scale. These strategies rely on strategic data and parameter sharding across devices and efficient collective communication operations to synchronize gradients and activations. This hands-on workshop will cover fundamental parallelism dimensions—data, tensor, and pipeline parallelism—and how to compose them effectively for training billion-parameter models. We will also cover some recent LLM specific parallelization techniques such as context parallelism. Through live coding and practical exercises, participants will implement each strategy from first principles, understand their trade-offs, and learn to optimize communication patterns and memory usage for maximum training throughput across distributed hardware.
What You’ll Learn:
– Understanding data and model sharding, and collectives involved in implementing sharded training
– Core principles of data, tensor, pipeline, and context parallelism for distributed LLM training
– How to compose multiple parallelism strategies and optimize communication patterns within and across devices
– Practical advice for memory management, and performance profiling in distributed settings
Presenters:
Greg Loughnane, Co-Founder & CEO, AI Makerspace | Chris Alexiuk, Co-Founder & CTO, AI Makerspace
About the Speakers:
Dr. Greg Loughnane is the Co-Founder & CEO of AI Makerspace, where he is an instructor for their AI Engineering Bootcamp. Since 2021, he has built and led industry-leading Machine Learning education programs. Previously, he worked as an AI product manager, a university professor teaching AI, an AI consultant and startup advisor, and an ML researcher. He loves trail running and is based in Dayton, Ohio.
Chris “The Wiz” Alexiuk is the Co-Founder & CTO at AI Makerspace, where he is an instructor for their AI Engineering Bootcamp. During the day, he is also a Developer Advocate at NVIDIA. Previously, he was a Founding Machine Learning Engineer, Data Scientist, and ML curriculum developer and instructor. He’s a YouTube content creator whose motto is “Build, build, build!” He loves Dungeons & Dragons and is based in Toronto, Canada.
Track: GenAI Deployments In Regulated Industries
Technical Level: 400 – Expert
Abstract:
While 2025 might be the year of agents for AI Engineers, it’s the year of practical RAG for enterprise and AI Engineering leaders.
In other words, RAG is table stakes; it’s a best-practice. If your organization isn’t even experimenting with RAG today, you’re behind.
The good news is that best-practice tools and techniques exist. That means that you and your team can pick up open-source Commercial-Off-The-Shelf (COTS) tools to build your first RAG application today.
🙋 But wait, what is the “Best-Practice RAG Application Stack?”
In this event, we share the minimum viable production-ready LLM app stack for building and evaluating your next RAG Application. Then, we’ll share how to baseline it and start improving it. Finally, we’ll comment on what you should think about to ensure that it will work well within your existing enterprise production setup and for your customers or stakeholders.
We’ve been testing out frameworks and tools with our students in [The AI Engineering Bootcamp](https://aimakerspace.io/the-ai-engineering-bootcamp/), our consulting customers, and on our YouTube channel for years now.
For 2025, we believe **there is a correct stack**.
Join us to discover what it is, and why!
We’ll explore:
– 🎺 Our pick for the best **orchestration framework**: LangGraph
– 🎺 Our pick for the best **monitoring & Visibility**: LangSmith
– ↗️ Our pick for the best **vector database**: QDrant
– 📊 Our pick for the best way to enhance retrieval out of the box: Cohere’s Rerank
– 📐 Our pick for the best **evaluation framework**: RAGAS
– 🚀 Our pick for the best **model serving** endpoint solution: Together AI
– 🤖 Our pick for the best **LLM** and **embedding model**: Join us live to find out!
In this event, we’ll also break down the phases of moving from prototype to production in enterprise, including:
– **Phase I: On-Prem Demo** (POC/MVP) with Executive/VP/Director buy-In
– **Phase II: Refined On-Prem Demo** with Engineering buy-In
– **Phase III: Data Preparation & Quality Validation** with buy-in from architects, data practitioners, and security
– Phase IV: **Beta Testing** with customer/stakeholder buy-in
– Phase V: **Scaling** a User-Friendly Product with product/design buy-in
And answer the question “what happens when we need to move from Phase I into the organization and into a Cloud Service Provider (CSP)?”
– Our pick for best **Cloud Service Provider Integration**: Join us live to find out!
Finally, we’ll discuss the benefits of using this approach with LangGraph applications, as well as mention some other leading partnerships worth noting in the industry that prioritize speed into production (e.g., CrewAI on AWS).
Of course, we’ll build, ship, and share a production-grade RAG application step-by-step!
Join us live to dig into the details and get your questions answered, from concepts to code!
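For orientation, a minimal skeleton of the retrieve-then-generate loop in LangGraph (the orchestration pick above) might look like the sketch below; the retriever and generator are stubbed, and in the full stack they would call Qdrant (plus a reranker) and your serving endpoint.

```python
# Minimal skeleton of a retrieve-then-generate graph in LangGraph. The retriever
# and generator are stubs; a full stack would call Qdrant (+ rerank) and an LLM.
from typing import TypedDict
from langgraph.graph import StateGraph, START, END

class RAGState(TypedDict):
    question: str
    context: list[str]
    answer: str

def retrieve(state: RAGState) -> dict:
    # Placeholder: swap in a Qdrant similarity search plus reranking here.
    docs = ["Doc A about the question topic.", "Doc B with supporting details."]
    return {"context": docs}

def generate(state: RAGState) -> dict:
    # Placeholder: swap in an LLM call grounded on state["context"].
    prompt = f"Answer using only:\n{chr(10).join(state['context'])}\n\nQ: {state['question']}"
    return {"answer": f"(LLM answer to: {state['question']})  [prompt length: {len(prompt)}]"}

graph = StateGraph(RAGState)
graph.add_node("retrieve", retrieve)
graph.add_node("generate", generate)
graph.add_edge(START, "retrieve")
graph.add_edge("retrieve", "generate")
graph.add_edge("generate", END)
app = graph.compile()

print(app.invoke({"question": "What does the policy say about data retention?"}))
```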
What You’ll Learn:
– How to construct a RAG application in 2025 according to best-practices
– Why there is a right answer to the “best” components for RAG apps in general
– How to think about building the best RAG app components into your existing operations
– How to ship your LangGraph RAG prototype to production according to 2025 best practices
🤓 Who should attend the event:
– Aspiring AI Engineers who want to build, ship, and share production-grade LLM applications
– AI Engineering leaders who want to build the best possible RAG applications
Presenter:
Shashank Shekhar, Independent Researcher
About the Speaker:
Shashank Shekhar is an independent researcher and consultant who has worked with startups and companies to help them build and scale data pipelines, machine learning models, and evaluation systems. Some of the companies he has consulted for include Vector Institute, Cohere, Erode AI, NextAI, and Shell. Prior to this, he was the founder of Dice Health, where he built real-time speech and language AI solutions for healthcare providers, steering the company from inception to profitability. Before that, he was a researcher on scaling laws, reasoning, and interpretability at Meta AI, Vector Institute, and the Indian Institute of Science. His research has been cited over 1,800 times and has won various awards, including the Best Paper award at NeurIPS 2022.
Track: Inference Scaling
Technical Level: 300 – Advanced
Abstract:
As large language models (LLMs) grow ever larger, it is no longer possible to train them effectively within the memory capacity of a single GPU, or even a few. Practitioners must leverage multiple parallelization strategies simultaneously to achieve efficient training at scale. These strategies rely on strategic data and parameter sharding across devices and efficient collective communication operations to synchronize gradients and activations. This hands-on workshop will cover fundamental parallelism dimensions—data, tensor, and pipeline parallelism—and how to compose them effectively for training billion-parameter models. We will also cover some recent LLM-specific parallelization techniques such as context parallelism. Through live coding and practical exercises, participants will implement each strategy from first principles, understand their trade-offs, and learn to optimize communication patterns and memory usage for maximum training throughput across distributed hardware.
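As a warm-up for the first of these dimensions, here is a minimal data-parallel training sketch using PyTorch’s DistributedDataParallel wrapper; the model and data are toy placeholders, and the workshop itself builds these mechanisms up from first principles rather than relying on the wrapper.

```python
# Minimal data-parallel sketch with PyTorch DDP (toy model and data).
# Launch with: torchrun --nproc_per_node=2 this_script.py
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP


def main():
    dist.init_process_group(backend="gloo")  # use "nccl" when training on GPUs
    rank = dist.get_rank()

    model = torch.nn.Linear(32, 1)
    model = DDP(model)  # gradients are all-reduced across ranks on backward()
    opt = torch.optim.SGD(model.parameters(), lr=0.01)

    for step in range(10):
        # Each rank would see a different shard of the dataset (random toy batches here).
        x = torch.randn(16, 32)
        y = torch.randn(16, 1)
        loss = torch.nn.functional.mse_loss(model(x), y)
        opt.zero_grad()
        loss.backward()  # DDP inserts the gradient all-reduce here
        opt.step()
        if rank == 0:
            print(f"step {step}: loss={loss.item():.4f}")

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```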
What You’ll Learn:
– Understanding data and model sharding, and collectives involved in implementing sharded training
– Core principles of data, tensor, pipeline, and context parallelism for distributed LLM training
– How to compose multiple parallelism strategies and optimize communication patterns within and across devices
– Practical advice for memory management, and performance profiling in distributed settings
Presenters:
Amna Jamal, National Data and AI Expert, IBM Canada | Julia Olmstead, AI ML Ops Technical Specialist – Canadian Signature Coverage: Ontario Government and Wes, IBM Canada | Kyle Sava, Data & AI Technical Specialist – Select, IBM Tech | Alissa Furet, AI/MLOps Technical Specialist, IBM Canada
About the Speakers:
Amna Jamal is a seasoned Data and AI expert at IBM, with over eight years of experience in data management, data science, and AI. As Watsonx Technical Leader for Canada, she drives innovation at the intersection of data and AI, helping organizations unlock their full potential.
Specializing in Information Management, DataOps, and ModelOps, Amna delivers customized solutions that optimize business processes and enhance revenue streams. Amna is a trusted advisor to senior leadership, guiding strategic decision-making and solving technical challenges. She leads a team of technical specialists, ensuring they meet client requirements and business objectives. Her ability to align diverse stakeholders around a shared vision has resulted in transformative solutions.
Julia Olmstead supports the design and architecture of AI governance, MLOps, and automation solutions through deep expertise and strategic insight for public sector and enterprise clients at IBM Canada. With a focus on human-computer interaction and trustworthy AI, she helps design platforms that balance user needs with the demands of evolving AI technologies. She also ensures these platforms align with regulatory standards—enabling organizations to adopt AI and machine learning responsibly and effectively. Julia has also spearheaded national hackathons to bring cutting-edge AI capabilities to universities across Canada. She is passionate about bridging innovation with accountability and champions inclusive, ethical AI through design work, academic outreach, and mentorship.
Kyle Sava is an AI/MLOps Technical Specialist at IBM Canada with three years of experience with the company, helping organizations implement and scale AI solutions across various industries. With a dual degree in Computer Science and Statistics from the University of British Columbia and over 10 years of coding experience, Kyle brings exceptional technical depth to complex AI implementation challenges. He specializes in designing and architecting AI frameworks, MLOps automation solutions, and helping clients navigate the full AI lifecycle from concept to production. His expertise enables organizations to deploy scalable, ethical, and business-focused AI systems that drive measurable outcomes. Kyle also teaches AI and mathematics part-time, further demonstrating his commitment to advancing the field of artificial intelligence.
Alissa Furet helps organizations scale and govern AI initiatives through IBM’s Watsonx platform. As an AI & ML Ops Technical Specialist, she works closely with clients to design and deploy trusted AI solutions that support model development, validation, and monitoring in regulated environments. Through strategic engagements and hands-on expertise, Alissa enables teams to accelerate their AI adoption while ensuring model transparency, accountability, and measurable business value.
Track: Agents Zero To Hero
Technical Level: 200 – Intermediate
Abstract:
In this immersive workshop, participants will learn to build and deploy an Agentic AI application prototype tailored to an enterprise use case. The bootcamp will focus on three primary tracks:
Intelligent Assistant, where participants will train an assistant for customer service support using pre-loaded and real-time data
HR Automation, where attendees will develop an HR Assistant to respond to employee inquiries, and
Business Automation, where agents will be utilized to automate a core business process through a structured approach of discovery, deconstruction, and development.
This hands-on bootcamp is designed to equip attendees with the skills and knowledge needed to harness the power of Agentic AI and drive innovation within their enterprises.
What You’ll Learn:
Learn: develop your skills and best practices for AI Agents
Use case & data definition: bring to life one use case from our list of top industry use cases
Implementation: apply your skills hands-on to build an Agentic AI prototype alongside our AI experts
Deployment: demonstration of how it would look in real life and the trust considerations needed to go to production
Presenter:
W. Ian Douglas, Developer Advocate, Block Open Source Developer Platforms
About the Speaker:
Ian has been working in engineering and API architecture for most of his career in tech. He’s been focusing on Developer Advocacy roles and technical education for the past 9 years, and loves to teach new skills at meetups, in workshops, and in talks. He’s currently working at Block on the Open Source Developer Programs team, learning all about how AI can interact with APIs and MCP.
Track: Agents Zero To Hero
Technical Level: 200 – Intermediate
Abstract:
Tired of your AI sleeping on the job? Time to wake it up with some real-world data. In this hands-on workshop, you’ll transform a basic REST API into a powerful tool your AI agent can actually USE. No more static responses – we’re building bridges between AI and APIs using Model Context Protocol (MCP).
What you’ll build:
– Your first MCP wrapper (spoiler: it’s simpler than you think)
– A bridge between web APIs and AI agents that actually works
– A live demo that proves your AI agent isn’t just making things up
The 45-minute coding journey breaks down into:
– Quick dive into MCP’s API superpowers
– Roll up your sleeves and build an MCP wrapper
– Watch your AI agent flex its new API muscles
Bring your laptop with Python installed – we’re coding this together. While we’ll have sample code on hand if you need it, the real fun is in building it yourself. Basic Python skills and REST API knowledge will help, but if you can write a “for” loop and know what an API endpoint is, you’re ready to roll.
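If you want a preview of the general shape of what we’ll build, here is one way to sketch it using the FastMCP helper from the official Python MCP SDK. The weather endpoint shown is just an example REST API; the workshop’s build may differ.

```python
# Illustrative MCP server wrapping a REST API (endpoint is an example, swap in your own).
import httpx
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("weather-wrapper")


@mcp.tool()
def get_current_weather(latitude: float, longitude: float) -> dict:
    """Fetch current weather for a location from a public REST API."""
    resp = httpx.get(
        "https://api.open-meteo.com/v1/forecast",  # example endpoint
        params={"latitude": latitude, "longitude": longitude, "current_weather": True},
        timeout=10.0,
    )
    resp.raise_for_status()
    return resp.json()


if __name__ == "__main__":
    mcp.run()  # defaults to stdio transport, so any MCP-capable agent can connect
```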
What You’ll Learn:
We’ll be learning how MCP relates to APIs, specifically around RESTful APIs, and how to build an MCP server for a RESTful API, so any AI Agent can access dynamic data.
Presenters:
Pranav Arya, Manager, AI, TELUS | Liz Lozinsky, Engineering Manager, Fuel iX – Telus Digital
About the Speakers:
Pranav is a dynamic AI leader who brings together expertise in both engineering and product development. His strategic approach focuses on translating business opportunities into actionable AI solutions, having successfully led numerous critical AI initiatives across TELUS. As a driving force behind the GenAI transformation at TELUS, he continues to shape the organization’s artificial intelligence landscape with innovative solutions.
Within the AI Accelerator at TELUS, Pranav leads the AI Growth Team which pushes the boundary of Machine Learning and AI within the organization by introducing new technologies within the TELUS AI ecosystem.
Track: Multimodal LLMs
Technical Level: 200 – Intermediate
Abstract:
In this workshop session, we’ll explore FueliX, a cutting-edge Generative AI SaaS platform developed by Telus, Telus Digital Experiences, and Willow Tree. Participants will discover how this model-agnostic platform is revolutionizing enterprise productivity and creativity workflows. The presentation will demonstrate FueliX’s versatile approach to AI implementation, showcasing its practical applications in real-world scenarios. Whether you’re a technology leader, developer, or business strategist, this session will provide valuable insights into leveraging FueliX’s capabilities for your organization’s AI initiatives.
What You’ll Learn:
Participants will understand how to leverage FueliX’s model-agnostic architecture to implement flexible AI solutions in their organizations, enabling them to build scalable productivity and creativity workflows that aren’t constrained by specific AI model limitations
Presenters:
Prashanth Rao, AI Engineer, Kuzu | Chang She, CEO & Co-Founder, LanceDB
About the Speakers:
Prashanth is an AI & DevRel engineer at Kùzu Inc., a graph database startup based in Waterloo, Ontario. He has a background in NLP, machine learning & data engineering and has held multiple roles in these capacities across various industries. Along with his advocacy work with Kuzu, Prashanth is passionate about evangelizing open source software, exploring the latest developments in AI and data infra, and regularly communicates about them on his blog, thedataquarry.com.
Chang She is the CEO and cofounder of LanceDB, the multimodal data lake for AI. A serial entrepreneur, Chang has been building DS/ML tooling for nearly two decades and is one of the original contributors to the pandas library. Prior to founding LanceDB, Chang was VP of Engineering at TubiTV, where he focused on personalized recommendations and ML experimentation.
Track: Advanced RAG
Technical Level: 200 – Intermediate
Abstract:
In this workshop, we will explore an underexplored dimension of GraphRAG, the integration of images, and gently introduce the audience to the idea of Multimodal GraphRAG, the next frontier of RAG that brings image data to the forefront of graph-based reasoning and retrieval. Attendees will gain insights into how Multimodal GraphRAG integrates the semantic richness of images and text with the contextual reasoning power of graphs, providing a comprehensive, explainable, and actionable approach to solving complex data challenges.
What You’ll Learn:
We will demonstrate that blending semantic context and data (text, visual, etc.) enables reasoning across multiple levels of abstraction and provides explainability in GraphRAG architectures.
Presenters:
Karthik Guruswamy, Advanced AI Strategist, Teradata | Gary Class, Director, Industry Strategy for Financial Services, Teradata
About the Speakers:
Karthik Guruswamy is an AI Strategist at Teradata. His primary role is to innovate with customers in the field on various AI initiatives, especially running AI language models in-database. Karthik is based in the San Francisco Bay Area and has worked with AI startups such as H2O.AI. He was also a co-founder of a few startups in Silicon Valley. In his spare time, Karthik bikes and hikes on weekends, and can sometimes be found working on his golf game at Bay Area golf courses.
Gary is an accomplished industry strategist with extensive experience in financial services, where he has made significant contributions to advanced analytics and AI. Gary spent over three decades at Wells Fargo Bank as the Director of Advanced Analytics at the forefront of innovation during the transformational era of “anytime, anywhere” banking. His visionary leadership has shaped the landscape of financial services through innovation, data-driven insights, and strategic thinking.
Track: AI for Productivity Enhancements
Technical Level: 300 – Advanced
Abstract:
Teradata is known for its massively parallel processing database that powers large-scale enterprise applications. Its Vantage platform is available both on-prem and on the cloud, and supports SQL and AI/ML Analytics with a shared-nothing, linearly scalable paradigm along with various compute design patterns. Users can use an SQL client or Python/R libraries to access the database.
Most recently, Teradata has enabled running pre-trained Hugging Face embedding encoder and Seq2Seq language models inside the platform using only existing CPUs, an interesting option that allows users to embed that capability into existing ETL workflows and BI dashboards – all with workload management. The workshop provides a primer on how the language models can be easily installed in the database, run with GPU-like inference performance by exploiting the parallel CPU harness, and integrated with various CX applications and novel use cases and workflows.
What You’ll Learn:
You don’t need GPU access to run inference with small language models. You can enable it with a parallel CPU harness like Teradata’s, alongside BI workloads.
Presenter:
Niels Bantilan, Chief Machine Learning Engineer, Union.ai
About the Speaker:
Niels is the Chief Machine Learning Engineer at Union.ai; a core maintainer of Flyte, an open source workflow orchestration tool; and creator of Pandera, a data validation and testing tool for dataframes. His mission is to help data science and machine learning practitioners be more productive.
He has a Masters in Public Health Informatics, and prior to that a background in developmental biology and immunology. His research interests include reinforcement learning, NLP, ML in creative applications, and fairness, accountability, and transparency in automated systems.
Track: Advanced RAG
Technical Level: 300 – Advanced
Abstract:
So you’ve built and deployed a RAG proof-of-concept at your organization with an off-the-shelf framework in a few days, and it looks like it’s working…now what? Just as in previous generations of AI and ML, the journey of a RAG application doesn’t end once you’ve deployed the model: it requires constant iteration to maintain and improve its performance. Unlike traditional ML deployments, however, AI applications like RAG involve more complex operational and infrastructure requirements, such as heterogeneous compute, vector stores, bootstrapping evaluation datasets, and LLM model hosting.
In this workshop, you’ll learn what it takes to maintain and iterate on RAG applications, from prototype to production, by building a chat assistant using the Union.ai platform.
Attendees will take away the core concepts and techniques required to systematically improve their RAG applications while adopting best software engineering practices.
What You’ll Learn:
-How to maintain and iterate on RAG applications, from prototype to production (via the example of building a chat assistant)
-Software engineering best practices when working with RAG applications
-How to simplify MLOps in the RAG context (i.e., LLMOps) to avoid wasting time managing infrastructure and hardware, while also controlling compute costs and maintaining visibility
Presenter:
Abhijeet Mazumdar, Head of AI, AsteroidX
About the Speaker:
Abhijeet Mazumdar is a seasoned AI leader and inventor with four patents in applied AI. As Head of MLOps & Cloud at Binoloop, he oversaw the delivery of FedRAMP‑compliant AI software for the U.S. Department of Defense, architected end‑to‑end ML pipelines on Kubernetes, and drove a $3 million ARR increase through production‑grade agentic and generative AI products.
Previously, as CTO & Head of AI at Clearspot AI, Abhijeet built a no‑code AI platform for drone software deployment and led a 30‑member interdisciplinary team to multimillion‑dollar ARR. At Intel, he designed and deployed NexGPT, an internal LLM platform with over 60k users that halved development time, and at GE Vernova he developed AI‑driven control algorithms that boosted wind turbine performance by 15% and generated over $30 million in customer value.
In “Retrieval Reimagined – Master LLMs and Embeddings using Local OSS,” Abhijeet draws on his hands‑on experience—from the open‑source KRS Kubernetes MCP server to large‑scale RAG and embedding pipelines—to show how ML enthusiasts and developers can leverage local OSS tools for secure, scalable, high‑performance retrieval‑augmented systems
Track: Opensource Model Finetuning
Technical Level: 300 – Advanced
Abstract:
So you’ve built and deployed a RAG proof-of-concept at your organization with an off-the-shelf framework in a few days, and it looks like it’s working…now what? Just as in previous generations of AI and ML, the journey of a RAG application doesn’t end once you’ve deployed the model: it requires constant iteration to maintain and improve its performance. Unlike traditional ML deployments, however, AI applications like RAG involve more complex operational and infrastructure requirements, such as heterogeneous compute, vector stores, bootstrapping evaluation datasets, and LLM model hosting.
In this workshop, you’ll learn what it takes to maintain and iterate on RAG applications, from prototype to production, by building a chat assistant using the Union.ai platform.
Attendees will take away the core concepts and techniques required to systematically improve their RAG applications while adopting best software engineering practices.
What You’ll Learn:
-How to maintain and iterate on RAG applications, from prototype to production (via the example of building a chat assistant)
-Software engineering best practices when working with RAG applications
-How to simplify MLOps in the RAG context (i.e., LLMOps) to avoid wasting time managing infrastructure and hardware, while also controlling compute costs and maintaining visibility
Presenter:
Malikeh Ehghaghi, Machine Learning Research Scientist, Vector Institute
About the Speaker:
Malikeh is a machine learning researcher at the Vector Institute and an incoming PhD student at the University of Toronto, where she will be supervised by Professor Colin Raffel. She also co-hosts the Women in AI Research (WiAIR) podcast and actively advocates for women in technology. She holds an MScAC degree in Computer Science from the University of Toronto and brings over five years of industry research experience from companies including Winterlight Labs, Cambridge Cognition, and Arcee AI. Her research interests span Modular Machine Learning, Model Merging, Efficient LLMs, and topics in Interpretability and Fairness. Her work has been published in leading conferences such as EMNLP, COLING, ACL, MICCAI, and AAAI.
Track: Opensource Model Finetuning
Technical Level: 200 – Intermediate
Abstract:
The traditional belief that scaling up model parameters and data volume is the sole path to enhanced performance in large language models (LLMs) is being challenged by innovative strategies that prioritize efficiency and cost-effectiveness. DeepSeek’s success story serves as a testament to the potential of thoughtful data engineering and meticulous model design in achieving superior AI performance without incurring prohibitive costs. This presentation delves into an overview of state-of-the-art data-centric and model-centric strategies for training language models, aiming to achieve optimal performance at minimal costs. We first talk about the rise of small language models (SLMs) as cost-efficient alternatives for dense large language models. On the data-centric side, we explore techniques such as data mixing, filtering, or deduplication to enhance dataset quality. On the model-centric front, we cover advanced approaches including pruning, distillation, parameter-efficient finetuning, quantization, and model merging to streamline model architectures without compromising performance. Together, these approaches demonstrate that strategic data preparation and model training can produce superior language models without the massive financial investments traditionally considered necessary for scaling AI systems.
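To ground one of the model-centric techniques mentioned above, here is a minimal knowledge-distillation loss sketch in PyTorch, where a small student is trained to match a larger teacher’s softened output distribution. The temperature and weighting are illustrative defaults, not recommendations, and the random tensors stand in for real model outputs.

```python
# Minimal knowledge-distillation loss sketch (PyTorch).
# teacher/student are arbitrary models producing logits over the same label space.
import torch
import torch.nn.functional as F


def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Blend a soft KL term against the teacher with the usual hard-label loss."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradient magnitudes match the hard loss
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard


# Toy example with random tensors standing in for real model outputs:
student_logits = torch.randn(8, 100, requires_grad=True)
teacher_logits = torch.randn(8, 100)
labels = torch.randint(0, 100, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```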
What You’ll Learn:
Scaling up parameters and data isn’t the sole path to building high-performing large language models. Reaching strong performance across domains also depends on strategic data curation and architectural design. This talk explores alternative pathways to achieving these results.
Presenter:
Frederic Marier, Senior Director, Quantitative Risk Modeling and RESL Advanced Analytics, CIBC
About the Speaker:
Frederic Marier is a Senior Director at CIBC specializing in quantitative risk modeling. During his tenure at CIBC, he has pioneered the use of Machine Learning models in Credit Risk, Fraud, and AML.
He is a graduate of the John Molson School of Business (B.Comm) and Queen’s Business School (Master of Management Analytics).
Track: Traditional ML
Technical Level: 300 – Advanced
Abstract:
Our current model operations process uses static models that are exposed to model, process, and data drift. Dynamic models are one of the best mitigants for these risks; however, they pose additional challenges from a monitoring, governance, and regulation point of view. Through collaboration, we developed a model operations cycle that balances automation and controls, allowing us to reap the benefits of dynamic models while maintaining our model risk posture.
What You’ll Learn:
Through an applied example, you will learn how to implement a dynamic model operational process. The presentation will focus on the cross-functional collaboration needed within a regulated environment and give an overview of the systems and techniques leveraged. We will introduce unique testing concepts that can be used to mitigate model risks while maintaining model risk posture.
Presenter:
Loubna Ben Allal, Research Engineer and SmolLM Lead, Hugging Face
About the Speaker:
Loubna Ben Allal is a Research Engineer in the Science team at Hugging Face, where she leads the training of small language models (SmolLMs) and their data curation. Previously, she worked on large language models for code and was a core member of the BigCode team behind The Stack datasets and StarCoder models for code generation.
Track: Opensource Model Finetuning
Technical Level: 300 – Advanced
Abstract:
On-device language models are revolutionizing AI by making advanced models accessible in resource-constrained environments. In this talk, we will explore the rise of small models and how they are reshaping the AI landscape, moving beyond the era of scaling to ever-larger models. We will also cover SmolLM, a series of compact yet powerful LLMs, focusing on data curation, and ways to leverage these models for on-device applications.
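As a taste of how lightweight these models are to work with, here is a minimal sketch that runs a small instruction-tuned model locally with the transformers pipeline. The model ID shown is illustrative; swap in whichever SmolLM checkpoint fits your device budget.

```python
# Illustrative: running a small instruction-tuned model locally with transformers.
# The model ID below is an example; substitute the SmolLM checkpoint you actually use.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="HuggingFaceTB/SmolLM2-1.7B-Instruct",
    device_map="auto",  # falls back to CPU if no GPU is available
)

prompt = "Explain in two sentences why small language models matter for on-device AI."
out = generator(prompt, max_new_tokens=96)
print(out[0]["generated_text"])
```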
What You’ll Learn:
Small, well-trained language models—built through smart design and thoughtful data curation—can deliver impressive performance, making them ideal for on-device use.
Presenter:
Pierre-Luc Vaudry, Senior AI Scientist, National Bank of Canada
About the Speaker:
Pierre-Luc Vaudry is a Senior AI Scientist at National Bank of Canada. With a strong background in artificial intelligence and machine learning, Pierre-Luc has been instrumental in driving innovative projects within the organization, working on chatbots, search engines, and complaint management automation.
Pierre-Luc has worked in AI R&D in the industry since 2017, after completing a PhD in Computer Science. His expertise spans various domains of AI, including deep learning, natural language processing, and natural language generation. His passion for AI is evident through his published data-to-text papers, industry conference talks on AI-driven email security, and the GenAI trainings he created for employees of the Bank.
Track: GenAI Deployments In Regulated Industries
Technical Level: 300 – Advanced
Abstract:
The complaint management automation project aims to harness the power of cost-effective Generative AI (GenAI) to enhance compliance and improve operational efficiency in the management of customer complaints. By leveraging advanced AI technologies, the project seeks to automate the detection of potentially systemic issues within complaints, ensuring comprehensive coverage and timely resolution, and the summarization of complaints for regulatory purposes.
The current manual process involves analyzing merely a sample of complaints, which is both time-consuming and prone to errors. With the implementation of GenAI, the project now achieves 100% coverage of complaints without incurring additional costs. Using a combination of GenAI, traditional ML, and automation, the AI models automatically detect non-compliant complaints and identify potential candidates for systemic issues. This information is then presented in a convenient format for human validation and investigation. This approach not only streamlines the complaint management process but also mitigates the risk of regulatory violations, which can result in serious sanctions, including significant fines.
To achieve this, the project includes the cost-effective use of GenAI to generate fictitious complaint examples based on systemic issue criteria, allowing for a more accurate comparison with actual complaint texts. The AI will assign a similarity score to each complaint, facilitating the identification of systemic issues and enabling targeted interventions by human experts.
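A minimal sketch of that similarity-scoring step, assuming a sentence-embedding model is used for the comparison (the project itself may rely on a different model or scoring scheme, and the example texts below are purely illustrative):

```python
# Illustrative similarity scoring between synthetic systemic-issue exemplars and
# real complaints, using sentence embeddings (model choice is an assumption).
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

synthetic_examples = [
    "Several customers report the same fee being charged twice on their statements.",
    "Clients say the mobile app repeatedly rejects valid identity documents.",
]
complaints = [
    "I was billed the monthly fee two times in March and again in April.",
    "The branch was closed earlier than the posted hours.",
]

scores = util.cos_sim(model.encode(complaints), model.encode(synthetic_examples))
for complaint, row in zip(complaints, scores):
    # A high maximum score flags the complaint as a candidate systemic issue
    # to be routed to a human expert for validation.
    print(f"{row.max().item():.2f}  {complaint}")
```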
Additionally, the project incorporates the automation of complaint summarization using GenAI to produce regulatory reports. This involves generating concise and uniform summaries of complaints and their resolution, ensuring that only relevant information is included. In addition, a human is always involved to validate and enrich the generated summaries when necessary. The automation of complaint summarization reduces operational risks and enhances the efficiency of the complaint management process.
By integrating GenAI into the complaint management system, the project aims to enhance compliance, reduce operational costs, and improve overall efficiency, ultimately leading to better customer satisfaction and regulatory adherence.
What You’ll Learn:
GenAI solutions are available to assist with regulatory compliance in a straightforward and economical manner. Namely, GenAI can be used once to generate synthetic data that fuels traditional machine learning, or services can be paid for on a per-use basis with a custom prompt, without the need for a dedicated architecture.
Presenter:
Manas Madine, Senior Machine Learning Engineer/Applied Scientist, AMD
About the Speaker:
Manas Madine is a Senior Machine Learning Engineer / Applied Scientist at AMD, where he developed debug triage tools and hierarchical attention mechanisms for tokenization, leveraging advanced deep learning methods to enhance root cause analysis in x86 processor simulations. He also built scalable natural language analytics systems using Azure OpenAI Gateway and MongoDB, where he experimented with GPT-4, Llama-based models, and reinforcement learning from human feedback. Before this, he was a Computer Science graduate student at the University of Massachusetts Amherst with a strong background in software engineering and machine learning. He received his Bachelor’s degree in Computer Science and Engineering from the Indian Institute of Technology Kharagpur.
Before joining UMass, Manas was a Software Engineer at PayPal, where he led compliance-critical projects, migrated legacy C++ systems to RESTful frameworks, and improved email reconciliation through Kafka-based enhancements. As a Cloud Software Developer Intern at Bidgely, he created a cloud-agnostic schema-management solution with AWS S3 and API layers. Manas’s research interests include out-of-distribution robustness in large language models, semantic rewriting for bridging distribution gaps, and time-series analysis for healthcare applications. In addition to authoring papers on topics such as OOD sentiment analysis and heart sound segmentation, he holds a patent related to AI-based railway crossing monitoring. He is proficient in C++, Python, Java, and various AI frameworks, and has been recognized for his innovative work through multiple awards, including a Generative AI Hackathon honor at PayPal and an appreciation letter from Bidgely’s Senior Vice President.
Track: AI for Productivity Enhancements
Technical Level: 300 – Advanced
Abstract:
LLM advancements have primarily benefited natural language tasks and certain programming language domains. However, the hardware development sector has not fully reaped these benefits due to a lack of high-quality pretraining resources for assembly code. This gap raises the question: **How can we leverage breakthroughs in language technology to streamline hardware development, especially for GPUs that, in turn, accelerate large language models (LLMs)?**
Building on existing progress in language modeling and code generation, the next logical step is to expand these capabilities into hardware-centric data such as assembly code, micro-architecture logs, and real-time debugging workflows. A key obstacle is that fully retraining or pretraining large models from scratch is both costly and time-consuming. Therefore, a more practical approach is to adapt (or “trick”) existing LLMs into generating valid, high-quality assembly code. In doing so, we must overcome challenges like suboptimal tokenization and confusion caused by hexadecimal notation, both of which can mislead model attention and context tracking.
To address these issues, we propose a novel **hierarchical attention mechanism** specifically designed for hardware-oriented data. This approach restructures the model’s attention layers to more effectively handle unique tokens and numeric formats, mitigating the limitations of standard tokenization. Drawing on experience in debugging triage, simulation log analysis, and advanced error-handling techniques in x86 processor pipelines, our method refines token representations and elevates the model’s capacity for granular context capture. As a result, LLMs become better equipped to generate precise assembly code, thereby accelerating the hardware development lifecycle and fostering a virtuous cycle where improved hardware directly fuels further advances in large-scale AI systems.
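To see why hexadecimal notation trips up standard tokenizers, here is a small sketch using an off-the-shelf GPT-2 tokenizer. The exact splits will vary by tokenizer, but the fragmentation it illustrates is the kind of problem the proposed hierarchical attention mechanism is designed to mitigate.

```python
# Illustrative: how a generic BPE tokenizer fragments assembly-style text.
# GPT-2 is used only as an example; exact splits depend on the tokenizer.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")

for line in ["mov rax, 0x7ffef3a1c2d8", "jne 0x0040156b"]:
    pieces = tok.tokenize(line)
    print(f"{line!r} -> {len(pieces)} tokens: {pieces}")
# A single hex address typically shatters into many sub-tokens, which dilutes
# attention and makes numeric structure hard for the model to track.
```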
What You’ll Learn:
1. Expanding LLM Utility Beyond Software: Attendees should realize that large language models can extend beyond general coding tasks to hardware-centric challenges—such as assembly code generation, micro-architecture debugging, and simulation log analysis—offering transformative benefits to the hardware industry.
2. Adapting Rather Than Retraining: The high costs and complexity of retraining models from scratch underscore the need for practical adaptation techniques. By cleverly repurposing existing LLMs, teams can achieve meaningful progress without prohibitive overhead.
3. Importance of Novel Tokenization & Attention Mechanisms: Hexadecimal notation and low-level instructions pose significant tokenization challenges. An enhanced, hierarchical attention mechanism can address these issues, emphasizing how careful token management and context structuring lead to higher-quality assembly code output.
4. Holistic Impact on Hardware Development: Automating and improving assembly code generation, bug triage, and performance optimization via specialized LLMs sets off a positive feedback loop: faster hardware development contributes to even more powerful AI accelerators, fueling future innovations in machine learning.
5. Cross-Disciplinary Collaboration: Finally, attendees should understand the value of bridging expertise in AI, systems engineering, and hardware design. By uniting these domains, the industry can unlock a new wave of productivity and innovation in both hardware and large-scale AI.
Presenter:
Marcelo Lotif, Senior Software Developer, Vector Institute
About the Speaker:
Marcelo is a seasoned Software Engineer with 17 years of experience in the market, 8 of those as a Machine Learning Engineer at both startups at various stages and big tech companies like Apple.
Track: MLOps for Smaller Teams
Technical Level: 200 – Intermediate
Abstract:
This talk will go over the fundamentals of inferencing pipelines, their types and how they differ from other kinds of pipelines, and present an Infrastructure-as-Code (IaC) reference implementation developed for the very successful AI Deployment Bootcamp that was run with Vector Institute’s industry partners in late 2024. The reference implementation uses Terraform and Python code for GCP and AWS backends and is designed to be used as a kickstarter for inferencing pipelines with MLOps and software development best practices in mind.
What You’ll Learn:
How to design effective inferencing pipelines and use the provided reference implementation as a kickstarter for your project.
Presenters:
Dhari Gandhi, Associate AI Project Manager, Vector Institute for Artificial Intelligence | Lucas Hartman, Applied AI Design & Delivery Intern, Vector Institute for Artificial Intelligence | Shabnam Hassani, GenAI and NLP Technical Specialist, Vector Institute for Artificial Intelligence
About the Speakers:
Dhari Gandhi is an Associate AI project manager at Vector Institute with a strong background in managing AI projects, data science, data governance, and technical strategy. She currently supports applied AI initiatives at the Vector Institute, where she collaborates with academic, industry, and government partners to advance the design and delivery of trustworthy and impactful AI solutions. Dhari is a graduate of the Schulich School of Business Master of Management in Artificial Intelligence (MMAI) program and is a certified Project Management Professional (PMP). She combines technical fluency with cross-sector project management experience. She was recently recognized as a Rising Star in AI and finalist for the “Young Role Model of the Year” at the Women in AI Awards North America 2025.
Lucas Hartman is an Applied AI Design & Delivery Intern at the Vector Institute, where he supports sponsor-driven AI education initiatives and projects focused on scalable deployment of responsible AI applications. He is also a Research Assistant at Western University, where his work explores reinforcement learning strategies for electric vehicle charging optimization. Lucas holds a Bachelor’s degree in Software Engineering from Western University and is currently pursuing his Master’s in Engineering Science with a focus on Artificial Intelligence. He is a recipient of the Vector Scholarship in Artificial Intelligence and a Best Paper Award winner at the IEEE International Conference on Machine Learning and Applications.
Shabnam Hassani is a PhD researcher in Computer Science at the University of Ottawa, specializing in the application of Natural Language Processing and Machine Learning to regulatory compliance and requirements engineering. She currently serves as a Technical Specialist at the Vector Institute, where she supports Canadian startups in AI integration. Shabnam has extensive experience working on fine-tuning large language models (LLMs), developing NLP systems for legal-tech, and leading AI governance initiatives. Previously, she was a Machine Learning Associate at the Vector Institute and New Software, where she contributed to cutting-edge AI projects in compliance and summarization. Her work bridges research and real-world application, with a strong focus on ethical AI and responsible innovation.
Track: AI Ethics And Governance Within The Organization
Technical Level: 200 – Intermediate
Abstract:
This talk presents a multi-level, actionable guide for governing Generative AI (GenAI) systems across diverse organizational contexts. Drawing on the literature, comparative governance frameworks (NIST, MIT, Turing Institute), and practical insights from a roundtable of industry professionals, the talk highlights real-world risk mitigation strategies. Attendees will learn how to operationalize Responsible GenAI governance through role-based implementation, sector-specific risk tools, and scalable practices like sandbox testing and continuous monitoring. The talk will also demo the Responsible GenAI Governance Guide (ResAI), a chatbot-powered website designed to help organizations scale governance literacy and decision-making.
What You’ll Learn:
How to translate abstract GenAI risk frameworks into practical enterprise governance strategies. Why scalable governance must integrate strategy, compliance, and technical implementation. How organizations can map, mitigate, and monitor GenAI risks across the model lifecycle. Exposure to the GenAI Risk Mapping tool and the Responsible GenAI Governance Guide (ResAI).
Presenter:
Kirit Thadaka, Senior PMM, Gretel (Acquired by NVIDIA)
About the Speaker:
Kirit Thadaka is a Staff Product Manager at Gretel (just acquired by NVIDIA), leading Gretel Data Designer—the world’s first compound AI system for generating high-quality synthetic data. Previously, he was a Senior Product Manager at AWS, overseeing Amazon SageMaker’s ML experimentation capabilities. With over a decade of AI/ML experience across startups and large enterprises, Kirit has deep expertise in technical leadership, core research and development, and solution architecture. He specializes in building innovative AI/ML platforms that help organizations harness the full potential of artificial intelligence.
Track: GenAI Deployments In Regulated Industries
Technical Level: 400 – Expert
Abstract:
AI-powered financial tools—from tax automation to fraud detection—require access to sensitive data that’s often locked away due to privacy regulations. But what if you could train, test, and scale these systems without ever handling real user data?
In this session, Kirit Thadaka will walk through how to generate realistic, privacy-safe 1040 tax form data using synthetic data pipelines. You’ll learn how to simulate personal and financial details (e.g., income, filing status, deductions) with statistical fidelity, structure it for downstream applications, and comply with strict data protection standards. The talk will also explore how synthetic data unlocks automation use cases in finance, reduces time spent on data provisioning, and prevents inadvertent exposure of sensitive information.
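A minimal, library-agnostic sketch of what such a record generator might look like is shown below. The field names, categories, and distributions are assumptions for demonstration only, not a real tax schema and not the method used in Gretel’s pipelines.

```python
# Illustrative synthetic 1040-style record generator. Field names, categories, and
# distributions are assumptions for demonstration, not a real tax schema.
import random
from faker import Faker

fake = Faker()
FILING_STATUSES = ["single", "married_filing_jointly", "head_of_household"]


def synthetic_1040_record() -> dict:
    wages = round(random.lognormvariate(10.8, 0.6), 2)  # skewed income distribution
    deductions = round(min(wages * random.uniform(0.05, 0.3), 25000), 2)
    return {
        "taxpayer_name": fake.name(),  # synthetic identity, never derived from real users
        "ssn": fake.ssn(),             # synthetic identifier
        "filing_status": random.choice(FILING_STATUSES),
        "wages": wages,
        "deductions": deductions,
        "taxable_income": round(max(wages - deductions, 0.0), 2),
    }


for record in (synthetic_1040_record() for _ in range(5)):
    print(record)
```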
What You’ll Learn:
How to design synthetic data pipelines for structured financial forms
Techniques for balancing realism and privacy in personal data
Applications in tax AI, software QA, and fraud model training
Presenter:
Ian Colbert, AI Research Staff, AMD
About the Speaker:
Ian Colbert is a member of the AI Research Staff at Advanced Micro Devices (AMD), based in San Jose, California. He received his Ph.D. in Machine Learning & Data Science from the Electrical and Computer Engineering Department at UC San Diego. During his 7 years at AMD, he has published several papers in top-tier AI conferences such as ICCV and ICML and regularly gives guest lectures on the quantization of deep neural networks and LLMs at various universities, including UC San Diego and the University of Toronto.
Track: Hardware Platforms
Technical Level: 100 – Beginner
Abstract:
The quality of deep neural networks has scaled with the size of their training datasets and model architectures. To reduce the rising cost of querying these increasingly large networks, researchers and practitioners have explored a handful of techniques in both hardware and software. One of the most impactful developments has been low-precision quantization, in which a neural network is constrained to require narrower data formats during storage and/or computation. In this talk, I will present an overview of quantization as it pertains to neural networks. I will review the fundamental concepts that state-of-the-art techniques have built on top of. Then, I will introduce Brevitas, AMD’s PyTorch quantization library. Finally, we will discuss exciting active research areas at AMD.
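As a primer on the fundamentals the talk reviews, here is a minimal sketch of uniform affine quantization of a weight tensor in plain PyTorch. Brevitas wraps this kind of logic into drop-in quantized layers; the arithmetic below is generic illustration, not Brevitas code.

```python
# Minimal uniform affine quantization sketch (generic math, not Brevitas' API).
import torch


def quantize_dequantize(w: torch.Tensor, num_bits: int = 8) -> torch.Tensor:
    qmin, qmax = 0, 2 ** num_bits - 1
    scale = (w.max() - w.min()) / (qmax - qmin)
    zero_point = qmin - torch.round(w.min() / scale)
    q = torch.clamp(torch.round(w / scale + zero_point), qmin, qmax)  # integer grid
    return (q - zero_point) * scale  # dequantized approximation of w


w = torch.randn(4, 4)
w_hat = quantize_dequantize(w, num_bits=4)
print("max abs error:", (w - w_hat).abs().max().item())
```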
What You’ll Learn:
Inference costs of deep learning (e.g., LLMs) continue to increase. In this talk, I will provide an overview of the fundamentals of neural network quantization and an overview of Brevitas, our open-source library for quantization (+328k downloads, +1.3k stars).
Presenter:
Qaish Kanchwala, Senior Manager, Machine Learning Engineering, The Weather Company
About the Speaker:
Qaish Kanchwala is a Senior Manager, Machine Learning Engineering at The Weather Company, where he leads both the ML Engineering and LLM Platform teams. He oversees the development, deployment, and optimization of machine learning and large language model (LLM) platforms, enabling AI-driven solutions for customer advertising, personalization, and health condition predictions.
With deep expertise in MLOps, scalable AI infrastructure, and enterprise AI adoption, Qaish ensures the seamless integration of cutting-edge AI solutions within complex business environments. His leadership has been instrumental in driving innovation and operational excellence across AI initiatives.
Track: GenAI Deployments In Regulated Industries
Technical Level: 300 – Advanced
Abstract:
As enterprises increasingly integrate AI into their operations, Large Language Models (LLMs) are emerging as powerful tools for automation, customer engagement, and decision-making. However, building and deploying an LLM platform at an enterprise scale comes with unique challenges, including model selection, infrastructure design, data security, governance, and cost optimization.
In this session, we will explore the end-to-end journey of creating an enterprise-grade LLM platform—from initial architecture decisions to deployment strategies and ongoing maintenance. We will discuss best practices for leveraging open-source vs. proprietary solutions, ensuring compliance with data privacy regulations, and optimizing performance for real-world applications. Additionally, we will highlight how The Weather Company has used LLMs to solve real use cases internally.
What You’ll Learn:
Using open-source tools to build an LLM platform and foster experimentation to provide value
Presenter:
Nitesh Soni, Global Head of Data Science (Commercial), Sanofi
About the Speaker:
Nitesh Soni is a distinguished leader in Data Science and AI, boasting over 18 years of diverse experience across the pharmaceutical, finance, consulting, and research industries. Since joining Sanofi, Nitesh has been instrumental in driving commercial excellence to enhance patient outcomes by strategizing, developing and operationalizing high-priority, scalable AI products. He leverages data, expert AI, and Generative AI (GenAI) to create solutions that are not only best-in-class but also compliant and responsibly built.
Throughout his career, Nitesh has excelled in driving digital transformation within large, complex global organizations. He is renowned for his ability to build and lead diverse, high-performing global data science teams.
A highlight of Nitesh’s research career includes being part of the groundbreaking team that discovered the Higgs boson particle in 2012, a milestone that underscores his commitment to advancing scientific knowledge and innovation.
Track: GenAI Deployments In Regulated Industries
Technical Level: 300 – Advanced
Abstract:
At Sanofi, we are fully committed to transforming the practice of medicine to improve patient outcomes by harnessing the power of advanced digital technologies and artificial intelligence (AI) at scale. Within our commercial function, we are leveraging the full potential of AI and Generative AI (GenAI) to enhance customer engagement through hyper-personalized experiences, optimize sales and marketing efforts, and improve market access by proactively anticipating market trends.
This presentation will provide an overview of our approach to building and operationalizing AI solutions at scale, while ensuring the ethical and safe use of AI. We are dedicated to fostering innovation to drive commercial excellence and deliver value to patients, healthcare providers, and stakeholders worldwide.
What You’ll Learn:
– Understanding our strategic approach to integrating AI within the organization.
– Insights into the journey from pilot projects to full-scale deployment in a highly regulated, large-scale organization.
– An overview of the key machine learning (ML) and deep learning algorithms, along with the tech stack we utilize.
– Strategies to increase AI adoption rates within the business and methods to measure its direct impact.
– Addressing ethical concerns and ensuring controlled, responsible AI development.
– How we adjust and integrate new AI advancements, such as Generative AI, into our existing frameworks.
Presenter:
Amit Jaspal, Engineering Manager AI, Meta Platforms
About the Speaker:
I’m a senior technology leader with 14+ years of experience, currently leading ecommerce recommendation teams at Meta Platforms, Inc. My work spans high-impact ML solutions, including privacy-compliant pipelines, GPU-optimized infrastructure, and sequence models that have directly driven revenue growth.
Previously at LinkedIn and D.E. Shaw, I’ve consistently delivered at-scale innovations across ads, feed ranking, and financial systems. I’ve led multi-org initiatives, scaled teams by 50%, and driven results through both strategic leadership and hands-on expertise.
I’m also an active contributor to the research community—invited speaker at WSDM and Kumo workshops, and reviewer for top conferences like KDD, SIGIR, and UMAP.
Track: Traditional ML
Technical Level: 400 – Expert
Abstract:
Popularity bias, a well-known challenge in recommender systems, occurs when these systems disproportionately favor popular items, potentially limiting the visibility of niche content while impacting both users and item providers. In this talk, we will begin by defining popularity bias and exploring its multifaceted origins, stemming from inherent data imbalances, algorithmic tendencies, and self-reinforcing feedback loops within dynamic recommendation processes. We will then delve into various mechanisms proposed to control this bias, ranging from pre-processing and in-processing model modifications to post-processing techniques, including dynamic debiasing strategies and novel approaches like False Positive Correction. Finally, we will critically examine the common assumption that popularity bias is always detrimental, discussing scenarios where recommending popular items might be beneficial and considering the complex trade-offs between bias mitigation and overall user experience in real-world applications.
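As one concrete illustration of a post-processing mitigation, here is a minimal re-ranking sketch that penalizes relevance scores by log popularity. The penalty form and weight are illustrative choices for exposition, not a production recipe.

```python
# Illustrative post-processing debias: penalize relevance scores by log-popularity.
import math


def rerank_with_popularity_penalty(candidates, lam=0.3):
    """candidates: list of (item_id, relevance_score, popularity_count)."""
    adjusted = [
        (item, score - lam * math.log1p(pop), score, pop)
        for item, score, pop in candidates
    ]
    return sorted(adjusted, key=lambda x: x[1], reverse=True)


candidates = [
    ("blockbuster", 0.92, 1_000_000),
    ("niche_doc", 0.88, 2_500),
    ("cult_classic", 0.85, 40_000),
]
for item, adj, raw, pop in rerank_with_popularity_penalty(candidates):
    print(f"{item:14s} raw={raw:.2f} adjusted={adj:.2f} popularity={pop}")
```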
What You’ll Learn:
What Popularity Bias Is and Why It Happens
Understand how popularity bias arises from data imbalance, algorithmic tendencies, and feedback loops in recommender systems.
Techniques to Mitigate the Bias
Learn about key strategies across pre-processing, in-processing, post-processing, dynamic debiasing, and advanced methods like False Positive Correction.
Bias Isn’t Always Bad
Discover when recommending popular items can be beneficial and how context shapes whether bias is truly harmful.
Navigating Trade-Offs
Gain insight into the trade-offs between reducing bias and optimizing user experience, engagement, and content diversity.
Presenter:
Hamza Farooq, CEO & Founder, Traversaal.ai | Adjunct Stanford
About the Speaker:
Hamza Farooq is an AI Startup founder, educator, researcher, and practitioner with years of experience in cutting-edge AI development. He has worked with global organizations, governments, and top universities, including Stanford and UCLA, to design and deploy state-of-the-art AI solutions. Hamza is the author of Building LLM Applications from Scratch and the founder of Traversaal.ai, a company specializing in Enterprise Knowledge Management and AI guardrails.
Known for his engaging teaching style and deep technical expertise, Hamza has trained thousands of students and professionals to master AI concepts and build production-ready applications.
Track: Agents Zero To Hero
Technical Level: 200 – Intermediate
Abstract:
This workshop is designed to provide participants with a comprehensive understanding of designing and building AI agents from the ground up. Moving beyond reliance on pre-built frameworks like CrewAI or Autogen, this session emphasizes learning the core mechanics of agent development to enable fully customizable solutions.
Led by Hamza Farooq, a seasoned AI expert and educator, the workshop is both technically rigorous and highly practical. Participants will gain hands-on experience in building intelligent agents capable of autonomous decision-making, task orchestration, and real-world problem-solving.
By the end of the workshop, attendees will walk away with the knowledge and tools needed to develop robust, scalable, and production-grade AI agents tailored to their specific use cases.
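To preview the kind of from-scratch construction the workshop covers, here is a minimal, framework-free agent loop: the model proposes either a tool call or a final answer, and the loop executes tools until an answer arrives. The `call_llm` helper and the JSON action protocol are placeholder assumptions for illustration, not the workshop’s exact design.

```python
# Minimal framework-free agent loop (illustrative). call_llm() is a placeholder for
# whatever chat-completion client you use; the JSON action protocol is an assumption.
import json


def call_llm(messages: list[dict]) -> str:
    """Placeholder: send messages to your LLM and return its raw text reply."""
    raise NotImplementedError


TOOLS = {
    "search": lambda query: f"(stub) top results for {query!r}",
    # Toy calculator for demo purposes only; avoid eval() in real systems.
    "calculator": lambda expression: str(eval(expression, {"__builtins__": {}})),
}

SYSTEM = (
    "You can either answer directly or call a tool. Reply with JSON only: "
    '{"action": "search"|"calculator"|"final", "input": "..."}'
)


def run_agent(task: str, max_steps: int = 5) -> str:
    messages = [{"role": "system", "content": SYSTEM}, {"role": "user", "content": task}]
    for _ in range(max_steps):
        decision = json.loads(call_llm(messages))
        if decision["action"] == "final":
            return decision["input"]
        observation = TOOLS[decision["action"]](decision["input"])
        messages.append({"role": "assistant", "content": json.dumps(decision)})
        messages.append({"role": "user", "content": f"Observation: {observation}"})
    return "Stopped: step limit reached."
```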
What You’ll Learn:
1. Learn Core Fundamentals: Understand the architecture and foundational concepts of AI agents, including reasoning frameworks, decision trees, and multi-agent orchestration.
2. Build Agents from Scratch: Gain hands-on experience in coding AI agents from the ground up, bypassing pre-built frameworks for maximum customization.
3. Implement Advanced Techniques: Explore cutting-edge approaches like semantic chunking, task decomposition, and performance optimization for agents.
Presenter:
Niels Bantilan, Chief Machine Learning Engineer, Union.ai
About the Speaker:
Niels is the Chief Machine Learning Engineer at Union.ai; a core maintainer of Flyte, an open source workflow orchestration tool; and creator of Pandera, a data validation and testing tool for dataframes. His mission is to help data science and machine learning practitioners be more productive.
He has a Masters in Public Health Informatics, and prior to that a background in developmental biology and immunology. His research interests include reinforcement learning, NLP, ML in creative applications, and fairness, accountability, and transparency in automated systems.
Track: Agents Zero To Hero
Technical Level: 200 – Intermediate
Abstract:
There’s been a lot of excitement about AI agents recently, and with emerging tools like Deep Research, Claude Code, and Cursor, many organizations are investing in the development of agentic applications for their particular problem domains, from customer service to scientific research. But how does one build an AI agent, much less deploy it to production in a reliable way?
This talk will discuss when to consider building a full stack agent yourself and dive into an overview of the core infrastructure and software components needed to host your own LLMs, MCP server, vector store, frontend UI, and evaluation harness. Through Union, I’ll briefly demonstrate how to orchestrate the full life cycle of an agentic app, starting from loading and caching open weights models to reasoning about evaluations.
What You’ll Learn:
-Agentic applications exist on a spectrum and can have some or none of the following properties: reasoning, tool use, and memory
-You don’t have to build the full stack of your agent from the ground up, but you need to make sure it has, at a minimum, an effective UI and an eval harness
-Then, you can pick and choose which parts you want to own and which parts you want to delegate to a provider
Presenter:
Kaustubh Prabhakar, Member of Technical Staff, OpenAI
About the Speaker:
Member of Technical Staff at OpenAI working on Memory and Personalization
Track: Future Trends
Technical Level: 200 – Intermediate
Abstract:
TBD
What You’ll Learn:
Role of Memory and Personalization in AI systems
Presenters:
Chang She, CEO & Co-Founder, LanceDB | Ryan Vilim, Member of the Technical Staff, character.ai
About the Speakers:
Chang She is the CEO and cofounder of LanceDB, the multimodal data lake for AI. A serial entrepreneur, Chang has been building DS/ML tooling for nearly two decades and is one of the original contributors to the pandas library. Prior to founding LanceDB, Chang was VP of Engineering at TubiTV, where he focused on personalized recommendations and ML experimentation.
Ryan is a Member of the Technical Staff at character.ai, where he bridges the divide between the data world and the machine learning world. At Character, he helps build the datasets and pipelines that power its core multimodal and chat products. Prior to Character, he led machine learning at Bowery Farming and was a software engineer focusing on computer vision at Sidewalk Labs. Toronto is near and dear to Ryan’s heart, as he earned a Ph.D. in Physics from the University of Toronto in 2015.
Track: Data Preparation and Processing
Technical Level: 300 – Advanced
Abstract:
Modern AI applications frequently require diverse retrieval methods and complex analytical queries across multimodal data such as text, video, audio, and images. Traditionally, teams must rely on multiple specialized data stores, introducing data duplication, synchronization complexities, and increased infrastructure costs.
In this joint talk, LanceDB and Character AI showcase a unified multimodal data lake solution built upon the innovative open-source Lance columnar format. LanceDB uniquely supports diverse retrieval and analytical workloads within a single cohesive system, offering scalability and superior cost-performance benefits. Character AI leverages LanceDB’s architecture to handle complex filtering, metadata queries, and retrieval tasks across text, audio transcripts, and video captions, significantly streamlining multimodal data exploration and analytics.
We will discuss how this unified data approach reduces complexity, optimizes infrastructure efficiency, and empowers sophisticated, multimodal AI applications.
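A minimal sketch of the unified pattern described above, using the open-source LanceDB Python client. The schema and records are toy examples, not Character AI’s production setup.

```python
# Illustrative LanceDB usage: vector search and metadata filtering over one table.
# Toy schema and data; not Character AI's production pipeline.
import lancedb

db = lancedb.connect("./lance_demo")
table = db.create_table(
    "clips",
    data=[
        {"vector": [0.10, 0.30, 0.50], "caption": "dog playing in the park", "modality": "video"},
        {"vector": [0.20, 0.10, 0.90], "caption": "podcast about space travel", "modality": "audio"},
        {"vector": [0.15, 0.25, 0.55], "caption": "puppy chasing a ball", "modality": "video"},
    ],
    mode="overwrite",
)

# The same table serves nearest-neighbour retrieval and SQL-style metadata filters.
hits = (
    table.search([0.12, 0.28, 0.52])
    .where("modality = 'video'")
    .limit(2)
    .to_list()
)
for h in hits:
    print(h["caption"])
```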
What You’ll Learn:
AI applications face retrieval challenges, driving the rise of vector databases. However, AI workflows demand more—feature store retrieval and analytical queries are just as essential. This often leads to AI data being stored in separate silos and queried using separate systems, increasing cost and complexity.
This is why we worked on eliminating this complexity by unifying vector search, feature retrieval, and SQL-based analytics within a single system, built on the open-source Lance columnar format—the new standard for AI data. This architecture redefines the performance-scale-cost curve, delivering hyper-scalable search at 10X better cost-efficiency. By combining advanced search indices, 100X faster random access, and optimized caching, LanceDB breaks the impossible triangle of performance, scale, and cost.
Presenter:
Dr. Walid Amamou, CEO, UbiAI Inc
About the Speaker:
From nanotechnology research to developing AI applications, my career journey has been defined by a drive to solve complex problems.
After earning a Ph.D. in Materials Science—where I studied graphene spintronics and successfully published several articles in top-tier journals—I joined Intel to develop next-generation semiconductor devices, where we successfully ramped up a unique technology from scratch.
Today, I am the founder of UBIAI, a leading platform for training language AI models that serves hundreds of customers, including Fortune 500 companies across multiple industries such as finance, healthcare, and supply chain.
My goal is to create applications that accelerate scientific discovery by leveraging AI.
Track: GenAI Deployments In Regulated Industries
Technical Level: 300 – Advanced
Abstract:
While general-purpose Large Language Models (LLMs) have demonstrated impressive capabilities, their limitations become apparent in high-accuracy, domain-specific applications. In healthcare, for example, a study evaluating AI-generated clinical responses found that OpenAI’s GPT produced hallucinations, struggling to interpret complex medical terminology reliably. Similarly, early legal applications revealed that ChatGPT could fabricate citations, as in a widely publicized case where an attorney unknowingly submitted non-existent AI-generated references. These reliability gaps can lead to serious risks—ranging from misdiagnosis and misinformation to privacy violations and legal missteps. Such outcomes highlight the need for a more targeted approach when deploying LLMs in specialized domains. In this talk, we will explore how fine-tuning LLMs is becoming essential for enterprises seeking to adapt general models to specific tasks or industries. Beyond improving accuracy and reliability, fine-tuned models can reduce computational costs and offer stronger privacy safeguards, making them a strategic advantage for mission-critical applications.
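As a pointer to what the adaptation step can look like in practice, here is a minimal parameter-efficient fine-tuning (LoRA) setup using Hugging Face PEFT. The base model ID, target modules, and hyperparameters are illustrative placeholders to be chosen per domain, compute budget, and model family.

```python
# Illustrative LoRA setup with Hugging Face PEFT (base model and hyperparameters
# are placeholders; pick them for your domain and model family).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_id = "meta-llama/Llama-3.2-1B"  # example ID; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id)

lora_cfg = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections; varies by architecture
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()  # typically well under 1% of the base weights

# From here, train as usual (e.g., with transformers' Trainer) on your
# domain-specific instruction or preference data.
```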
What You’ll Learn:
The core message of this talk is that general-purpose LLMs, while powerful, are not reliable enough for high-stakes or domain-specific use cases without adaptation. Attendees will learn why fine-tuning is essential to improve accuracy, reduce risks like hallucinations or misinformation, and meet the specific requirements of industries such as healthcare, legal, or finance. They’ll also gain an understanding of how fine-tuned models can lead to better performance, lower costs, and enhanced data privacy—making them a strategic asset for enterprise adoption.
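The talk does not prescribe a specific toolchain; as one hedged sketch of what domain adaptation can look like in practice, here is a minimal LoRA fine-tuning loop using Hugging Face transformers, peft, and datasets. The base model, dataset file, and hyperparameters are placeholders, not recommendations from the speaker:

```python
# Minimal LoRA fine-tuning sketch (assumptions: Hugging Face transformers,
# peft, and datasets; the base model and dataset path are placeholders).
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                          TrainingArguments, DataCollatorForLanguageModeling)

base_model = "meta-llama/Llama-3.1-8B"          # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(base_model)

# Attach low-rank adapters so only a small fraction of weights are trained.
model = get_peft_model(model, LoraConfig(r=16, lora_alpha=32,
                                         target_modules=["q_proj", "v_proj"],
                                         task_type="CAUSAL_LM"))

# Domain-specific corpus (e.g., de-identified clinical notes) as JSONL.
dataset = load_dataset("json", data_files="clinical_notes.jsonl")["train"]
dataset = dataset.map(lambda ex: tokenizer(ex["text"], truncation=True,
                                           max_length=512), batched=True)

Trainer(
    model=model,
    args=TrainingArguments(output_dir="ft-out", num_train_epochs=1,
                           per_device_train_batch_size=2, learning_rate=2e-4),
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
).train()
```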
Presenter:
Aashu Singh, Senior Staff Software Engineer, Meta
About the Speaker:
Aashu Singh is a Senior Staff Software Engineer at Meta, where he leads cutting-edge initiatives in Multimodal Large Language Models for content understanding across Facebook and Instagram recommendation systems. With over 9 years at Meta and significant expertise in machine learning, Aashu specializes in developing AI solutions that bridge the gap between content comprehension and personalized recommendations.
In his current role within the MRS Content Understanding team, Aashu is pioneering approaches that leverage multimodal LLMs to enhance relevance throughout Meta’s recommendation stack. His work transforms how machine learning systems interpret and process content across different modalities to deliver more intuitive and personalized user experiences.
Aashu has co-authored several publications in the field of multimodal AI, including “Transfer between Modalities with MetaQueries” (2025) and “CompCap: Improving Multimodal Large Language Models with Composite Captions” (2024).
Previously, Aashu made substantial contributions to Meta’s advertising ecosystem, developing sophisticated models for Ads Retrieval/Ranking and Dynamic Ads, with a particular focus on modeling user interests to capture both long-term and short-term engagement patterns.
Track: Multimodal LLMs
Technical Level: 200 – Intermediate
Abstract:
This talk explores how the integration of Large Language Model reasoning capabilities is fundamentally transforming industry-scale recommendation systems, shifting from traditional pattern-matching approaches to cognitive frameworks that deliver unprecedented personalization and transparency.
Recent research demonstrates that LLM reasoning, particularly through Chain-of-Thought (CoT) prompting, significantly enhances recommendation quality by addressing the inherent subjectivity and personalization challenges of recommendation tasks. Rather than simply matching users to items based on statistical correlations, LLMs now employ sophisticated reasoning processes to assess user preferences and generate appropriately ranked recommendations. They also provide transparent explanations that elucidate their suggestions in natural language, dramatically improving system clarity.
The presentation will delve into recent industry implementations and research developments from early 2025 and look at how they have leveraged LLM-powered reasoning to enhance recommendation quality through techniques such as query understanding, metadata enrichment, and innovative evaluation frameworks. We’ll examine how multi-agent LLM orchestration enables collaborative reasoning processes where models share insights to generate more accurate conclusions, and explore key paradigms emerging in current research.
The talk will conclude with insights on how LLM reasoning is opening new frontiers for personalization without requiring curated gold references or human raters, pointing toward a future where recommendation systems don’t just predict what users want but understand why they want it—ultimately creating more meaningful and effective user experiences.
What You’ll Learn:
Application of Multi-modal LLM reasoning in industrial scale recommendation systems
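To make the chain-of-thought idea concrete, a CoT ranking prompt can be assembled along the lines below. This is an illustrative sketch, not Meta's production prompt; the user history, candidate items, and the call_llm helper are hypothetical:

```python
# Illustrative chain-of-thought ranking prompt (not from the talk; the
# user profile, candidate items, and call_llm() helper are hypothetical).
def build_cot_ranking_prompt(user_history, candidates):
    history = "\n".join(f"- {item}" for item in user_history)
    items = "\n".join(f"{i + 1}. {c}" for i, c in enumerate(candidates))
    return (
        "You are a recommendation assistant.\n"
        f"The user recently engaged with:\n{history}\n\n"
        f"Candidate items:\n{items}\n\n"
        "First, reason step by step about the user's likely interests and how "
        "each candidate relates to them. Then output a ranked list of candidate "
        "numbers with a one-sentence explanation for each."
    )

prompt = build_cot_ranking_prompt(
    user_history=["hiking vlog", "trail running shoes review"],
    candidates=["camping gear haul", "city nightlife tour", "ultralight backpack guide"],
)
# response = call_llm(prompt)   # call_llm is a placeholder for your LLM client
print(prompt)
```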
Presenter:
Eliezer Bernart, Staff Data Scientist, TELUS
About the Speaker:
Eliezer has over 10 years of experience in Software Development and Computer Science, spanning both industry and academia. His expertise is primarily in Data Science, where he focuses on scaling solutions and supporting technical development. At TELUS, on the AI Accelerator team, Eliezer contributes by developing solutions at the code level and designing architectures for various AI applications.
Track: MLOps for Smaller Teams
Technical Level: 300 – Advanced
Abstract:
Containers streamline AI development by providing consistent environments and efficient dependency management across development and production systems. They also let you self-host solutions for tracking your experiment metrics and your LLM performance. In this hands-on workshop you will learn how to leverage the power of containers to speed up your AI development experience!
What You’ll Learn:
How to use containers in your development environment to get free access to the most recent Machine Learning and GenAI tools, such as MLflow, Langfuse, and other platforms.
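As one hedged example of the pattern (not necessarily the workshop's exact setup), you can run the MLflow tracking server from its official container image and log experiments to it from your code; the image tag, port, experiment name, and metric values below are assumptions:

```python
# Sketch of the pattern (not the workshop's exact setup): run a tracking
# server in a container, then log experiments to it from your code.
# Roughly:
#   docker run -p 5000:5000 ghcr.io/mlflow/mlflow \
#       mlflow server --host 0.0.0.0 --port 5000
import mlflow

mlflow.set_tracking_uri("http://localhost:5000")   # the containerized server
mlflow.set_experiment("rag-prompt-tuning")          # illustrative experiment name

with mlflow.start_run():
    mlflow.log_param("chunk_size", 512)             # placeholder parameter
    mlflow.log_metric("answer_relevance", 0.87)     # placeholder metric
```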
Presenter:
Phani Dathar, Ph.D, Director, Graph Data Science, Neo4j
About the Speaker:
Phani is a Director of Graph Data Science at Neo4j. He is a computational scientist and holds a PhD in Nanotechnology and Computational Materials Science. After a decade of computational science research in both industry and academia, he transitioned to a career in data science and machine learning; over the past ten years, he has worked as a consultant, architect, and advisor in the AI/ML space. Currently, he is with Neo4j, helping prospects and customers design, architect, and develop applications using graph technology, graph analytics, and GraphRAG.
Track: Advanced RAG
Technical Level: 400 – Expert
Abstract:
Join us for the Workshop on Graph Technology and GraphRAG, designed for developers, data scientists, architects, and application owners.
The focus of our workshop is to dive into the world of graph technology, exploring how knowledge graphs can effectively model both unstructured and structured data. Neo4j graph data science algorithms provide additional insights to the knowledge graph, enhancing its capacity to derive new relationships and uncover hidden patterns. We will also discuss GraphRAG (Graph powered retrieval augmented generation) where knowledge graphs provide the context for LLMs to reduce hallucinations and provide accurate and relevant responses.
The associated notebooks, code examples and blogs provide hands-on experience building a practical application using the Neo4j database and related tooling to implement GraphRAG. This session is ideal for developers, data scientists, and architects eager to leverage the power of graphs in their projects.
What You’ll Learn:
Graph Technology: Overview of graph databases & Cypher
KnowledgeGraphs + GenAI: Learn how to power trustworthy GenAI with Neo4j
Best Practices: Integrate Neo4j into your workflow
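As a taste of what the workshop's notebooks cover, here is a minimal GraphRAG retrieval sketch using the official neo4j Python driver. The (:Chunk)-[:MENTIONS]->(:Entity) schema, the vector index name, the credentials, and the embedding dimension are hypothetical, not the workshop's actual dataset:

```python
# Minimal GraphRAG retrieval sketch (assumptions: official neo4j Python
# driver, a hypothetical (:Chunk)-[:MENTIONS]->(:Entity) schema, and a
# vector index named "chunk_embeddings").
from neo4j import GraphDatabase

driver = GraphDatabase.driver("neo4j://localhost:7687", auth=("neo4j", "password"))

CYPHER = """
CALL db.index.vector.queryNodes('chunk_embeddings', 5, $embedding)
YIELD node AS chunk, score
MATCH (chunk)-[:MENTIONS]->(e:Entity)
RETURN chunk.text AS text, collect(e.name) AS entities, score
ORDER BY score DESC
"""

def retrieve_context(embedding):
    with driver.session() as session:
        records = session.run(CYPHER, embedding=embedding)
        return [dict(r) for r in records]

# The retrieved chunks plus their graph neighbourhood become the LLM context.
context = retrieve_context([0.12] * 384)   # embedding from your model
```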
Presenter:
Eddie Mattia, AI Engineer, Outerbounds
About the Speaker:
Eddie Mattia is a data scientist with a background in applied math, and experience working in a variety of customer-facing and R&D roles. He currently works at Outerbounds to help customers and open-source practitioners build AI systems and products.
Track: GenAI Deployments In Regulated Industries
Technical Level: 200 – Intermediate
Abstract:
You never fine-tune an LLM just once. You need a setup that supports continuous iteration: curating data, refining prompts, optionally fine-tuning or post-training models, and evaluating and deploying changes, all in a reliable loop. In this talk, we’ll break down how to build an LLM factory: a modular, developer-friendly setup you can run in your own environment to power custom AI agents and differentiated AI-powered products and services which set you apart from the competition.
What You’ll Learn:
You need a closed-loop system to continuously build and iterate on differentiated AI products that go beyond off-the-shelf APIs.
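Outerbounds builds on open-source Metaflow, so one minimal (and hypothetical) way to sketch a single turn of that loop is as a Metaflow flow; the step bodies below are placeholders, not the talk's reference implementation:

```python
# Minimal sketch of one turn of an "LLM factory" loop as a Metaflow flow
# (step bodies are placeholders; the real loop would call your own
# curation, fine-tuning, and evaluation code).
from metaflow import FlowSpec, step

class LLMFactoryFlow(FlowSpec):

    @step
    def start(self):
        self.dataset = "curated_examples.jsonl"   # curate / refresh training data
        self.next(self.finetune)

    @step
    def finetune(self):
        # placeholder: run your fine-tuning or post-training job on self.dataset
        self.model_path = "checkpoints/latest"
        self.next(self.evaluate)

    @step
    def evaluate(self):
        # placeholder: run your eval suite and gate deployment on the score
        self.eval_score = 0.9
        self.next(self.end)

    @step
    def end(self):
        if self.eval_score > 0.85:                 # illustrative threshold
            print(f"Deploying {self.model_path}")

if __name__ == "__main__":
    LLMFactoryFlow()
```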
Presenter:
Simba Khadder, Founder & CEO, Featureform
About the Speaker:
Simba Khadder is the Founder & CEO of Featureform. Simba has built the ML infrastructure and models that powered personalization for over 100M monthly active users. He distilled those learnings into Featureform's feature platform and agentic enrichment platform, which unlocks your enterprise data for agents through MCP. He's also an avid surfer, a mixed martial artist, a published astrophysicist for his work on finding Planet 9, and he ran the SF marathon in basketball shoes.
Track: Vertical Enterprise AI Agents In Production
Technical Level: 200 – Intermediate
Abstract:
Agents need access to real enterprise data, but today, most are basically summarizers of docs and help centers. In this talk, we’ll explore how the Model Context Protocol (MCP) enables secure, governed access to internal systems like Postgres, Redis, and Iceberg. We’ll walk through real-world patterns for agentic enrichment, challenges in productionizing data access, and why every enterprise will need an MCP-native interface. Learn how Featureform built the first production-grade MCP server and what it takes to make agents enterprise-ready.
What You’ll Learn:
Enterprise agents are only as powerful as the data they can safely access. MCP is emerging as the standard for enabling this access, but making it work in real-world environments requires more than a spec. Attendees will learn how governed, semantic, real-time data access is essential for production-ready agents, and why the infrastructure layer beneath MCP matters just as much as the interface.
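For orientation only (this is not Featureform's server), a minimal MCP tool that exposes governed Postgres data to an agent might look like the following, using the official mcp Python SDK; the table, columns, and credentials are hypothetical:

```python
# Minimal MCP server sketch (not Featureform's implementation; assumes the
# official `mcp` Python SDK and a hypothetical Postgres features table).
import psycopg2
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("enterprise-data")

@mcp.tool()
def get_customer_features(customer_id: str) -> dict:
    """Return governed feature values for a customer from Postgres."""
    conn = psycopg2.connect("dbname=features user=agent_readonly")  # placeholder DSN
    with conn, conn.cursor() as cur:
        cur.execute(
            "SELECT lifetime_value, churn_risk FROM customer_features "
            "WHERE customer_id = %s", (customer_id,))
        ltv, churn = cur.fetchone()
    return {"lifetime_value": ltv, "churn_risk": churn}

if __name__ == "__main__":
    mcp.run()   # agents connect over MCP and call the tool above
```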
Business Leaders: C-Level Executives, Project Managers, and Product Owners will get to explore best practices, methodologies, and principles for achieving ROI.
Engineers, Researchers, Data Practitioners: Will get a better understanding of the challenges, solutions, and ideas being offered via breakouts & workshops on Natural Language Processing, Neural Nets, Reinforcement Learning, Generative Adversarial Networks (GANs), Evolution Strategies, AutoML, and more.
Job Seekers: Will have the opportunity to network virtually and meet 30+ top AI companies.
What is an Ignite Talk?
Ignite is an innovative and fast-paced style used to deliver a concise presentation.
During an Ignite Talk, presenters discuss their research using 20 image-centric slides which automatically advance every 15 seconds.
The result is a fun and engaging five-minute presentation.
You can see all our speakers and full agenda here