Anthony Caterini

Senior Research Machine Learning Scientist,

Layer 6 AI (Division of TD Bank)

ABOUT THE SPEAKER:

Anthony is a Senior Research Machine Learning Scientist, leading a team responsible for delivering predictive models in the finance & trading space. Prior to joining Layer 6, Anthony completed a PhD in the Department of Statistics at the University of Oxford, with a focus on statistical machine learning and generative modelling. He also completed a BMath and MMath at the University of Waterloo, with several internships focused on finance and research. Besides the applied side, Anthony has also helped deliver over fifteen research papers to top conferences and journals whilst at Layer 6, focusing on the areas of generative modelling, tabular data analysis, and anomaly detection.

TALK TITLE:

Expanding the Capabilities of Tabular Foundation Models

TRACK:

Fundamental Research (No Direct Business ROI)

SUB TOPIC:

Model Architecture – Training Methods

ABSTRACT:

Tabular data is ubiquitous worldwide, driving solutions for generic business problems, applied time series forecasting, and beyond. This inherent heterogeneity had hindered Tabular Foundation Models (TFMs) from rapidly generalizing to unseen datasets. In-Context Learning (ICL) offers a promising path for TFMs, enabling dynamic task adaptation without fine-tuning. Moving beyond re-purposed language models, we propose combining ICL-based retrieval with self-supervised learning to train dedicated TFMs. We evaluate real versus synthetic pre-training data, demonstrating that real data captures complex signals critical for improving downstream generalization. Incorporating this real data yields significantly faster training and superior adaptability across diverse contexts. Our resulting model, TabDPT, achieves strong performance across varied classification and regression benchmarks. Importantly, our pre-training procedure demonstrates that scaling model and data size drives consistent, power-law performance improvements. This echoes foundational scaling laws, confirming that robust, large-scale, and equitable TFMs are highly achievable. We have open-sourced our complete training and inference pipeline.

WHAT YOU’LL LEARN:

Tabular foundation models are continuing to vastly improve. Real data has been shown to be a legitimate option for pre-training despite previously being underutilized in favour of synthetic pre-training data. We see as well that tabular foundation models are starting to demonstrate scaling laws much like LLMs.

Anthony Caterini

Who Attends

2023 Event Demographics

2023 Technical Background

2023 Attendees & Thought Leadership