Model Training: From Infrastructure to Evaluation, Debugging & Optimization

When

Where

About This Event

Join Us for An Exclusive Technical Summit Where Leading Foundation-Model Researchers and Practitioners Converge to Tackle Real-World Challenges in Foundation Model Training.

This Immersive Event Bridges Theory and Practice, Offering Ai Researchers and Practitioners Training Foundation Models a Rare Opportunity to Exchange Battle-Tested Approaches for Infrastructure Scaling, Debugging Model Internals, Evaluation, and Optimization.

Focus Areas

1) Infrastructure Debugging & Monitoring

  • Diagnosing Performance Bottlenecks in Multi-GPU/Multi-Node Setups
  • Instrumenting Pipelines for Deep Observability (Profiling GPU Utilization, Data Flow, Etc.)
  • Correlating Infrastructure Metrics with Model States (Loss, Gradients) in Real Time
  • Failure Detection and Recovery Strategies in Distributed or HPC Environments

2) Model Internals & Debugging

  • Techniques for Analyzing Attention and Activation Patterns (Layer-By-Layer Visualizations)
  • Identifying and Fixing Gradient Issues (Vanishing, Exploding, Partial Inactivity)
  • Debugging Architectural or Layer-Level Bottlenecks
  • Leveraging Interpretability to Guide Early-Phase Debugging (During Pre-Training)

3) Evaluation

  • Designing Targeted Test Sets and Adversarial Evaluations for Foundation Models
  • Error Analysis Frameworks to Uncover Overlooked Failures or Biases
  • Establishing Benchmarks for Generalization, Robustness, and Emergent Capabilities
  • Integrating Evaluation Signals Back Into Hyperparameter Tuning and Model Iteration

4) Pre-Training Optimization

  • Hyperparameter Optimization at Foundation-Model Scale (e.G., Population-Based Training)
  • Data Pipeline Throughput (Streaming, Multi-Threaded I/O, Sharding)
  • Memory-Saving Strategies for Large Context Windows (Activation Checkpointing, Gradient Sharding)
  • Accelerating convergence (Curriculum Learning, Dynamic Batching, Advanced Scheduling)

Sign Up for TMLS 2023 News Updates

Who Attends

Attendees
0 +
Data Practitioners
0 %
Researchers/Academics
0 %
Business Leaders
0 %

2023 Event Demographics

Highly Qualified Practitioners
0 %
Currently Working in Industry*
0 %
Attendees Looking for Solutions
0 %
Currently Hiring
0 %
Attendees Actively Job-Searching
0 %

2023 Technical Background

Expert/Researcher
18.5%
Advanced
44.66%
Intermediate
27.37%
Beginner
9.39%

2023 Attendees & Thought Leadership

Attendees
0 +
Speakers
0 +
Company Sponsors
0 +

Business Leaders: C-Level Executives, Project Managers, and Product Owners will get to explore best practices, methodologies, principles, and practices for achieving ROI.

Engineers, Researchers, Data Practitioners: Will get a better understanding of the challenges, solutions, and ideas being offered via breakouts & workshops on Natural Language Processing, Neural Nets, Reinforcement Learning, Generative Adversarial Networks (GANs), Evolution Strategies, AutoML, and more.

Job Seekers: Will have the opportunity to network virtually and meet over 30+ Top Al Companies.

Ignite what is an Ignite Talk?

Ignite is an innovative and fast-paced style used to deliver a concise presentation.

During an Ignite Talk, presenters discuss their research using 20 image-centric slides which automatically advance every 15 seconds.

The result is a fun and engaging five-minute presentation.

You can see all our speakers and full agenda here

Get our official conference app
For Blackberry or Windows Phone, Click here
For feature details, visit Whova