Machine Learning for Protein Engineering - Part 1

2024 ARCHIVES

Machine Learning Approaches for Protein Engineering: Part 1 banner

Part 1 of the PEGS Europe machine learning program delves into essential strategies and best practices small and large research groups need to employ as they strive to use machine learning tools to accelerate and optimize biologics drug discovery and development. We will explore the pros and cons of different approaches for developing and accessing high quality training data and then consider ways of using methods for “out of set” predictions that present new opportunities for ML-based studies arising out of known antigens, structures and successful campaigns. And to empower smaller companies working to compete with the substantial resources of major research organizations, a session will showcase the workflows, capabilities and successes of a set of emerging biopharma companies structured around the use of ML/AI tools as a primary R&D paradigm.

Recommended Short Course*
Monday, 4 November, 14:00 – 17:00
SC4: In silico and Machine Learning Tools for Antibody Design and Developability Predictions
*Separate registration required. See short courses page for details. All short courses take place in-person only.

Wednesday, 6 November

07:30Registration and Morning Coffee

08:25

Chairperson's Remarks

Philip M. Kim, PhD, Professor, Molecular Genetics & Computer Science, University of Toronto

08:30

Scalable Active Learning for Therapeutic Antibody Design

Nathan Frey, PhD, Senior Machine Learning Scientist, Prescient Design, a Genentech Company

We will discuss our approach and general considerations for implementing active learning and design of experiments to iteratively optimise therapeutic antibody candidates. Our active learning framework is underpinned by both algorithmic innovations and robust data pipelines. We achieve improvements across binding affinity, expression yield, and developability properties via orthogonal optimisation approaches, analogous to the multitude of affinity maturation pathways observed in immune responses.

09:00

Expanding Open-Source Structure Prediction with OpenFold

Jennifer Wei, PhD, Machine Learning Software Engineer, OpenFold

The OpenFold Consortium brings together academic and industrial teams to build state-of-the-art protein structure and co-folding prediction models optimised for use on commercial computational hardware. We develop fully open-sourced models and support creation of new experimental datasets, aiming to build more powerful models that can accurately predict complex systems of significance to life sciences. In my presentation, I will present the latest modelling and software developments from the consortium.

09:30

Key Insights from Boehringer Ingelheim’s Digital Transformation Journey

Kausheek Nandy, Digital Transformation-Research, Boehringer Ingelheim Pharmaceuticals Inc.

Boehringer Ingelheim’s digital transformation journey began in 2023, focusing on three key areas. Firstly, empower scientists by liberating them from routine tasks, allowing them to concentrate on high-value work. Second, create a hub for innovation by building an in-house digital portal, a scientist-driven mechanism to formalize and standardize in silico protocols. Lastly, emphasize digital data capture FAIR and APIs, setting the stage for leveraging AI/ML in future.

10:00

AI-Driven De Novo Design of High-Affinity VHH for GPCR Targeting

Per Greisen, President, BioMap

The development of targeted biologics is often hindered by the challenges of identifying and engineering antibodies against specific epitopes. AI-powered de novo antibody design offers a promising solution, enabling precise epitope selection and sequence optimization. Here, we leverage a synergistic combination of structural sampling diffusion models and our proprietary large language model (xTrimo) to design VHH antibodies against a functional GPCR epitope.

10:30Coffee Break in the Exhibit Hall with Poster Viewing

11:15

Pioneering Data-Driven Strategies in de novo Nanobody Design

Roberto Spreafico, PhD, Director, Discovery Data Science, Genmab

AI's potential to create antibodies from scratch is promising but hampered by poor hit rates and binding strengths, rooted in insufficient training data. We have addressed this issue by using computational simulations to determine data requirements such as modality, amount, and diversity. Simulations have been guiding our ongoing experimental data generation work, marking a shift towards a data-centric strategy that complements recent algorithmic progress, aiming to overcome current challenges.

11:45

KEYNOTE PRESENTATION: Generating Data and Labels to Train AI Models for the Design of Better Therapeutic Antibodies

Yanay Ofran, PhD, Founder, CEO, Biolojic Design Ltd.

This presentation focuses on the challenges in obtaining large and well-labeled datasets for training effective AI models. High-throughput data is often not sufficiently labeled to allow for the training of good models. I will review current approaches to coping with this challenge and propose a path to generating and labeling data to train models that design better antibodies that do things that traditionally discovered antibodies are unlikely to do.

12:15

LUNCHEON PRESENTATION: Cutting through the Hype: Real-World Applications of AI in Antibody Discovery and Engineering

Mary Ann Pohl, Director of Alliance Management, Biologics Discovery, XtalPi Inc.

Artificial intelligence (AI) is transforming antibody discovery and engineering. Ailux's platform synergistically combines the best of wet lab and AI. We will explore a series of case studies that exemplify the applications of our AI-driven approach for tackling difficult GPCR targets, designing next-gen display libraries, predicting Ab-Ag complex structures and engineering challenging molecules. This presentation provides a realistic and evidence-based perspective on AI’s impact on the industry.

12:45Luncheon in the Exhibit Hall with Poster Viewing

13:45

Chairperson’s Remarks

Amir P. Shanehsazzadeh, Artificial Intelligence Scientist, Absci Corp.