De novo design of protein interactions with learned surface fingerprints
B. E. Correia P. Gainza S. Wehrle A. Van Hall−Beauvais A. Marchand A. Scheck Z. Harteveld S. Buckley D. Ni S. Tan F. Sverrisson C. Goverde P. Turelli C. Raclot A. Teslenko M. Pacesa S. Rosset S. Georgeon J. Marsden A. Petruzzella K. Liu Z. Xu Y. Chai P. Han G. F. Gao E. Oricchio B. Fierz D. Trono H. Stahlberg M. B. Bronstein
Abstract
Physical interactions between proteins are essential for most biological processes governing life. However, the molecular determinants of such interactions have been challenging to understand, even as genomic, proteomic and structural data increase. This knowledge gap has been a major obstacle for the comprehensive understanding of cellular protein–protein interaction networks and for the de novo design of protein binders that are crucial for synthetic biology and translational applications. Here we use a geometric deep-learning framework operating on protein surfaces that generates fingerprints to describe geometric and chemical features that are critical to drive protein–protein interactions. We hypothesized that these fingerprints capture the key aspects of molecular recognition that represent a new paradigm in the computational design of novel protein interactions. As a proof of principle, we computationally designed several de novo protein binders to engage four protein targets: SARS-CoV-2 spike, PD-1, PD-L1 and CTLA-4. Several designs were experimentally optimized, whereas others were generated purely in silico, reaching nanomolar affinity with structural and mutational characterization showing highly accurate predictions. Overall, our surface-centric approach captures the physical and chemical determinants of molecular recognition, enabling an approach for the de novo design of protein interactions and, more broadly, of artificial proteins with function.