Most biological processes are intrinsically coordinated through complex networks of protein-protein interactions. The diversity and scale of these networks offer a highly selective and tunable way of modulating protein function. While genetic disease-associated mutations are known to be enriched at protein interacting interfaces, their effects on interaction dynamics and signalling pathways remain largely unexplored.
To overcome this, I have developed novel methods using artificial intelligence to rapidly and accurately explore the effects of mutations on protein interactions. These methods were validated using independent experimental data, outperforming previous approaches, and are widely used in industry and academia (over 1 million hits/year).
Harnessing the insights from these computational tools, I explored the role of mutations in Mycobacterium Tuberculosis, SARs-CoV-2 and rare genetic diseases. This showed that up to 60% of mutations involved in drug resistance and disease phenotypes would lead to significant disruption of key protein-protein interactions. While this supported the important role protein-protein interactions were playing, developing therapies to target these interactions is more challenging than traditional active site pockets. To improve the efficacy of efforts to identify protein-protein interaction modifiers, I proposed two innovative virtual screening approaches - one using information on the small molecule alone, and the other considering the target structure.
Following recent breakthroughs in computational approaches for predicting protein structures, I have recently proposed a novel geometric deep learning approach to help identify regions of a protein surface that are likely to mediate protein interactions in order to provide a link between 3D structure and biological function.
This work provides a foundation for systematically characterising and analysing protein-protein interaction networks. We are currently using it to identify the important and junk interactions for any organism, as part of clinical genomic and protein engineering pipelines and of larger drug discovery efforts.