A new deep-learning framework combines protein language models and 3D structure to accelerate vaccine antigen discovery. Vaccines remain the most powerful tool for preventing infectious diseases but designing them is far from simple. One of the biggest challenges is identifying the right protective antigens: the specific pathogen proteins that can trigger strong and effective immune responses. Pathogens can produce thousands of proteins, and experimentally testing each one is slow, costly, and often impractical during outbreaks. Now, a new study introduces an artificial intelligence–based framework that could dramatically speed up this process. The researchers developed a computational pipeline called PLGDL (Protein Language and Geometric Deep Learning), designed to predict which pathogen proteins are most likely to serve as effective vaccine antigens. What sets PLGDL apart is its ability to integrate:
- Protein language models, which learn patterns directly from amino acid sequences
- Geometric deep learning, which captures three-dimensional structural features of proteins
- Viruses
- Bacteria
- Eukaryotic pathogens
- Rapidly identified several previously known protective antigens
- Discovered a new candidate antigen, G10R, not previously highlighted as a vaccine target

