Columbia Technology Ventures

Machine learning pipeline for neoantigen prioritization in personalized immunotherapies

This technology utilizes machine learning to accurately predict and prioritize cancer neoantigens by integrating diverse genomic and transcriptomic features beyond traditional peptide-major histocompatibility complex (MHC) binding affinity.

Unmet Need: Accurate neoantigen prediction for personalized immunotherapy

Current approaches to neoantigen prediction rely on peptide-major histocompatibility complex (MHC) binding affinity, overlooking key factors such as antigen processing, transport, and T-cell recognition that are essential for immunogenicity. This leads to limited accuracy in real-world clinical scenarios, leading to ineffective neoantigen selection. As personalized cancer therapies expand, there is an urgent need for more accurate and comprehensive prediction tools to effectively guide target selection and patient stratification. Addressing these shortcomings is critical to improving therapeutic outcomes and streamlining immunotherapy development.

The Technology: Integrated machine learning platform for improved neoantigen prediction

The technology, called ImmuneMirror, uses a machine learning-based computational pipeline to predict and prioritize immunogenic neoantigens by integrating diverse genomic and transcriptomic inputs. Unlike traditional tools that focus mainly on peptide-MHC binding affinity, this tool incorporates critical parameters such as peptide processing, transport efficiency, binding stability, and immune recognition probabilities. ImmuneMirror leverages a balanced random forest model, trained on experimentally validated immunogenic neoantigens. This tool significantly improves prediction accuracy and includes both a standalone pipeline as well as an accessible web server that streamlines neoantigen prediction directly from sequencing data. As such, ImmuneMirror may facilitate the identification and prioritization of therapeutic targets for personalized immunotherapies and enhance patient stratification in clinical oncology trials.

The pipeline has been validated using patient-derived data from gastrointestinal tract cancers, demonstrating improved predictive performance compared to existing methods.

Applications:

  • Personalized cancer vaccines
  • Cancer research applications
  • Clinical trial stratification
  • Optimization of immunotherapy panels
  • Biomarker discovery
  • Preclinical screening of immunogenic mutations
  • Personalized oncology treatment pipelines
  • Infectious disease research
  • Organ transplant applications

Advantages:

  • Predicts neoantigens by combining peptide affinity, stability, transport, and immune recognition metrics
  • Employs a balanced random forest machine-learning model trained on experimentally validated neoantigens
  • Demonstrates enhanced accuracy over existing methods, achieving high predictive performance
  • Accessible as a cost-effective web-based tool and standalone software
  • Unbiased and automated

Lead Inventor:

Zhonghua Liu, Sc.D.

Related Publications:

Tech Ventures Reference: