Deep learning using tumor HLA peptide mass spectrometry datasets improves neoantigen identification

B Bulik-Sullivan, J Busby, CD Palmer, MJ Davis… - Nature …, 2019 - nature.com
B Bulik-Sullivan, J Busby, CD Palmer, MJ Davis, T Murphy, A Clark, M Busby, F Duke
Nature biotechnology, 2019nature.com
Neoantigens, which are expressed on tumor cells, are one of the main targets of an effective
antitumor T-cell response. Cancer immunotherapies to target neoantigens are of growing
interest and are in early human trials, but methods to identify neoantigens either require
invasive or difficult-to-obtain clinical specimens, require the screening of hundreds to
thousands of synthetic peptides or tandem minigenes, or are only relevant to specific human
leukocyte antigen (HLA) alleles. We apply deep learning to a large (N= 74 patients) HLA …
Abstract
Neoantigens, which are expressed on tumor cells, are one of the main targets of an effective antitumor T-cell response. Cancer immunotherapies to target neoantigens are of growing interest and are in early human trials, but methods to identify neoantigens either require invasive or difficult-to-obtain clinical specimens, require the screening of hundreds to thousands of synthetic peptides or tandem minigenes, or are only relevant to specific human leukocyte antigen (HLA) alleles. We apply deep learning to a large (N = 74 patients) HLA peptide and genomic dataset from various human tumors to create a computational model of antigen presentation for neoantigen prediction. We show that our model, named EDGE, increases the positive predictive value of HLA antigen prediction by up to ninefold. We apply EDGE to enable identification of neoantigens and neoantigen-reactive T cells using routine clinical specimens and small numbers of synthetic peptides for most common HLA alleles. EDGE could enable an improved ability to develop neoantigen-targeted immunotherapies for cancer patients.
nature.com