BigMHC Training and Evaluation Data

Published: 30 August 2022| Version 1 | DOI: 10.17632/dvmz6pkzvb.1
Contributors:
,
,
,
,
,
,

Description

All training and evaluation data used in the BigMHC study (https://doi.org/10.1101/2022.08.29.505690) After inflating all zipped files, the total size of this repository is about 4.8 GB. All other data and code are freely available at https://github.com/KarchinLab/bigmhc -------------------------------------------------------------------------------- CSV Columns mhc - MHC-I allele pep - peptide sequence if epitope data or mutated peptide sequence if neoepitope data tgt - target value of 1 (presented/immunogenic) or 0 (non-presented/non-immunogenic) All other columns are the outputs of MHC-I epitope presentation and immunogenicity predictors. -------------------------------------------------------------------------------- Files el_train.csv.zip - epitope presentation training data (inflates to 404 MB) el.csv - epitope presentation evaluation data im_train.csv - immunogenicity transfer learning data iedb.csv - infectious disease epitope evaluation data ifng.csv - neoepitope evaluation data validated with IFN-γ release assays manafest.csv - neoepitope evaluation data validated with MANAFEST assays neg.csv.zip - non-presented epitopes to supplement immunogenicity evaluation (inflates to 4.4 GB)

Files

Institutions

Johns Hopkins Medicine, Johns Hopkins University

Categories

Immunology, Bioinformatics, Mass Spectrometry, Cancer, Immunoassay, Epitope, Major Histocompatibility Complex

License