Data for: Practical bioinformatic DNA-sequencing pipeline for detecting oncogene amplification and EGFRvIII mutational status in clinical glioblastoma samples

Published: 15-04-2019| Version 1 | DOI: 10.17632/nh5y2dddd2.1
Contributors:
Michael Miller,
Aneta Waluszko,
Jessica Tome-Garcia,
Tatyana Sidorenko,
Fei Ye,
Nadejda Tsankova

Description

Dataset 1. Sequencing and z-score normalized data results from reference and CNS samples. (A and B) Reference population coverage with means and standard deviations used to calculate z-scores shown at the top. Amplicon z-scores and followed by gene z-scores (A) and EGFRvIII z-scores (B) used to identify focal gene amplification (z-score > 5, final column in A) and EGFRvIII (z-score > 10, final column in B), respectively (EGFR.d2.7 = EGFRvIII). Dataset 2. Reference values used to calculate amplicon and EGFRvIII z-scores from raw coverage matrix and sample metadata. For complete and actively updated repository that uses this reference data, refer to https://github.com/Michael-L-Miller/CoveRageAnalysis (doi:10.5281/zenodo.1220399).

Files