Benchmark gene reference data for Breast Cancer

Published: 29 July 2022| Version 2 | DOI: 10.17632/xdkvk75ns7.2
Sushrutha Raj, Athira P Anil, Anshita Shukla,
, Alok Srivastava


The presented data comprises of raw and processed data related to genes and its association with breast cancer. Raw data was processed by double fold manual validation to annotate a dataset for Breast Cancer associated genes. This association is classified into three class as positive association, negative association and ambiguous association. This data can be further be explored to study their roles in specific subtypes of breast cancer, their metastasis study based on common genes associated with other disease, as well as for system level modelling, meta-analysis of disease to study the differential evolution of the genes impact the disease.



Artificial Intelligence, Breast Cancer, Natural Language Processing, Machine Learning, Benchmarking, Biological Database, Classification System, Data Validation