The dataset that illustrates the running process of DASDC.

Published: 25 April 2024| Version 4 | DOI: 10.17632/vdg2nbpc4j.4
Contributor:
hui song

Description

There are some datasets that illustrates the running process of DASDC(https://github.com/soo-h/DASweepDetect) The folder of simu_config is the input from step 1 of the DASDC software,corresponds to the contents of the simu_config folder in demo. The folder of simu_data is the output from step 1 of the DASDC software (that is, the input file from step 2),corresponds to the contents of the simu_data folder in demo. The folder of simu_feature is the output from step 2 of the DASDC software (that is, the input file from step 3),corresponds to the contents of the simu_feature folder in demo. The folder of real_feature_portion is part of the output(only chr18) from step 3 of the DASDC software,corresponds to some of the contents of the real_feature folder in demo. The folder of trainSet is the output from step 4 of the DASDC software (that is, the input file from step 5),corresponds to the contents of the trainSet folder in demo. The folder of real_feature is the the input file from step 5 of the DASDC software,corresponds to the contents of the real_feature folder in demo. The folder of pred_res is the output from step 6 of the DASDC software,corresponds to the contents of the pred_res folder in demo. Note: Due to the large size of the whole genome feature file, only chromosome 18 is uploaded in real_feature_portion. If you wish to execute DASDC, please utilize the files within "real_feature," which is a random sample of the entire output from step 3, with an equal proportion of simulated to real data at a ratio of 1:1. available_models put some models trained for specific species by DASDC. DASDC_dependency.tar.gz is the compressed file of the third-party package of DASDC.

Files

Institutions

Huazhong Agriculture University

Categories

Biological Database

Licence