Data for: Predicting eukaryotic protein secretion without signals

Published: 31 March 2020| Version 1 | DOI: 10.17632/4yjrt82748.1
Contributors:
Henrik Nielsen,

Description

Datasets extracted from UniProt for benchmarking methods for predicting non-classical secretion. Data are in two formats: FASTA sequences (*.fast) and lists of UniProt accession codes (*.ac.txt). The *_secreted_* files are proteins with subcellular location confirmed as secreted, but without signal peptides. The *_non_secreted_* files are proteins with subcellular location confirmed as cytoplasmic or nuclear. The mammalia_* files are from mammals; the eukaryota_* files are from eukaryotes excluding mammals.

Files

Categories

Protein-Related Bioinformatics

Licence