MehNet Source Code and IDs

Published: 9 January 2024| Version 1 | DOI: 10.17632/24x9hdckx5.1
Contributor:
Mehmet Erten

Description

The dataset is related to the article "MehNet: A vigesimal-based model by amino acid melting points generates unique ID numbers for protein sequences". The MehNet study aims to assign a constant value to each amino acid, thereby creating distinctions among protein sequences. The datasets used in this study were obtained from the UniProt Knowledgebase. Subsequently, these datasets underwent preprocessing steps, and identical sequences were categorized under the same headings. Each amino acid was ranked based on its respective melting point and was assigned a vigesimal digit. These generated vigesimal digits were subsequently converted to decimal values. The centerpiece of this methodology was the melting point hashing table, which was given the name "MehNet." Ultimately, each protein sequence was assigned a unique identification number. This approach successfully digitized protein sequences.

Files

Categories

Medicine, Genetics, Clinical Biochemistry, Artificial Intelligence, Computational Mathematics, Bioinformatics, Protein, Machine Learning

Licence