Protein regulatory relationships in COVID19

Published: 25-04-2020| Version 2 | DOI: 10.17632/3pm7yy4xj9.2
Contributors:
Finlay Maclean,
null null,
Antony Scerri

Description

This dataset was extracted from the Elsevier Pathway Studio, a tool that helps scientists analyze experimental data to answer biologically meaningful questions. The dataset itself consists of biological relationships between diseases (MERS and SARS), proteins and molecules. The relationships are of various types including Regulation, Target, Molecular Transport, etc. You can find a mapping of the relationship name to a description on this support page: https://service.elsevier.com/app/answers/detail/a_id/3014/supporthub/pathway/ The source of these relationships are life sciences and biomedical articles, from various publishers. We make use of taxonomies, curated and maintained by subject matter experts, to extract the right terms from text and map them to the correct identifiers. Subject matter experts have also helped us create the rules and information extraction patterns to optimize the extraction of relationships from text. At last, the pubmed identifiers from which the relationships were extracted are also part of the dataset. The .cypher and .json files can be imported into the graph database neo4j. The .csv files can be used to import into other systems.