Medicines Information Dataset (MID)

Published: 14 August 2024 | Version 1 | DOI: 10.17632/2vk5khfn6v.1
Contributor:
Hezam Gawbah

Description

A large volume of research on medicines is conducted every day. To address the shortcomings of models for medicines information generation, prediction, and classification, we introduce a large textual dataset of medicines information, which we named ‘MID’ (Medicines Information Dataset).

  • Value of the data
    - MID is, to our knowledge, the largest available and representative medicines information dataset covering a wide variety of drugs.
    - Its size makes it robust for generating information about drugs, such as indications or interactions.
    - MID offers more than 192k rows distributed across 45 therapeutic classes, making it robust for classifying drugs by therapeutic label.
    - MID provides accurate, authoritative, and trustworthy information on medicines for enhancing predictions and efficiency in clinical trial management.
    - In contrast with the few small datasets available, MID's size makes it a suitable corpus for implementing both classical and deep learning models.
  • MID.xlsx provides the raw data, which comprise the medicines information. The data were collected to accelerate work on medicines and reduce experimental effort by helping to predict, generate, or classify medicines information preclinically, and to facilitate detailed analysis of the risks affecting participants in clinical trials.
  • Therapeutic_class_counts.xlsx summarizes the distribution of medicines by therapeutic class.
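As an illustration of how a per-class summary like the one in Therapeutic_class_counts.xlsx could be recomputed from the raw file, here is a minimal Python sketch. It rests on assumptions: the column name "Therapeutic Class" is a guess and should be checked against the actual header row of MID.xlsx, and class_distribution and load_rows are hypothetical helper names, not part of the dataset. Loading relies on pandas.read_excel, which needs the openpyxl package for .xlsx files.

```python
from collections import Counter

# Assumed column name; verify it against the real header row of MID.xlsx.
CLASS_COLUMN = "Therapeutic Class"


def class_distribution(rows, class_column=CLASS_COLUMN):
    """Count rows per therapeutic class, skipping rows with no label."""
    counts = Counter()
    for row in rows:
        label = row.get(class_column)
        # Skip missing values: None, or NaN (the only value not equal to itself).
        if label is None or label != label:
            continue
        text = str(label).strip()
        if text:
            counts[text] += 1
    return counts


def load_rows(path="MID.xlsx"):
    """Read the spreadsheet into a list of dicts (requires pandas and openpyxl)."""
    import pandas as pd  # imported lazily so the counting helper has no dependencies

    frame = pd.read_excel(path)
    return frame.to_dict(orient="records")


if __name__ == "__main__":
    # Toy rows with placeholder labels, standing in for load_rows("MID.xlsx").
    sample = [
        {"Therapeutic Class": "CLASS_A"},
        {"Therapeutic Class": "CLASS_B"},
        {"Therapeutic Class": "CLASS_A"},
        {"Therapeutic Class": None},
    ]
    for label, n in class_distribution(sample).most_common():
        print(label, n)
```

With the real file available, replacing the toy rows with load_rows("MID.xlsx") would give counts that can be compared against Therapeutic_class_counts.xlsx as a sanity check before training a classifier.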

Files

Institutions

  • Ibb University

Categories

Medicine, Pharmacy, Clinical Trial, Natural Language Processing, Drug, Therapeutics, Confidentiality in Healthcare, Drug Information, Clinical Prediction Model, Pharmacoinformatics

Licence