AHQAD: Arabic Healthcare Question and Answer Dataset

Published: 22 April 2024 | Version 2 | DOI: 10.17632/mgj29ndgrk.2
Hezam Gawbah


Language-centric research on healthcare is growing steadily. To address shortcomings of Arabic natural language generation models, we introduce AHQAD, a large Arabic Healthcare Question and Answer Dataset of textual data; the name is an acronym of that description. To the best of our knowledge, AHQAD is the largest Arabic healthcare question-and-answer dataset. It was collected from a medical website and consists of more than 808k question-answer pairs across 90 categories. AHQAD is distributed as a single file containing the data in Arabic: questions, answers, and categories.
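As a minimal sketch of how the file might be inspected, the Python snippet below counts question-answer pairs per category. It assumes a CSV layout with columns named Question, Answer, and Category; the format and column names are assumptions, not confirmed by this record, and the rows shown are invented placeholders rather than dataset content.

```python
import csv
import io
from collections import Counter


def read_rows(handle):
    """Read rows from a CSV handle into a list of dicts keyed by column name."""
    return list(csv.DictReader(handle))


def count_by_category(rows, category_field="Category"):
    """Count question-answer pairs per category.

    The default field name "Category" is a hypothetical column name.
    """
    return Counter(row[category_field] for row in rows)


# Invented placeholder rows standing in for the real file (not dataset content).
sample = io.StringIO(
    "Question,Answer,Category\n"
    "سؤال تجريبي 1,إجابة تجريبية 1,فئة أ\n"
    "سؤال تجريبي 2,إجابة تجريبية 2,فئة أ\n"
    "سؤال تجريبي 3,إجابة تجريبية 3,فئة ب\n"
)

rows = read_rows(sample)
counts = count_by_category(rows)
for category, n in counts.most_common():
    print(category, n)
```

To run this against the downloaded file, replace the in-memory sample with open(path, encoding="utf-8", newline=""), where path points to the file, and adjust the column names to match its actual header.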



Ibb University


Medical Assistant, Natural Language Processing, Arabic Language, Healthcare Research, Natural Language Generation, Text Processing, Deep Learning, Natural-Language Understanding, Chatbot