Dataset for Arabic Word Sense Disambiguation

Published: 3 July 2024 | Version 2 | DOI: 10.17632/pmdbs9tby8.2
Contributors:
Sanaa Kaddoura

Description

This dataset supports research in Arabic Word Sense Disambiguation.

If you use the data, please cite the data descriptor paper: Kaddoura, Sanaa, and Reem Nassar. "A Comprehensive Dataset for Arabic Word Sense Disambiguation." Data in Brief (2024): 110591. Paper link: https://www.sciencedirect.com/science/article/pii/S2352340924005584

The data was analyzed in the following article: Kaddoura, Sanaa, and Reem Nassar. "EnhancedBERT: A Feature-rich Ensemble Model for Arabic Word Sense Disambiguation with Statistical Analysis and Optimized Data Collection." Journal of King Saud University-Computer and Information Sciences (2024): 101911. Paper link: https://www.sciencedirect.com/science/article/pii/S1319157823004652

Steps to reproduce

The data can be expanded by adding more records for other Arabic words.

Institutions

Zayed University, Zayed University - Abu Dhabi Campus

Categories

Artificial Intelligence, Natural Language Processing, Machine Learning, Arabic Language

Funding

Zayed University

R22047

Licence