Arabic Stemming to Select Index Terms.

Published: 11 July 2023 | Version 1 | DOI: 10.17632/w4y9pb9w8n.1
Contributor:
Samer Yaseen

Description

The data consists of 34 classes of Arabic terms. Each class contains similar terms. The aim of this data is to measure a stemmer's ability to group similar terms into the fewest possible index terms. Producing the linguistically correct stem is not important; what matters is that the terms in a class map to as few index terms as possible.
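
As an illustration of the measurement described above, the following is a minimal sketch that counts how many distinct index terms a stemmer produces for each class. The layout of the data files is not described here, so the classes are represented by a hypothetical dictionary of toy terms, and `toy_stem` is a stand-in that only strips the definite article; neither is part of the dataset or of any real stemmer.

```python
from typing import Callable, Dict, List


def count_index_terms(
    classes: Dict[str, List[str]], stem: Callable[[str], str]
) -> Dict[str, int]:
    """For each class, count the distinct stems the stemmer produces."""
    return {
        name: len({stem(term) for term in terms})
        for name, terms in classes.items()
    }


def toy_stem(term: str) -> str:
    # Hypothetical stand-in stemmer: removes only the definite article "ال".
    if term.startswith("ال") and len(term) > 3:
        return term[2:]
    return term


# Hypothetical toy classes; the real dataset has 34 classes.
toy_classes = {
    "class_1": ["كتاب", "الكتاب"],
    "class_2": ["مدرسة", "المدرسة", "مدارس"],
}

counts = count_index_terms(toy_classes, toy_stem)
total = sum(counts.values())
print(counts)  # {'class_1': 1, 'class_2': 2}
print(total)   # 3
```

Under this reading, a count of 1 for a class means the stemmer grouped all of its terms into a single index term, and a lower total across classes indicates better grouping. The sketch does not check whether terms from different classes collide on the same stem, which a fuller evaluation would probably want to track as well.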

Files

Steps to reproduce

The data was collected and revised with the help of an expert in the Arabic language.

Institutions

Sana'a University

Categories

Information Retrieval, Arabic Language

Licence