EN-UR Parallel Dataset

Name: EN-UR Parallel Dataset
Creator: Rabail Asghar
Published: 2025-04-02T10:13:16.576Z
Keywords: Machine Translation, Parallel Database

Asghar, Rabail

doi:10.17632/bmh9xjyjgb.2

EN-UR Parallel Dataset

Published: 2 April 2025| Version 2 | DOI: 10.17632/bmh9xjyjgb.2

Contributor:

Rabail Asghar

Description

The parallel corpus is gathered from various accessible sources, encompassing distinct domains such as Journalism, the Quran, the Bible, News, Subtitles, Movies, COVID-19, and Human Rights. The gathered data went through a thorough preprocessing pipeline designed to guarantee the utmost quality for training translation models. Contact at: rabailasghar97@gmail.com

EN-UR Parallel Dataset

Description

Files

Categories

Licence