EN-UR Parallel Dataset

Published: 24 November 2023| Version 1 | DOI: 10.17632/bmh9xjyjgb.1
Contributor:
Rabail Asghar

Description

The parallel corpus is gathered from various accessible sources, encompassing distinct domains such as Journalism, the Quran, the Bible, News, Subtitles, Movies, COVID-19, and Human Rights. The gathered data went through a thorough preprocessing pipeline designed to guarantee the utmost quality for training translation models.

Files

Categories

Machine Translation, Parallel Database

Licence