Data for: Learning English-Chinese Bilingual Word Representations from Sentence-Aligned Parallel Corpus

Published: 12 Feb 2019 | Version 1 | DOI: 10.17632/zmcwxt5h7m.1

Description of this data

This file includes three datasets for our tasks: bilingual dictionary induction, cross-lingual analogy reasoning, and cross-lingual word semantic relatedness.
We release them so that the NLP community can explore related issues.

Experiment data files

This data is associated with the following publication:

Learning English–Chinese bilingual word representations from sentence-aligned parallel corpus

Published in: Computer Speech & Language

Latest version

  • Version 1

    Published: 2019-02-12

    DOI: 10.17632/zmcwxt5h7m.1

    Cite this dataset

    Chen, Hsin-Hsi; Huang, Hen-Hsen; Yen, An-Zi (2019), “Data for: Learning English-Chinese Bilingual Word Representations from Sentence-Aligned Parallel Corpus”, Mendeley Data, v1 http://dx.doi.org/10.17632/zmcwxt5h7m.1

Statistics

Views: 79
Downloads: 14

Categories

Computational Linguistics

Licence

CC BY 4.0

The files associated with this dataset are licensed under a Creative Commons Attribution 4.0 International licence.

What does this mean?

You can share, copy and modify this dataset so long as you give appropriate credit, provide a link to the CC BY licence, and indicate if changes were made, but you may not do so in a way that suggests the rights holder has endorsed you or your use of the dataset. Note that further permission may be required for any content within the dataset that is identified as belonging to a third party.
