Inter-Korean summit corpus

Published: 1 September 2020| Version 1 | DOI: 10.17632/mp3drsh4hs.1
Contributor:
Jin Hee Park

Description

This corpus was compiled for Park, J. (2020). Discourse Construction of Inter-Korean Summits in South Korean Newspapers. Manuscript submitted for publication. The data set consists of 6 diachronic specialised corpora comprising news reports including the word, nampwukcengsanghoytam, 'inter-Korean summit' or its homonymous expressions, cengsanghoytam, 'summit' or hoytam, 'talks' which appeared in Chosun ilbo and Hankyoreh shinmun during each summit (13–15 June 2000, 2–4 October 2007, and 27 April, 26 May, and 18–20 September 2018 ). All the articles except for Hankyoreh’s newspaper reports on the 2000 summit were retrieved from the websites of the two newspaper companies, Chosun ilbo (www.chosun.com) and Hankyoreh shinmun (www.hani.co.kr). News coverage of the 2000 summit in Hankyoreh shinmun was collected from the online news database BICKinds (www.kinds.or.kr) because Hankyoreh’s online website provides only partial access to the archive for the year 2000. The corpora comprised 307,753 tokens and 55,909 types. The texts were grammatically annotated according to Trends 21 (http://corpus.korea.ac.kr) PoS tagset, which was used for a reference corpus.

Files

Institutions

Universiteit Leiden

Categories

Media Studies, Discourse Analysis, Corpus Linguistics, Korean Language, Asian Studies

Licence