Corpus and survey dataset for effects of discursive proximization strategies

Published: 24 July 2025| Version 1 | DOI: 10.17632/2r6x9rghv9.1
Contributor:
Yanmin Zhang

Description

This dataset supports the research entitled "The cognitive-pragmatic effects of proximization in discourse of public health crisis: Evidence from corpus-based experiments". The data include a Chinese news corpus on Covid-19 and two surveys. The corpus comprises 197 texts with 189,331 characters. It was compiled from reports spanning from late 2019 to mid-2022 on two major Chinese digitalized newspapers, and was segmented using CorpusWordParser 3.0. The data of two surveys consist of the results of responses to questionnaires from a total sample of 560 participants, 216 for Experiment 1 and 344 for Experiment 2. The primary aim of the study was to examine the language patterns enacting proximization strategies in Chinese news reports of Covid-19 and the effects of proximization upon audience perceptions. The dataset for the surveys is fully anonymized and shared in accordance with institutional ethical standards and data protection protocols.

Files

Steps to reproduce

The study included a corpus of 197 Chinese news texts and two surveys of 560 adults from all the five major regions of China recruited online from Cradamo, a professional crowd-sourcing platform. Consent and demographic information were collected before and after the questionnaire. The questionnaires in the two experiments were different in the presence and absence of explicit evaluative clause. CorpusWordParser 3.0 (Xiao, 2014): used to segment the corpus of Chinese texts SPSS (IBM): Used for data entry and statistical analysis.

Institutions

  • Nanjing Institute of Technology

Categories

Pragmatics, Cognitive Effect

Licence