NewsMinerCollection

Published: 24 October 2019| Version 3 | DOI: 10.17632/9j47dhd4kx.3
Contributors:
Tiemi Sakata,
,

Description

The NewMinerCollection was built by collecting news, in the English language, from The Guardian, CNN, BBC, Fox News, NyPost, China Daily and CNBC websites, from 1990 until 2016 using a web crawler. This dataset contains 7000 news items equally distributed among the seven categories: id_category category 1 Arts, Culture & Entertainment 4 Economy, Business & Finance 6 Environmental Issues 10 Lifestyle & Leisure 11 Politics 15 Sport 13 Science & Technology

Files

Institutions

Universidade Federal de Sao Carlos

Categories

Computing, Text Mining

Licence