Topic-aware sentiment analysis

Published: 9 June 2021 | Version 2 | DOI: 10.17632/m4ndy7tcss.2
Contributors:
Iskander Akhmetov

Description

Data used in experiments with topic-aware sentiment analysis. The code for the paper is currently in this repository: https://github.com/iskander-akhmetov/Topic-Aware-Sentiment-Analysis-of-News-Articles/

The source corpora we used, collected, or augmented:

Original corpora:
-------------------
1. News articles scraped from different Kazakhstani online media resources and labeled by experts: IAC_experts_labeled_corpus.zip
2. Kaggle Sentiment Analysis in Russian dataset: https://www.kaggle.com/c/sentiment-analysis-in-russian/data
3. Scraped TengriNews.kz articles corpus: tengrinews_articles-all-clean_lt.zip

Corpora lemmatized and augmented with topic labels (by us):
------------------------------------------------------------------
1. IAC augmented: IAC_experts_labeled_corpus-lemmatized-topic.zip
2. Kaggle augmented: sentiment-analysis-in-russian-lemmatized-topic.zip
3. TengriNews augmented: tengrinews_articles-numword_norm-lemmatized-topic.zip
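The internal layout of the archives is not documented here, so a reasonable first step after downloading one is to list what it contains before deciding how to load it. The following is a minimal sketch using only the Python standard library. The helper name list_archive_members is hypothetical, and the script assumes the archive was downloaded manually into the working directory; it makes no assumption about the file formats inside.

```python
import zipfile
from pathlib import Path


def list_archive_members(archive_path):
    """Return (name, uncompressed size in bytes) for every file in a zip archive."""
    with zipfile.ZipFile(archive_path) as archive:
        return [
            (info.filename, info.file_size)
            for info in archive.infolist()
            if not info.is_dir()  # skip directory entries
        ]


if __name__ == "__main__":
    # Assumption: this archive was downloaded from the dataset page beforehand.
    path = Path("IAC_experts_labeled_corpus-lemmatized-topic.zip")
    if path.exists():
        for name, size in list_archive_members(path):
            print(f"{size:>12}  {name}")
    else:
        print(f"{path} not found; download it from the dataset page first.")
```

The same call works for any of the archives named above; only the file name changes.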

Institutions

Kazakh British Technical University

Categories

Sentiment Analysis, News Collection Service

Licence