Dataset_Customer_Review_Summarization

Published: 17 October 2025| Version 1 | DOI: 10.17632/mmgpv27xs2.1
Contributor:
azani cempaka sari

Description

This dataset supports the study "Graph-Based Text Summarization using Graph Convolutional Network for Indonesian Product Reviews." It contains raw and processed customer review data in the Indonesian language collected from e-commerce platforms. The dataset aims to facilitate research in graph-based natural language processing (NLP), specifically for developing and evaluating Graph Convolutional Network (GCN) models for automatic text summarization.

Files

Steps to reproduce

The dataset is provided as a ZIP archive titled Dataset_CustomerReviewSummarization.zip, containing multiple folders that represent each stage of the data preparation and analysis pipeline for customer review summarization. The folder hierarchy is as follows: Dataset_CustomerReviewSummarization.zip - 1_Raw_Data - 2_Cleaned_Data - 3_GCN_Summarise_Resut - 4_Cosine_Similarity_with_Expert_Summary

Institutions

  • Bina Nusantara University

Categories

Natural Language Processing, Text Processing, Text Mining, Graph Neural Network

Licence