PT-EN Zomato dataset

Published: 5 August 2024| Version 1 | DOI: 10.17632/448d7kts3k.1
Contributors:
,
,
,
,
,

Description

The PT-EN Zomato Dataset is a set of half a million Portuguese restaurant reviews and approximately 24 thousand English reviews reviews. The dataset has a total of 1192173 rows with information about the restaurant where the review was given and a rating score from 1 to 5 stars. Not all the entries provide a text review. These data was collected by Zomato Portugal (now known as Dig-In) and made available during a founded research. The data corresponds to the period from April 1st, 2014 to September 2nd, 2022. The dataset provides the following columns: review_id; user_pt_id; user_pt_name; user_in_id; user_in_name; text; rating; created_at; res_id; res_name; res_location; res_rating; res_price; res_zomato_id; res_lat; res_lng; res_address_full; res_address_city; res_about; res_group_name; res_group_type; res_priority; res_chain_id; res_menu_id; res_et; res_time_slot; res_company_name; res_seats_exterior; res_seats_interior; res_kitchen_Types; res_infos; res_delivery_Methods; text_language; text_translation_pt; text_translation_en. The last three columns were obtained after data collection, using Google Translate Python API for language detection and text translation.

Files

Steps to reproduce

The data was gathered by Zomato Portugal (no, Dig-In) from their booking website.

Institutions

Agencia Regional para o Desenvolvimento da Investigacao Tecnologia e Inovacao, Universidade da Madeira, Laboratorio de Robotica e Sistemas de Engenharia

Categories

Natural Language Processing, Classification System, Sentiment Analysis

Licence