Syntactic Enhancement of Killer Road Complaint Tweets Posted on Twitter
Published: 22 January 2017| Version 1 | DOI: 10.17632/dm6s252524.1
Contributors:
Swati Agarwal, , Description
The dataset consists of tweets collected in a time span of 4 weeks (from July 18, 2016 till September 13, 2016) posted to two public agency accounts of Government of India: @MORTHIndia (Ministry of Road, Transport, and Highway) and @nitin_gadkari (Union Minister of RTH, India). Step 1 to 6 are the tables pre-processed and enriched using hashtag expansion, spell error correction, sentence segmentation, @username expansion, slang conversion.
Files
Steps to reproduce
mysql -u root -p; enter your password source killer_roads.sql; ## Dataset already includes 'create schema'.
Institutions
Indraprastha Institute of Information Technology Delhi
Categories
Information Retrieval, Social Media, Natural Language Processing, Machine Learning, Government Affair, Text Mining