Syntactic Enhancement of Killer Road Complaint Tweets Posted on Twitter
The dataset consists of tweets collected in a time span of 4 weeks (from July 18, 2016 till September 13, 2016) posted to two public agency accounts of Government of India: @MORTHIndia (Ministry of Road, Transport, and Highway) and @nitin_gadkari (Union Minister of RTH, India). Step 1 to 6 are the tables pre-processed and enriched using hashtag expansion, spell error correction, sentence segmentation, @username expansion, slang conversion.
Steps to reproduce
mysql -u root -p; enter your password source killer_roads.sql; ## Dataset already includes 'create schema'.