Post Summarization of Microblogs of Sporting Events

Mehreen Gillani; Muhammad U. Ilyas; Saad Saleh; Jalal S. Alowibdi; Naif Aljohani; Fahad S. Alotaibi; ASSOC COMP MACHINERY

doi:10.1145/3038912.3038914

Conference proceeding

Post Summarization of Microblogs of Sporting Events

Mehreen Gillani, Muhammad U. Ilyas, Saad Saleh, Jalal S. Alowibdi, Naif Aljohani, Fahad S. Alotaibi and ASSOC COMP MACHINERY

WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, pp.59-68

01/01/2017

DOI: https://doi.org/10.1145/3038912.3038914

Abstract

Computer Science

Computer Science, Information Systems

Computer Science, Interdisciplinary Applications

Computer Science, Software Engineering

Computer Science, Theory & Methods

Science & Technology

Technology

Every day 645 million Twitter users generate approximately 58 million tweets. This motivates the question if it is possible to generate a summary of events from this rich set of tweets only. Key challenges in post summarization from microblog posts include circumnavigating spam and conversational posts. In this study, we present a novel technique called lexi-temporal clustering (LTC), which identifies key events. LTC uses k-means clustering and we explore the use of various distance measures for clustering using Euclidean, cosine similarity and Manhattan distance. We collected three original data sets consisting of Twitter microblog posts covering sporting events, consisting of a cricket and two football matches. The match summaries generated by LTC were compared against standard summaries taken from sports sections of various news outlets, which yielded up to 81% precision, 58% recall and 62% F-measure on different data sets. In addition, we also report results of all three variants of the recall-oriented understudy for gisting evaluation (ROUGE) software, a tool which compares and scores automatically generated summaries against standard summaries.

Metrics

1 Record Views

See more details

Details

Title: Post Summarization of Microblogs of Sporting Events
Creators - without role: Mehreen Gillani - Natl Univ Sci & Technol, Islamabad, Pakistan
Muhammad U. Ilyas - Univ Jeddah, Jeddah, Saudi Arabia
Saad Saleh - Natl Univ Sci & Technol, Islamabad, Pakistan
Jalal S. Alowibdi - Univ Jeddah, Jeddah, Saudi Arabia
Naif Aljohani - King Abdulaziz University
Fahad S. Alotaibi - King Abdulaziz University
ASSOC COMP MACHINERY
Publication Details: WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, pp.59-68
Publisher: Assoc Computing Machinery
Number of pages: 10
Identifiers: 9933477708331
Academic Unit: University of Jeddah; King Abdulaziz University; King Saud University
Language: English
Resource Type: Conference proceeding