Story Forms Detection in Text through Concept-Based Co-Clustering

Sultan Alzahrani; Betul Ceran; Saud Alashri; Scott W Ruston; Steven R Corman; Hasan Davulcu

doi:10.1109/BDCloud-SocialCom-SustainCom.2016.48

Back

Conference proceeding

Story Forms Detection in Text through Concept-Based Co-Clustering

Sultan Alzahrani, Betul Ceran, Saud Alashri, Scott W Ruston, Steven R Corman and Hasan Davulcu

2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking (SocialCom), Sustainable Computing and Communications (SustainCom) (BDCloud-SocialCom-SustainCom), pp.258-265

10/2016

DOI: https://doi.org/10.1109/BDCloud-SocialCom-SustainCom.2016.48

Abstract

Clustering algorithms

Co-clustering

Feature extraction

Matrix decomposition

Merging

Narrative analysis

Non-negative matrix factorization

Semantics

Standards

Story forms

Syntactics

A story is defined as actors taking actions that culminate in resolutions. In this paper, we extract subject - verb - object relationships from paragraphs and generalize them into semantic conceptual representations. Overlapping generalized concepts and relationships correspond to archetypes/targets and actions that characterize story forms. We present an analytic framework which implements co-clustering based on generalized conceptual relationships to automatically detect such story forms. Co-clustering can help in identifying similarities that exist in low-dimensional sub-spaces of sparse data such as textual paragraphs. Through co-clustering, we detect not only the clusters themselves but also their characteristic features which can be useful in describing and summarizing their contents. We perform co-clustering of stories using two different types of features: standard unigrams/bigrams and generalized concepts. We show that the residual error of factorization with concept-based features is significantly lower than the error with standard keyword-based features. Qualitative evaluations also suggest that concept-based features yield more coherent, distinctive and interesting story forms compared to those produced by using standard keyword-based features.

Metrics

1 Record Views

Details

Title: Story Forms Detection in Text through Concept-Based Co-Clustering
Creators - without role: Sultan Alzahrani - Arizona State University
Betul Ceran - Arizona State University
Saud Alashri - Arizona State University
Scott W Ruston - Arizona State University
Steven R Corman - Arizona State University
Hasan Davulcu - Arizona State University
Publication Details: 2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking (SocialCom), Sustainable Computing and Communications (SustainCom) (BDCloud-SocialCom-SustainCom), pp.258-265
Publisher: IEEE
Identifiers: 9918742208331
Academic Unit: King Abdulaziz City for Science & Technology; King Abdulaziz University
Language: English
Resource Type: Conference proceeding