Query-Based Automatic Training Set Selection for Microblog Retrieval

Khaled Albishre; Yuefeng Li; Yue Xu

doi:10.1007/978-3-319-93037-4_26

Back

Query-Based Automatic Training Set Selection for Microblog Retrieval

Conference proceeding

Peer reviewed

Query-Based Automatic Training Set Selection for Microblog Retrieval

Khaled Albishre, Yuefeng Li and Yue Xu

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT II, Vol.10938, pp.325-336

Lecture Notes in Artificial Intelligence

01/01/2018

DOI: https://doi.org/10.1007/978-3-319-93037-4_26

Abstract

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Information Systems

Computer Science, Theory & Methods

Science & Technology

Technology

Typical pseudo-relevance feedback models assume that the first-pass documents are the most relevant and use those documents to select feedback terms for query expansion. In real applications, however, short documents, such as microblogs, may not have enough information about the searched topic, thus increasing the chance that irrelevant documents will be included in the initial set of retrieved documents. This situation reduces a feedback model's ability to capture information that is relevant to users' needs, which makes determining the best documents for relevant feedback without requiring extra effort from the user a critical challenge. In this paper, we propose an innovative mechanism to automatically select useful feedback documents using a topic modeling technique to improve the effectiveness of pseudo-relevance feedback models. The main idea behind the proposed model is to discover the latent topics in the top-ranked documents that allow for the exploitation of the correlation between terms in relevant topics. To capture discriminative terms for query expansion, we incorporated topical features into a relevance model that focuses on the temporal information in the selected set of documents. Experimental results on TREC 2011-2013 microblog datasets illustrate that the proposed model significantly outperforms all state-of-the-art baseline models.

Metrics

1 Record Views

Details

Title: Query-Based Automatic Training Set Selection for Microblog Retrieval
Creators - without role: Khaled Albishre - Queensland University of Technology
Yuefeng Li - Queensland University of Technology
Yue Xu - Queensland University of Technology
Contributors - without role: D Phung
V S Tseng
G I Webb
B Ho
M Ganji
L Rashidi
Publication Details: ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT II, Vol.10938, pp.325-336
Series: Lecture Notes in Artificial Intelligence
Publisher: Springer Nature
Number of pages: 12
Identifiers: 9930823408331
Academic Unit: Umm Al Qura University
Language: English
Resource Type: Conference proceeding