A Hybrid Chinese Conversation model based on retrieval and generation

Tinghuai Ma; Huimin Yang; Qing Tian; Yuan Tian; Najla Al-Nabhan

doi:10.1016/j.future.2020.08.030

Back

A Hybrid Chinese Conversation model based on retrieval and generation

Journal article

Peer reviewed

A Hybrid Chinese Conversation model based on retrieval and generation

Tinghuai Ma, Huimin Yang, Qing Tian, Yuan Tian and Najla Al-Nabhan

Future generation computer systems, Vol.114, pp.481-490

01/2021

DOI: https://doi.org/10.1016/j.future.2020.08.030

Abstract

Conversation model

Generation model

Post-reranking

Retrieval model

Social computing

Conversation generation is an important natural language processing task and has attracted much attention in recent years. The realization of the conversation model is also of great significance to the field of social computing, helping to build artificial intelligence robots on social networks. The open domain conversation model is fundamentally data-driven, which can be roughly divided into retrieval models and generation models. Although remarkable progress has been achieved in recent years, it is still difficult to get responses that are grammatically and semantically appropriate. We propose the Rerank of Retrieval-based and Transformer-based Conversation model (RRT), a novel conversation model that combines the retrieval model with the generation model for the purpose of obtaining context–appropriate response. The context–response pairs with the highest similarity from training set are retrieved using traditional retrieval method, and further ranked to obtain the retrieval candidate response. We replaced the traditional sequence-to-sequence models for conversation generation by the transformer model and achieved better results with less training time. Finally, the post-reranking module is used to rank the retrieved candidate and the generated one to obtain the final response. We conducted detailed experiments on two datasets and the results show that compared with the traditional generation model, our model has a significant improvement in each metric, and the training time is decreased by a factor of 5. Furthermore, our model is more informative and relevant to the input context than the retrieval model. •Propose a novel conversation model that Combines the retrieval model with the generation model.•Apply an effective method of pre-retrieval and then sorting to retrieve response.•Apply transformer to the response generation model.•Use post-reranking to choose the most appropriate response for the input context.

Metrics

1 Record Views

Details

Title: A Hybrid Chinese Conversation model based on retrieval and generation
Creators - without role: Tinghuai Ma - Nanjing University of Information Science and Technology
Huimin Yang - Nanjing University of Information Science and Technology
Qing Tian - Nanjing University of Information Science and Technology
Yuan Tian - Nanjing Institute of Technology
Najla Al-Nabhan - King Saud University
Publication Details: Future generation computer systems, Vol.114, pp.481-490
Publisher: Elsevier B.V
Identifiers: 9946435008331
Academic Unit: King Saud University
Language: English
Resource Type: Journal article