中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training

文献类型:期刊论文

作者Yuxian Gu1,3; Jiaxin Wen1,3; Hao Sun1,3; Yi Song1,3; Pei Ke1,3; Chujie Zheng1,3; Zheng Zhang1,3; Jianzhu Yao3; Lei Liu2; Xiaoyan Zhu1,3
刊名Machine Intelligence Research
出版日期2023
卷号20期号:2页码:207-219
关键词Natural language processing deep learning (DL) large-scale pre-training dialogue systems Chinese open-domain conversational model
ISSN号2731-538X
DOI10.1007/s11633-022-1387-3
英文摘要Large-scale pre-training has shown remarkable performance in building open-domain dialogue systems. However, previous works mainly focus on showing and evaluating the conversational performance of the released dialogue model, ignoring the discussion of some key factors towards a powerful human-like chatbot, especially in Chinese scenarios. In this paper, we conduct extensive experiments to investigate these under-explored factors, including data quality control, model architecture designs, training approaches, and decoding strategies. We propose EVA2.0, a large-scale pre-trained open-domain Chinese dialogue model with 2.8 billion parameters, and will make our models and codes publicly available. Automatic and human evaluations show that EVA2.0 significantly outperforms other open-source counterparts. We also discuss the limitations of this work by presenting some failure cases and pose some future research directions on large-scale Chinese open-domain dialogue systems.
源URL[http://ir.ia.ac.cn/handle/173211/55975]  
专题自动化研究所_学术期刊_International Journal of Automation and Computing
作者单位1.The Conversational AI Group, Tsinghua University, Beijing 100084, China
2.Department of Electrical Engineering and Computer Science, York University, Toronto M3J1P3, Canada
3.Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
推荐引用方式
GB/T 7714
Yuxian Gu,Jiaxin Wen,Hao Sun,et al. EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training[J]. Machine Intelligence Research,2023,20(2):207-219.
APA Yuxian Gu.,Jiaxin Wen.,Hao Sun.,Yi Song.,Pei Ke.,...&Minlie Huang.(2023).EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training.Machine Intelligence Research,20(2),207-219.
MLA Yuxian Gu,et al."EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training".Machine Intelligence Research 20.2(2023):207-219.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。