EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training
Document type: Journal article
Authors | Yuxian Gu1,3; Jiaxin Wen1,3; Hao Sun1,3; Yi Song1,3; Pei Ke1,3; Chujie Zheng1,3; Zheng Zhang1,3 |
Journal | Machine Intelligence Research |
Publication date | 2023 |
Volume | 20; Issue: 2; Pages: 207-219 |
Keywords | Natural language processing; deep learning (DL); large-scale pre-training; dialogue systems; Chinese open-domain conversational model |
ISSN | 2731-538X |
DOI | 10.1007/s11633-022-1387-3 |
Abstract | Large-scale pre-training has shown remarkable performance in building open-domain dialogue systems. However, previous works mainly focus on showing and evaluating the conversational performance of the released dialogue model, ignoring the discussion of some key factors towards a powerful human-like chatbot, especially in Chinese scenarios. In this paper, we conduct extensive experiments to investigate these under-explored factors, including data quality control, model architecture designs, training approaches, and decoding strategies. We propose EVA2.0, a large-scale pre-trained open-domain Chinese dialogue model with 2.8 billion parameters, and will make our models and codes publicly available. Automatic and human evaluations show that EVA2.0 significantly outperforms other open-source counterparts. We also discuss the limitations of this work by presenting some failure cases and pose some future research directions on large-scale Chinese open-domain dialogue systems. |
Source URL | [http://ir.ia.ac.cn/handle/173211/55975] |
Collection | Institute of Automation_Academic Journals_International Journal of Automation and Computing
Author affiliations | 1. The Conversational AI Group, Tsinghua University, Beijing 100084, China; 2. Department of Electrical Engineering and Computer Science, York University, Toronto M3J1P3, Canada; 3. Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China |
Recommended citation GB/T 7714 | Yuxian Gu,Jiaxin Wen,Hao Sun,et al. EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training[J]. Machine Intelligence Research,2023,20(2):207-219. |
APA | Yuxian Gu.,Jiaxin Wen.,Hao Sun.,Yi Song.,Pei Ke.,...&Minlie Huang.(2023).EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training.Machine Intelligence Research,20(2),207-219. |
MLA | Yuxian Gu,et al."EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training".Machine Intelligence Research 20.2(2023):207-219. |
Ingestion method: OAI harvesting
Source: Institute of Automation