Chinese Academy of Sciences Institutional Repositories Grid (CAS IR Grid): Explainability for Large Language Models: A Survey

Explainability for Large Language Models: A Survey

Document type: Journal article


Authors	Zhao, Haiyan 2; Chen, Hanjie 3; Yang, Fan 4; Liu, Ninghao 5; Deng, Huiqi 6; Cai, Hengyi 7; Wang, Shuaiqiang 1; Yin, Dawei 1; Du, Mengnan 2
Journal	ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY
Publication date	2024-04-01
Volume	15 Issue: 2 Pages: 38
Keywords	Explainability; interpretability; large language models
ISSN	2157-6904
DOI	10.1145/3639372
Abstract	Large language models (LLMs) have demonstrated impressive capabilities in natural language processing. However, their internal mechanisms are still unclear and this lack of transparency poses unwanted risks for downstream applications. Therefore, understanding and explaining these models is crucial for elucidating their behaviors, limitations, and social impacts. In this article, we introduce a taxonomy of explainability techniques and provide a structured overview of methods for explaining Transformer-based language models. We categorize techniques based on the training paradigms of LLMs: traditional fine-tuning-based paradigm and prompting-based paradigm. For each paradigm, we summarize the goals and dominant approaches for generating local explanations of individual predictions and global explanations of overall model knowledge. We also discuss metrics for evaluating generated explanations and discuss how explanations can be leveraged to debug models and improve performance. Lastly, we examine key challenges and emerging opportunities for explanation techniques in the era of LLMs in comparison to conventional deep learning models.
WOS research area	Computer Science
Language	English
WOS accession number	WOS:001208775700001
Publisher	ASSOC COMPUTING MACHINERY
Source URL	[http://119.78.100.204/handle/2XEOYT63/39000]
Collection	Institute of Computing Technology, Chinese Academy of Sciences: Journal Articles (English)
Corresponding author	Zhao, Haiyan
Author affiliations	1. 10 Shangdi 10th St, Beijing 100085, Peoples R China; 2. New Jersey Inst Technol, 323 Dr Martin Luther King Jr Blvd, Newark, NJ 07102 USA; 3. Johns Hopkins Univ, 3400 N Charles St, Baltimore, MD 21218 USA; 4. Wake Forest Univ, 1834 Wake Forest Rd, Winston Salem, NC 27109 USA; 5. Univ Georgia, Herty Dr, Athens, GA 30602 USA; 6. Shanghai Jiao Tong Univ, 800 Dongchuan RD, Shanghai 200240, Peoples R China; 7. Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
Recommended citation (GB/T 7714)	Zhao, Haiyan,Chen, Hanjie,Yang, Fan,et al. Explainability for Large Language Models: A Survey[J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY,2024,15(2):38.
APA	Zhao, Haiyan.,Chen, Hanjie.,Yang, Fan.,Liu, Ninghao.,Deng, Huiqi.,...&Du, Mengnan.(2024).Explainability for Large Language Models: A Survey.ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY,15(2),38.
MLA	Zhao, Haiyan,et al."Explainability for Large Language Models: A Survey".ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY 15.2(2024):38.

Ingest method: OAI harvesting

Source: Institute of Computing Technology
