Chinese Academy of Sciences Institutional Repositories Grid (CAS IR Grid): Replay attack detection based on distortion by loudspeaker for voice authentication


Document type: Journal article


Authors	Ren, Yanzhen (3); Fang, Zhong (2); Liu, Dengkai (3); Chen, Changwen (1)
Journal	MULTIMEDIA TOOLS AND APPLICATIONS
Publication date	2019-04-01
Volume	78; Issue: 7; Pages: 8383-8396
Keywords	Automatic Speaker Verification (ASV); Replay Attack Detection (RAD); Loudspeaker; Low-frequency attenuation; Spoofing attack
ISSN	1380-7501
DOI	10.1007/s11042-018-6834-3
Abstract	Identity authentication based on Automatic Speaker Verification (ASV) has attracted extensive attention, and voice can be used as a substitute for a password in many applications. However, the security of current ASV systems has been seriously challenged by many malicious spoofing attacks. Among these attacks, the replay attack is one of the biggest threats to an ASV system: an adversary uses a pre-recorded speech sample of the legitimate user to gain access. In this paper, we present a replay attack detection (RAD) scheme to distinguish normal speech from replayed speech. We focus on the distortion caused by the loudspeaker, namely low-frequency attenuation and high-frequency harmonics, and present a suite of RAD features, DL-RAD, comprising Harmonic Energy Ratio (HER), Low Spectral Ratio (LSR), Low Spectral Variance (LSV), and Low Spectral Difference Variance (LSDV), to describe the differing characteristics of normal and replayed speech signals. An SVM is adopted as the classifier to evaluate the performance of these features. Experimental results show that the True Positive Rate (TPR) and True Negative Rate (TNR) of the proposed method are about 98.15% and 98.75% respectively, which are significantly better than those of the existing scheme. The proposed scheme can be applied to both text-dependent and text-independent ASV systems.
Funding	Natural Science Foundation of China (NSFC)[U1536114] ; Natural Science Foundation of China (NSFC)[61872275] ; Natural Science Foundation of China (NSFC)[U1536204] ; China Scholarship Council
WOS research areas	Computer Science ; Engineering
Language	English
WOS accession number	WOS:000466381800028
Publisher	SPRINGER
Source URL	[http://119.78.100.204/handle/2XEOYT63/4263]
Collection	Institute of Computing Technology, Chinese Academy of Sciences: Journal Articles (English)
Corresponding author	Ren, Yanzhen
Author affiliations	1. SUNY Buffalo, Buffalo, NY 14260 USA 2. Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China 3. Wuhan Univ, Sch Cyber Sci & Engn, Minist Educ, Key Lab Aerosp Informat Secur & Trusted Comp, Wuhan, Hubei, Peoples R China
Recommended citation, GB/T 7714	Ren, Yanzhen,Fang, Zhong,Liu, Dengkai,et al. Replay attack detection based on distortion by loudspeaker for voice authentication[J]. MULTIMEDIA TOOLS AND APPLICATIONS,2019,78(7):8383-8396.
APA	Ren, Yanzhen,Fang, Zhong,Liu, Dengkai,&Chen, Changwen.(2019).Replay attack detection based on distortion by loudspeaker for voice authentication.MULTIMEDIA TOOLS AND APPLICATIONS,78(7),8383-8396.
MLA	Ren, Yanzhen,et al."Replay attack detection based on distortion by loudspeaker for voice authentication".MULTIMEDIA TOOLS AND APPLICATIONS 78.7(2019):8383-8396.
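
The abstract attributes replayed speech to loudspeaker distortion, in particular low-frequency attenuation. As a rough illustration of that idea only, the sketch below measures the fraction of a frame's spectral energy that lies at or below a cutoff frequency. It is not the paper's LSR, LSV, LSDV, or HER definition, which this record does not give; the function names, the 500 Hz cutoff, the 8 kHz sample rate, and the synthetic two-tone frames are all hypothetical choices made for the example.

```python
import cmath
import math


def power_spectrum(frame):
    """One-sided power spectrum of a real-valued frame, via a direct DFT."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = 0j
        for t, x in enumerate(frame):
            acc += x * cmath.exp(-2j * math.pi * k * t / n)
        spectrum.append(abs(acc) ** 2)
    return spectrum


def low_band_energy_ratio(frame, sample_rate, cutoff_hz):
    """Fraction of spectral energy at or below cutoff_hz.

    Hypothetical illustrative feature, not the LSR defined in the paper.
    """
    spectrum = power_spectrum(frame)
    bin_hz = sample_rate / len(frame)  # frequency spacing between DFT bins
    total = sum(spectrum)
    if total == 0.0:
        return 0.0  # silent frame: no energy to apportion
    low = sum(p for k, p in enumerate(spectrum) if k * bin_hz <= cutoff_hz)
    return low / total


def tone(freq_hz, sample_rate, n, amplitude=1.0):
    """Synthetic sine frame used as stand-in test data."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * t / sample_rate)
            for t in range(n)]


# Assumed toy setup: 8 kHz sampling, 256-sample frames, 500 Hz cutoff.
FS, N, CUTOFF = 8000, 256, 500.0

# "Genuine" stand-in: equal-amplitude 125 Hz and 2000 Hz components.
genuine = [a + b for a, b in zip(tone(125, FS, N), tone(2000, FS, N))]

# "Replayed" stand-in: the 125 Hz component is halved in amplitude,
# mimicking a loudspeaker that attenuates low frequencies.
replayed = [a + b for a, b in zip(tone(125, FS, N, 0.5), tone(2000, FS, N))]

genuine_ratio = low_band_energy_ratio(genuine, FS, CUTOFF)    # about 0.5
replayed_ratio = low_band_energy_ratio(replayed, FS, CUTOFF)  # about 0.2
```

On these synthetic frames the attenuated signal yields the lower ratio. A real detector along the lines of the abstract would compute several such features on recorded speech and pass them to a trained classifier such as an SVM; the direct DFT here is chosen only to keep the sketch dependency-free, and an FFT routine would be the usual choice for real audio.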

Ingest method: OAI harvesting

Source: Institute of Computing Technology
