Replay attack detection based on distortion by loudspeaker for voice authentication
文献类型:期刊论文
作者 | Ren, Yanzhen3; Fang, Zhong2; Liu, Dengkai3; Chen, Changwen1 |
刊名 | MULTIMEDIA TOOLS AND APPLICATIONS
![]() |
出版日期 | 2019-04-01 |
卷号 | 78期号:7页码:8383-8396 |
关键词 | Automatic Speaker Verification (ASV) Replay Attack Detection (RAD) Loudspeaker Low-frequency attenuation Spoofing attack |
ISSN号 | 1380-7501 |
DOI | 10.1007/s11042-018-6834-3 |
英文摘要 | Identity authentication based on Automatic Speaker Verification (ASV) has attracted extensive attention. Voice can be used as a substitute of password in many applications. However, the security of current ASV systems has been seriously challenged by many malicious spoofing attacks. Among all those attacks, replay attack is one of the biggest threats to the ASV System, where an adversary can use a pre-recorded speech sample of the legal user to access the ASV system. In this paper, we present a replay attack detection (RAD) scheme to distinguish normal speech and replayed speech. We focus on the distortion caused by loudspeaker: low-frequency attenuation and high-frequency harmonics, and present a suite of RAD features DL-RAD, including Harmonic Energy Ratio (HER), Low Spectral Ratio (LSR), Low Spectral Variance (LSV), and Low Spectral Difference Variance (LSDV), to describe the different characteristics between the normal speech signal and replay speech signal. SVM is adopted as a classifier to evaluate the performance of these features. Experiment results show that the True Positive Rate (TPR), True Negative Rate (TNR) of the proposed method are about 98.15% and 98.75% respectively, which are significantly better than the existing scheme. The proposed scheme can be applied to both text-dependent and text-independent ASV systems. |
资助项目 | Natural Science Foundation of China (NSFC)[U1536114] ; Natural Science Foundation of China (NSFC)[61872275] ; Natural Science Foundation of China (NSFC)[U1536204] ; China Scholarship Council |
WOS研究方向 | Computer Science ; Engineering |
语种 | 英语 |
WOS记录号 | WOS:000466381800028 |
出版者 | SPRINGER |
源URL | [http://119.78.100.204/handle/2XEOYT63/4263] ![]() |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Ren, Yanzhen |
作者单位 | 1.SUNY Buffalo, Buffalo, NY 14260 USA 2.Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China 3.Wuhan Univ, Sch Cyber Sci & Engn, Minist Educ, Key Lab Aerosp Informat Secur & Trusted Comp, Wuhan, Hubei, Peoples R China |
推荐引用方式 GB/T 7714 | Ren, Yanzhen,Fang, Zhong,Liu, Dengkai,et al. Replay attack detection based on distortion by loudspeaker for voice authentication[J]. MULTIMEDIA TOOLS AND APPLICATIONS,2019,78(7):8383-8396. |
APA | Ren, Yanzhen,Fang, Zhong,Liu, Dengkai,&Chen, Changwen.(2019).Replay attack detection based on distortion by loudspeaker for voice authentication.MULTIMEDIA TOOLS AND APPLICATIONS,78(7),8383-8396. |
MLA | Ren, Yanzhen,et al."Replay attack detection based on distortion by loudspeaker for voice authentication".MULTIMEDIA TOOLS AND APPLICATIONS 78.7(2019):8383-8396. |
入库方式: OAI收割
来源:计算技术研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。