中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression

文献类型:期刊论文

作者Huang, Qingbo1,2; Liu TJ(刘铁军)2; Wu XH(吴玺宏)1; Qu TS(曲天书)1; Huang, Qingbo3,4; Liu TJ(刘铁军)4; Wu XH(吴玺宏)3; Qu TS(曲天书)3
刊名JOURNAL OF THE AUDIO ENGINEERING SOCIETY
出版日期2019
卷号67期号:12页码:986-993
ISSN号1549-4950
产权排序1
英文摘要

The high frequency components of the audio signal are often truncated during the encoding processing by a lossy codec. To avoid the sound quality degradation, the high frequency components are reconstructed during the decoding processing. This paper presents a new bandwidth extension method for audio compression. Frequency components of 6.9 -13.8 kHz are added using side information at 2 kbps. A generative neural network in the GAN is used to estimate relationship between the MDCT spectrum in the high frequency part and the low frequency part, and it is evaluated by a discriminant network in the GAN to get a more natural result. On this basis, a codec system is built up. The MUSHRA experiments show that the proposed method is comparable with SBR in HE-AAC.

WOS关键词NARROW-BAND ; SPEECH
资助项目State Key Laboratory of Robotics[2018-O09] ; National Natural Science Foundation of China[61175043] ; National Natural Science Foundation of China[61421062] ; High Performance Computing Platform of Peking University
WOS研究方向Acoustics ; Engineering
语种英语
WOS记录号WOS:000505043700007
资助机构State Key Laboratory of Robotics [2018-O09] ; National Natural Science Foundation of ChinaNational Natural Science Foundation of China [61175043, 61421062] ; High Performance Computing Platform of Peking University
源URL[http://ir.sia.cn/handle/173321/26182]  
专题沈阳自动化研究所_水下机器人研究室
通讯作者Qu TS(曲天书); Qu TS(曲天书)
作者单位1.Peking Univ, Speech & Hearing Res Ctr, Minist Educ, Key Lab Machine Percept, Beijing, Peoples R China
2.Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Beijing, Peoples R China
3.Peking Univ, Speech & Hearing Res Ctr, Minist Educ, Key Lab Machine Percept, Beijing, Peoples R China
4.Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Beijing, Peoples R China
推荐引用方式
GB/T 7714
Huang, Qingbo,Liu TJ,Wu XH,et al. A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression[J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY,2019,67(12):986-993.
APA Huang, Qingbo.,Liu TJ.,Wu XH.,Qu TS.,Huang, Qingbo.,...&Qu TS.(2019).A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression.JOURNAL OF THE AUDIO ENGINEERING SOCIETY,67(12),986-993.
MLA Huang, Qingbo,et al."A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression".JOURNAL OF THE AUDIO ENGINEERING SOCIETY 67.12(2019):986-993.

入库方式: OAI收割

来源:沈阳自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。