A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression
文献类型:期刊论文
作者 | Huang, Qingbo1,2; Liu TJ(刘铁军)2![]() |
刊名 | JOURNAL OF THE AUDIO ENGINEERING SOCIETY
![]() |
出版日期 | 2019 |
卷号 | 67期号:12页码:986-993 |
ISSN号 | 1549-4950 |
产权排序 | 1 |
英文摘要 | The high frequency components of the audio signal are often truncated during the encoding processing by a lossy codec. To avoid the sound quality degradation, the high frequency components are reconstructed during the decoding processing. This paper presents a new bandwidth extension method for audio compression. Frequency components of 6.9 -13.8 kHz are added using side information at 2 kbps. A generative neural network in the GAN is used to estimate relationship between the MDCT spectrum in the high frequency part and the low frequency part, and it is evaluated by a discriminant network in the GAN to get a more natural result. On this basis, a codec system is built up. The MUSHRA experiments show that the proposed method is comparable with SBR in HE-AAC. |
WOS关键词 | NARROW-BAND ; SPEECH |
资助项目 | State Key Laboratory of Robotics[2018-O09] ; National Natural Science Foundation of China[61175043] ; National Natural Science Foundation of China[61421062] ; High Performance Computing Platform of Peking University |
WOS研究方向 | Acoustics ; Engineering |
语种 | 英语 |
WOS记录号 | WOS:000505043700007 |
资助机构 | State Key Laboratory of Robotics [2018-O09] ; National Natural Science Foundation of ChinaNational Natural Science Foundation of China [61175043, 61421062] ; High Performance Computing Platform of Peking University |
源URL | [http://ir.sia.cn/handle/173321/26182] ![]() |
专题 | 沈阳自动化研究所_水下机器人研究室 |
通讯作者 | Qu TS(曲天书); Qu TS(曲天书) |
作者单位 | 1.Peking Univ, Speech & Hearing Res Ctr, Minist Educ, Key Lab Machine Percept, Beijing, Peoples R China 2.Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Beijing, Peoples R China 3.Peking Univ, Speech & Hearing Res Ctr, Minist Educ, Key Lab Machine Percept, Beijing, Peoples R China 4.Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Beijing, Peoples R China |
推荐引用方式 GB/T 7714 | Huang, Qingbo,Liu TJ,Wu XH,et al. A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression[J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY,2019,67(12):986-993. |
APA | Huang, Qingbo.,Liu TJ.,Wu XH.,Qu TS.,Huang, Qingbo.,...&Qu TS.(2019).A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression.JOURNAL OF THE AUDIO ENGINEERING SOCIETY,67(12),986-993. |
MLA | Huang, Qingbo,et al."A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression".JOURNAL OF THE AUDIO ENGINEERING SOCIETY 67.12(2019):986-993. |
入库方式: OAI收割
来源:沈阳自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。