A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression
文献类型:期刊论文
| 作者 | Huang, Qingbo1,2; Liu TJ(刘铁军)2 ; Wu XH(吴玺宏)1; Qu TS(曲天书)1; Huang, Qingbo3,4; Liu TJ(刘铁军)4; Wu XH(吴玺宏)3; Qu TS(曲天书)3
|
| 刊名 | JOURNAL OF THE AUDIO ENGINEERING SOCIETY
![]() |
| 出版日期 | 2019 |
| 卷号 | 67期号:12页码:986-993 |
| ISSN号 | 1549-4950 |
| 产权排序 | 1 |
| 英文摘要 | The high frequency components of the audio signal are often truncated during the encoding processing by a lossy codec. To avoid the sound quality degradation, the high frequency components are reconstructed during the decoding processing. This paper presents a new bandwidth extension method for audio compression. Frequency components of 6.9 -13.8 kHz are added using side information at 2 kbps. A generative neural network in the GAN is used to estimate relationship between the MDCT spectrum in the high frequency part and the low frequency part, and it is evaluated by a discriminant network in the GAN to get a more natural result. On this basis, a codec system is built up. The MUSHRA experiments show that the proposed method is comparable with SBR in HE-AAC. |
| WOS关键词 | NARROW-BAND ; SPEECH |
| 资助项目 | State Key Laboratory of Robotics[2018-O09] ; National Natural Science Foundation of China[61175043] ; National Natural Science Foundation of China[61421062] ; High Performance Computing Platform of Peking University |
| WOS研究方向 | Acoustics ; Engineering |
| 语种 | 英语 |
| WOS记录号 | WOS:000505043700007 |
| 资助机构 | State Key Laboratory of Robotics [2018-O09] ; National Natural Science Foundation of ChinaNational Natural Science Foundation of China [61175043, 61421062] ; High Performance Computing Platform of Peking University |
| 源URL | [http://ir.sia.cn/handle/173321/26182] ![]() |
| 专题 | 沈阳自动化研究所_水下机器人研究室 |
| 通讯作者 | Qu TS(曲天书); Qu TS(曲天书) |
| 作者单位 | 1.Peking Univ, Speech & Hearing Res Ctr, Minist Educ, Key Lab Machine Percept, Beijing, Peoples R China 2.Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Beijing, Peoples R China 3.Peking Univ, Speech & Hearing Res Ctr, Minist Educ, Key Lab Machine Percept, Beijing, Peoples R China 4.Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Beijing, Peoples R China |
| 推荐引用方式 GB/T 7714 | Huang, Qingbo,Liu TJ,Wu XH,et al. A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression[J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY,2019,67(12):986-993. |
| APA | Huang, Qingbo.,Liu TJ.,Wu XH.,Qu TS.,Huang, Qingbo.,...&Qu TS.(2019).A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression.JOURNAL OF THE AUDIO ENGINEERING SOCIETY,67(12),986-993. |
| MLA | Huang, Qingbo,et al."A Generative Adversarial Net-Based Bandwidth Extension Method for Audio Compression".JOURNAL OF THE AUDIO ENGINEERING SOCIETY 67.12(2019):986-993. |
入库方式: OAI收割
来源:沈阳自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


