中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [15]
声学研究所 [12]
深圳先进技术研究院 [2]
长春光学精密机械与物... [2]
沈阳自动化研究所 [2]
计算技术研究所 [1]
更多
采集方式
OAI收割 [36]
内容类型
学位论文 [18]
期刊论文 [13]
会议论文 [5]
发表日期
2023 [1]
2022 [1]
2021 [3]
2020 [3]
2019 [1]
2018 [1]
更多
学科主题
计算机科学技术::人... [1]
筛选
浏览/检索结果:
共36条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
提交时间升序
提交时间降序
发表日期升序
发表日期降序
题名升序
题名降序
作者升序
作者降序
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition
期刊论文
OAI收割
APPLIED ACOUSTICS, 2023, 卷号: 212, 页码: 10
作者:
Fan, Cunhang
;
Ding, Mingming
;
Yi, Jiangyan
;
Li, Jinpeng
;
Lv, Zhao
  |  
收藏
  |  
浏览/下载:1/0
  |  
提交时间:2023/11/16
Robust end-to-end ASR
Speech enhancement
Masking and mapping
Speech distortion
Deep spectrum fusion
SpecMNet: Spectrum mend network for monaural speech enhancement
期刊论文
OAI收割
APPLIED ACOUSTICS, 2022, 卷号: 194, 页码: 9
作者:
Fan, Cunhang
;
Zhang, Hongmei
;
Yi, Jiangyan
;
Lv, Zhao
;
Tao, Jianhua
  |  
收藏
  |  
浏览/下载:33/0
  |  
提交时间:2022/07/25
Monaural speech enhancement
Speech distortion
Spectrum mend network
SI-SNR
BLSTM
面向鸡尾酒会问题的视觉辅助语音分离算法研究
学位论文
OAI收割
中国科学院自动化研究所: 中国科学院自动化研究所, 2021
作者:
张鹏
  |  
收藏
  |  
浏览/下载:32/0
  |  
提交时间:2021/06/21
鸡尾酒会问题
语音分离
视觉辅助
在线流式处理
生成对抗训练
Exploiting the directional coherence function for multichannel source extraction
期刊论文
OAI收割
SPEECH COMMUNICATION, 2021, 卷号: 128, 页码: 1-14
作者:
Liang, Shan
;
Li, Guanjun
;
Nie, Shuai
;
Yang, ZhanLei
;
Liu, WenJu
  |  
收藏
  |  
浏览/下载:27/0
  |  
提交时间:2021/05/06
Directional coherence function
Coherent-to-Diffuse Ratio
General sidelobe canceller
Desired Speech Presence Probability
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition
期刊论文
OAI收割
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:
Fan, Cunhang
;
Yi, Jiangyan
;
Tao, Jianhua
;
Tian, Zhengkun
;
Liu, Bin
  |  
收藏
  |  
浏览/下载:35/0
  |  
提交时间:2021/03/08
Speech enhancement
Speech recognition
Training
Noise measurement
Logic gates
Acoustic distortion
Task analysis
Gated recurrent fusion
robust end-to-end speech recognition
speech distortion
speech enhancement
speech transformer
Improving speech enhancement by focusing on smaller values using relative loss
期刊论文
OAI收割
IET SIGNAL PROCESSING, 2020, 卷号: 14, 期号: 6, 页码: 374-384
作者:
Li, Hongfeng
;
Xu, Yanyan
;
Ke, Dengfeng
;
Su, Kaile
  |  
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2020/09/07
speech enhancement
speech intelligibility
performance evaluation
learning (artificial intelligence)
neural nets
absolute differences
speech quality
relative loss
single-channel speech enhancement
noisy speech
ideal ratio mask
phase-sensitive mask
mean square error
loss function
absolute error values
magnitude spectra
deep learning
clean speech recovery
short-time objective intelligibility
signal-to-distortion ratio
segmental signal-to-noise ratio
performance evaluation
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features
期刊论文
OAI收割
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 卷号: 28, 页码: 1303-1314
作者:
Fan, Cunhang
;
Tao, Jianhua
;
Liu, Bin
;
Yi, Jiangyan
;
Wen, Zhengqi
  |  
收藏
  |  
浏览/下载:51/0
  |  
提交时间:2020/06/22
Feature extraction
Training
Interference
Speech enhancement
Clustering algorithms
Spectrogram
Speech separation
end-to-end post-filter
deep attention fusion features
deep clustering
permutation invariant training
Modeling of Individual HRTFs Based on Spatial Principal Component Analysis
期刊论文
OAI收割
IEEE/ACM Transactions on Audio Speech and Language Processing, 2020, 卷号: 28, 页码: 785-797
作者:
Zhang, Mengfan
;
Ge, Zhongshu
  |  
收藏
  |  
浏览/下载:19/0
  |  
提交时间:2020/03/01
Anthropometric parameters
HRTF
individual
SPCA
Replay attack detection based on distortion by loudspeaker for voice authentication
期刊论文
OAI收割
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 卷号: 78, 期号: 7, 页码: 8383-8396
作者:
Ren, Yanzhen
;
Fang, Zhong
;
Liu, Dengkai
;
Chen, Changwen
  |  
收藏
  |  
浏览/下载:40/0
  |  
提交时间:2019/08/16
Automatic Speaker Verification (ASV)
Replay Attack Detection (RAD)
Loudspeaker
Low-frequency attenuation
Spoofing attack
Cbldnn-based Speaker-independent Speech Separation Via Generative Adversarial Training
会议论文
OAI收割
Calgary, 2020-4
作者:
Li, Chenxing
;
Zhu, Lei
;
Xu, Shuang
;
Gao, Peng
;
Xu, Bo
  |  
收藏
  |  
浏览/下载:9/0
  |  
提交时间:2020/07/21