Chinese Academy of Sciences Institutional Repositories Grid (中国科学院机构知识库网格)
Browse/search results: 37 records in total, showing records 1-10
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments
Journal article
OAI harvest
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, Volume: 32, Pages: 151-163
Authors: Xu, Jiaming; Cui, Jian; Hao, Yunzhe; Xu, Bo
Views/Downloads: 2/0 | Submitted: 2024/02/22
Keywords: Cocktail party problem; target speaker separation; multi-cue guided separation; semi-supervised learning
Visually Guided Sound Source Separation With Audio-Visual Predictive Coding
Journal article
OAI harvest
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 15
Authors: Song, Zengjie; Zhang, Zhaoxiang
Views/Downloads: 6/0 | Submitted: 2023/11/17
Keywords: Feature fusion; multimodal learning; predictive coding (PC); self-supervised learning; sound source separation
A Trained Humanoid Robot can Perform Human-Like Crossmodal Social Attention and Conflict Resolution
Journal article
OAI harvest
International Journal of Social Robotics, 2023
Authors: Di Fu; Fares Abawi; Hugo Carneiro; Matthias Kerzel; Ziwei Chen
Views/Downloads: 11/0 | Submitted: 2023/04/17
Keywords: Crossmodal social attention; Eye gaze; Conflict processing; Saliency prediction model; iCub robot
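The Influence of Visual Motion-in-Depth Information on the Perception of Sound Motion Direction (视觉深度运动信息对声音运动方向感知的影响)
Thesis
OAI harvest
Institute of Psychology, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2022
Author: 臧奋英 (Zang Fenying)
Views/Downloads: 30/0 | Submitted: 2022/09/09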
Keywords: depth information (深度信息); motion perception (运动知觉); looming bias (渐近偏差); multisensory integration (多通道整合)
Online Audio-Visual Speech Separation with Generative Adversarial Training
Conference paper
OAI harvest
Online conference, 2021-4-23
Authors: Zhang Peng; Xu Jiaming; Hao Yunzhe; Xu Bo
Views/Downloads: 53/0 | Submitted: 2021/06/21
Keywords: audio-visual speech separation; online processing; generative adversarial training; causal temporal convolutional network
Changes in delta and theta oscillations in the brain indicate dynamic switching of attention between internal and external processing
Conference paper
OAI harvest
Xi'an, China, May 21-23, 2021
Authors: Yuying Jiang; Haoran Zhang; Shan Yu
Views/Downloads: 15/0 | Submitted: 2021/06/16
Happy Emotion Recognition From Unconstrained Videos Using 3D Hybrid Deep Features
Journal article
OAI harvest
IEEE ACCESS, 2021, Volume: 9, Pages: 35524-35538
Authors: Samadiani, Najmeh; Huang, Guangyan; Hu, Yu; Li, Xiaowei
Views/Downloads: 35/0 | Submitted: 2021/12/01
Keywords: Feature extraction; Emotion recognition; Face recognition; Videos; Three-dimensional displays; Long short term memory; Visualization; Facial landmarks; facial expression recognition; long short term memory; multi-layer neural networks; happy emotion recognition
Bridging Text and Video: A Universal Multimodal Transformer for Audio-Visual Scene-Aware Dialog
Journal article
OAI harvest
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 2476-2483
Authors: Li, Zekang; Li, Zongjia; Zhang, Jinchao; Feng, Yang; Zhou, Jie
Views/Downloads: 46/0 | Submitted: 2021/12/01
Keywords: Task analysis; Feature extraction; Visualization; Speech processing; History; Social networking (online); Pattern recognition; Dialogue System; Multimodal; Natural Language Processing; Video Understanding
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video
Journal article
OAI harvest
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, Volume: 31, Issue: 5, Pages: 996-1009
Authors: Li, Haoran; Zhu, Junnan; Ma, Cong; Zhang, Jiajun; Zong, Chengqing
Views/Downloads: 83/0 | Submitted: 2019/07/12
Keywords: Summarization; multimedia; multi-modal; cross-modal; natural language processing; computer vision
Modeling implicit learning in a cross-modal audio-visual serial reaction time task
Journal article
OAI harvest
COGNITIVE SYSTEMS RESEARCH, 2019, Volume: 54, Pages: 154-164
Authors: Taesler, Philipp; Jablonowski, Julia; Fu, Qiufang; Rose, Michael
Views/Downloads: 64/0 | Submitted: 2019/01/08
Keywords: Implicit learning; Cross-modal; Modeling; Serial reaction time task; Audio-visual