中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [12]
心理研究所 [3]
近代物理研究所 [2]
计算技术研究所 [1]
采集方式
OAI收割 [18]
内容类型
期刊论文 [14]
会议论文 [3]
学位论文 [1]
发表日期
2024 [4]
2023 [2]
2022 [3]
2021 [3]
2019 [1]
2018 [2]
更多
学科主题
筛选
浏览/检索结果:
共18条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
提交时间升序
提交时间降序
作者升序
作者降序
发表日期升序
发表日期降序
Temporal-Semantic Aligning and Reasoning Transformer for Audio-Visual Zero-Shot Learning
期刊论文
OAI收割
MATHEMATICS, 2024, 卷号: 12, 期号: 14, 页码: 16
作者:
Zhang, Kaiwen
;
Zhao, Kunchen
;
Tian, Yunong
  |  
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2024/09/09
audio-visual zero-shot learning
transformer
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene
期刊论文
OAI收割
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:
You, Sisi
;
Zuo, Yukun
;
Yao, Hantao
;
Xu, Changsheng
  |  
收藏
  |  
浏览/下载:29/0
  |  
提交时间:2023/12/21
Cross-modal audio-visual fusion
incremental learning
person recognition
elastic weight consolidation
feature replay
Attribute-Guided Cross-Modal Interaction and Enhancement for Audio-Visual Matching
期刊论文
OAI收割
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 卷号: 19, 页码: 4986-4998
作者:
Wang, Jiaxiang
;
Zheng, Aihua
;
Yan, Yan
;
He, Ran
;
Tang, Jin
  |  
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2024/07/03
Audio-visual cross-modal matching
attribute-guided cross-modal interaction
attribute-guided cross-modal enhancement
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art
期刊论文
OAI收割
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:
Mengting Liu
;
Ying Zhou
;
Yuwei Wu
;
Feng Gao
  |  
收藏
  |  
浏览/下载:9/0
  |  
提交时间:2024/04/23
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation
Emotion-Aware Music Driven Movie Montage
期刊论文
OAI收割
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 3, 页码: 540-553
作者:
Liu, Wu-Qin
;
Lin, Min-Xuan
;
Huang, Hai-Bin
;
Ma, Chong-Yang
;
Song, Yu
  |  
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2023/12/21
movie montage
emotion analysis
audio-visual modality
contrastive learning
Semantic and Relation Modulation for Audio-Visual Event Localization
期刊论文
OAI收割
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 6, 页码: 7711-7725
作者:
Wang, Hao
;
Zha, Zheng-Jun
;
Li, Liang
;
Chen, Xuejin
;
Luo, Jiebo
  |  
收藏
  |  
浏览/下载:17/0
  |  
提交时间:2023/12/04
Visualization
Location awareness
Correlation
Proposals
Semantics
Task analysis
Modulation
Audio-visual learning
event localization
normalization
Integrative interaction of emotional speech in audio-visual modality
期刊论文
OAI收割
FRONTIERS IN NEUROSCIENCE, 2022, 卷号: 16, 页码: 13
作者:
Dong, Haibin
;
Li, Na
;
Fan, Lingzhong
;
Wei, Jianguo
;
Xu, Junhai
  |  
收藏
  |  
浏览/下载:19/0
  |  
提交时间:2023/03/20
audio-visual integration
emotional speech
fMRI
left insula
weighted RSA
VAG: A Uniform Model for Cross-Modal Visual-Audio Mutual Generation
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 13
作者:
Hao, Wangli
;
Guan, He
;
Zhang, Zhaoxiang
  |  
收藏
  |  
浏览/下载:38/0
  |  
提交时间:2022/06/10
Task analysis
Instruments
Visualization
Image reconstruction
Generators
Decoding
Generative adversarial networks
Cross modality
cross-modal generation
mutual generation
visual and audio
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching
期刊论文
OAI收割
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 338-351
作者:
Zheng, Aihua
;
Hu, Menglan
;
Jiang, Bo
;
Huang, Yan
;
Yan, Yan
  |  
收藏
  |  
浏览/下载:36/0
  |  
提交时间:2022/03/17
Visualization
Task analysis
Measurement
Speech recognition
Videos
Location awareness
Image recognition
Adversarial learning
audio-visual matching
cross-modal learning
metric learning
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training
会议论文
OAI收割
线上会议, 2021-7-18
作者:
Zhang Peng
;
Xu Jiaming
;
Shi Jing
;
Hao Yunzhe
;
Qin Lei
  |  
收藏
  |  
浏览/下载:34/0
  |  
提交时间:2021/06/21
audio-visual speech separation
robust
adversarial training method
time-domain approach