中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [12]
计算技术研究所 [2]
西安光学精密机械研究... [2]
采集方式
OAI收割 [16]
内容类型
期刊论文 [16]
发表日期
2024 [1]
2023 [8]
2021 [2]
2020 [2]
2019 [2]
学科主题
筛选
浏览/检索结果:
共16条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
提交时间升序
提交时间降序
作者升序
作者降序
发表日期升序
发表日期降序
Unbiased Visual Question Answering by Leveraging Instrumental Variable
期刊论文
OAI收割
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6648-6662
作者:
Pan, Yonghua
;
Liu, Jing
;
Jin, Lu
;
Li, Zechao
  |  
收藏
  |  
浏览/下载:17/0
  |  
提交时间:2024/07/22
Visualization
Correlation
Instruments
Training
Predictive models
Color
Generators
Visual question answering
instrumental variable
causal inference
out of distribution
Multi-modal spatial relational attention networks for visual question answering
期刊论文
OAI收割
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:
Yao, Haibo
;
Wang, Lipeng
;
Cai, Chengtao
;
Sun, Yuxin
;
Zhang, Zhi
  |  
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2024/02/22
Visual question answering
Spatial relation
Attention mechanism
Pre -training strategy
VQAPT: A New visual question answering model for personality traits in social media images
期刊论文
OAI收割
PATTERN RECOGNITION LETTERS, 2023, 卷号: 175, 页码: 66-73
作者:
Biswas, Kunal
;
Shivakumara, Palaiahnakote
;
Pal, Umapada
;
Liu, Cheng-Lin
;
Lu, Yue
  |  
收藏
  |  
浏览/下载:9/0
  |  
提交时间:2024/02/22
Personality trait images
Multimodal concept
Text recognition
Social media images
Natural language processing
Visual question answering
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering
期刊论文
OAI收割
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:
Zheng, Wenbo
;
Yan, Lan
;
Wang, Fei-Yue
  |  
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2023/12/21
Graph attention
graph reasoning
multimodal graph
self-attention
text-based visual question answering
Medical visual question answering with symmetric interaction attention and cross-modal gating
期刊论文
OAI收割
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 卷号: 85, 页码: 10
作者:
Chen, Zhi
;
Zou, Beiji
;
Dai, Yulan
;
Zhu, Chengzhang
;
Kong, Guilan
  |  
收藏
  |  
浏览/下载:24/0
  |  
提交时间:2023/11/17
Medical visual question answering
Self-attention
Information interaction
Cross-modal gating
Hierarchical Attention Networks for Fact-based Visual Question Answering
期刊论文
OAI收割
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 页码: 18
作者:
Yao, Haibo
;
Luo, Yongkang
;
Zhang, Zhi
;
Yang, Jianhang
;
Cai, Chengtao
  |  
收藏
  |  
浏览/下载:7/0
  |  
提交时间:2023/11/17
Fact-based Visual Question Answering
Hierarchical attention networks
Self-attention
Multiple attention interaction
Positional encoding
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering
期刊论文
OAI收割
IEEE Transactions on Multimedia, 2023, 页码: 1-15
作者:
Song, Yaguang
;
Yang, Xiaoshan
;
Wang, Yaowei
;
Xu, Changsheng
  |  
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2023/06/12
Multi-modal Foundation Model
Out-of-Distribution Generalization
Visual Question Answering
Knowledge Distillation
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense
期刊论文
OAI收割
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 5, 页码: 5561-5578
作者:
Gao, Difei
;
Wang, Ruiping
;
Shan, Shiguang
;
Chen, Xilin
  |  
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2023/12/04
Visualization
Task analysis
Tail
Head
Annotations
Magnetic heads
Mouth
Visual question answering
compositional reasoning
commonsense reasoning
dataset construction
Visual Superordinate Abstraction for Robust Concept Learning
期刊论文
OAI收割
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 79-91
作者:
Qi Zheng
;
Chao-Yue Wang
;
Dadong Wang
;
a-Cheng Tao
  |  
收藏
  |  
浏览/下载:1/0
  |  
提交时间:2024/04/23
Concept learning
visual question answering
weakly-supervised learning
multi-modal learning
curriculum learning
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
期刊论文
OAI收割
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 卷号: 51, 期号: 2, 页码: 913-926
作者:
Li, Xuelong
;
Yuan, Aihong
;
Lu, Xiaoqiang
  |  
收藏
  |  
浏览/下载:44/0
  |  
提交时间:2021/02/22
Deep learning
image captioning
multimodal
visual question answering (VQA)