中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [32]
计算技术研究所 [4]
心理研究所 [2]
西安光学精密机械研究... [2]
金属研究所 [1]
深圳先进技术研究院 [1]
更多
采集方式
OAI收割 [44]
iSwitch采集 [1]
内容类型
期刊论文 [29]
会议论文 [15]
学位论文 [1]
发表日期
2023 [11]
2022 [2]
2021 [6]
2020 [6]
2019 [13]
2018 [3]
更多
学科主题
心理学 [1]
认知心理学 [1]
筛选
浏览/检索结果:
共45条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
提交时间升序
提交时间降序
发表日期升序
发表日期降序
题名升序
题名降序
作者升序
作者降序
Multi-modal spatial relational attention networks for visual question answering
期刊论文
OAI收割
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:
Yao, Haibo
  |  
收藏
  |  
浏览/下载:2/0
  |  
提交时间:2024/02/22
Visual question answering
Spatial relation
Attention mechanism
Pre -training strategy
VQAPT: A New visual question answering model for personality traits in social media images
期刊论文
OAI收割
PATTERN RECOGNITION LETTERS, 2023, 卷号: 175, 页码: 66-73
作者:
Biswas, Kunal
;
Shivakumara, Palaiahnakote
;
Pal, Umapada
;
Liu, Cheng-Lin
;
Lu, Yue
  |  
收藏
  |  
浏览/下载:3/0
  |  
提交时间:2024/02/22
Personality trait images
Multimodal concept
Text recognition
Social media images
Natural language processing
Visual question answering
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering
期刊论文
OAI收割
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:
Zheng, Wenbo
;
Yan, Lan
;
Wang, Fei-Yue
  |  
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2023/12/21
Graph attention
graph reasoning
multimodal graph
self-attention
text-based visual question answering
Medical visual question answering with symmetric interaction attention and cross-modal gating
期刊论文
OAI收割
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 卷号: 85, 页码: 10
作者:
Chen, Zhi
;
Zou, Beiji
;
Dai, Yulan
;
Zhu, Chengzhang
;
Kong, Guilan
  |  
收藏
  |  
浏览/下载:7/0
  |  
提交时间:2023/11/17
Medical visual question answering
Self-attention
Information interaction
Cross-modal gating
General Greedy De-Bias Learning
期刊论文
OAI收割
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 8, 页码: 9789-9805
作者:
Han, Xinzhe
;
Wang, Shuhui
;
Su, Chi
;
Huang, Qingming
;
Tian, Qi
  |  
收藏
  |  
浏览/下载:7/0
  |  
提交时间:2023/12/04
Task analysis
Correlation
Training
Data models
Question answering (information retrieval)
Visualization
Image classification
Curriculum learning
dataset biases
greedy strategy
robust learning
Hierarchical Attention Networks for Fact-based Visual Question Answering
期刊论文
OAI收割
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 页码: 18
作者:
Yao, Haibo
;
Luo, Yongkang
;
Zhang, Zhi
;
Yang, Jianhang
;
Cai, Chengtao
  |  
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2023/11/17
Fact-based Visual Question Answering
Hierarchical attention networks
Self-attention
Multiple attention interaction
Positional encoding
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering
期刊论文
OAI收割
IEEE Transactions on Multimedia, 2023, 页码: 1-15
作者:
Song, Yaguang
;
Yang, Xiaoshan
;
Wang, Yaowei
;
Xu, Changsheng
  |  
收藏
  |  
浏览/下载:8/0
  |  
提交时间:2023/06/12
Multi-modal Foundation Model
Out-of-Distribution Generalization
Visual Question Answering
Knowledge Distillation
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense
期刊论文
OAI收割
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 5, 页码: 5561-5578
作者:
Gao, Difei
;
Wang, Ruiping
;
Shan, Shiguang
;
Chen, Xilin
  |  
收藏
  |  
浏览/下载:8/0
  |  
提交时间:2023/12/04
Visualization
Task analysis
Tail
Head
Annotations
Magnetic heads
Mouth
Visual question answering
compositional reasoning
commonsense reasoning
dataset construction
Visual Superordinate Abstraction for Robust Concept Learning
期刊论文
OAI收割
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 79-91
作者:
Qi Zheng
  |  
收藏
  |  
浏览/下载:20/0
  |  
提交时间:2023/01/18
Concept learning
visual question answering
weakly-supervised learning
multi-modal learning
curriculum learning
Multimodal Pretraining from Monolingual to Multilingual
期刊论文
OAI收割
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 220-232
作者:
Liang Zhang, Ludan Ruan, Anwen Hu, Qin Jin
  |  
收藏
  |  
浏览/下载:13/0
  |  
提交时间:2023/04/03
Multilingual pretraining
multimodal pretraining
cross-lingual transfer
multilingual generation
cross-modal retrieval