中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
机构
采集方式
内容类型
发表日期
学科主题
筛选

浏览/检索结果: 共71条,第1-10条 帮助

条数/页: 排序方式:
Scene text recognition via dual character counting-aware visual and semantic modeling network 期刊论文  OAI收割
SCIENCE CHINA-INFORMATION SCIENCES, 2024, 卷号: 67, 期号: 3, 页码: 2
作者:  
Xiao, Ke;  Zhu, Anna;  Iwana, Brian Kenji;  Liu, Cheng-Lin
  |  收藏  |  浏览/下载:6/0  |  提交时间:2024/03/13
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文  OAI收割
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  
Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan
  |  收藏  |  浏览/下载:5/0  |  提交时间:2024/02/22
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation 期刊论文  OAI收割
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-15
作者:  
Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan
  |  收藏  |  浏览/下载:5/0  |  提交时间:2024/02/23
Multi-modal spatial relational attention networks for visual question answering 期刊论文  OAI收割
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  
Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi
  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/02/22
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文  OAI收割
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  
Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
  |  收藏  |  浏览/下载:5/0  |  提交时间:2023/12/21
Knowledge-Embedded Mutual Guidance for Visual Reasoning 期刊论文  OAI收割
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 页码: 13
作者:  
Zheng, Wenbo;  Yan, Lan;  Chen, Long;  Li, Qiang;  Wang, Fei-Yue
  |  收藏  |  浏览/下载:8/0  |  提交时间:2023/11/16
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文  OAI收割
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  
Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming
  |  收藏  |  浏览/下载:8/0  |  提交时间:2023/11/16
VLP: A Survey on Vision-language Pre-training 期刊论文  OAI收割
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  
Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi
  |  收藏  |  浏览/下载:6/0  |  提交时间:2023/06/21
Masked Vision-language Transformer in Fashion 期刊论文  OAI收割
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 421-434
作者:  
Ge-Peng Ji
  |  收藏  |  浏览/下载:6/0  |  提交时间:2023/05/29
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文  OAI收割
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  
Xiao Wang
  |  收藏  |  浏览/下载:4/0  |  提交时间:2023/08/02