中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [12]
计算技术研究所 [5]
地理科学与资源研究所 [3]
采集方式
OAI收割 [20]
内容类型
期刊论文 [19]
会议论文 [1]
发表日期
4565 [1]
2026 [2]
2024 [7]
2023 [5]
2020 [1]
2019 [2]
更多
学科主题
筛选
浏览/检索结果:
共20条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
发表日期升序
发表日期降序
题名升序
题名降序
提交时间升序
提交时间降序
AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing
期刊论文
OAI收割
2025 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 4565, 卷号: N/A, 页码: 6007
作者:
Ji, Huawei
;
Deng, Cheng
;
Xue, Bo
;
Jin, Zhouyang
;
Ding, Jiaxin
  |  
收藏
  |  
浏览/下载:0/0
  |  
提交时间:2026/01/21
Academic literature parsing
Benchmark dataset
Vision-language model
Data-centric
DGL-RSIS: Decoupling global spatial context and local class semantics for training-free remote sensing image segmentation
期刊论文
OAI收割
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2026, 卷号: 146, 页码: 105113
作者:
Li, Boyi
;
Zhang, Ce
;
Timmerman, Richard M.
;
Bao, Wenxuan
  |  
收藏
  |  
浏览/下载:0/0
  |  
提交时间:2026/03/16
Vision language model
Open-vocabulary semantic segmentation
Referring expression segmentation
Domain knowledge
Training-free
Exploratory UAV Autonomous Visual Navigation With Spatiotemporal Cognition
期刊论文
OAI收割
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2026, 卷号: 19, 页码: 4119-4132
作者:
Jia, Huitong
;
He, Hongyuan
;
E, Chao
;
Zhang, Bing
  |  
收藏
  |  
浏览/下载:1/0
  |  
提交时间:2026/03/16
Autonomous aerial vehicles
Visualization
Cognition
Global Positioning System
Path planning
Satellites
Natural languages
Autonomous robots
Simultaneous localization and mapping
Three-dimensional displays
Exploratory
spatiotemporal cognition
unmanned aerial vehicle (UAV) vision-and-language navigation (VLN)
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
期刊论文
OAI收割
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 3, 页码: 1322-1338
作者:
Yu, Ting
;
Lin, Xiaojun
;
Wang, Shuhui
;
Sheng, Weiguo
;
Huang, Qingming
  |  
收藏
  |  
浏览/下载:46/0
  |  
提交时间:2024/05/20
Three-dimensional displays
Task analysis
Visualization
Point cloud compression
Grounding
Surveys
Solid modeling
3D dense captioning
vision-language bridging
visual captioning
3D point cloud
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis
期刊论文
OAI收割
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:
Yi, Guofeng
;
Fan, Cunhang
;
Zhu, Kang
;
Lv, Zhao
;
Liang, Shan
  |  
收藏
  |  
浏览/下载:65/0
  |  
提交时间:2024/02/22
Multimodal sentiment analysis
Vision-language
Multimodal fusion
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding
期刊论文
OAI收割
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
作者:
Xiao, Linhui
;
Yang, Xiaoshan
;
Peng, Fang
;
Yan, Ming
;
Wang, Yaowei
  |  
收藏
  |  
浏览/下载:32/0
  |  
提交时间:2024/05/30
Grounding
Reliability
Adaptation models
Task analysis
Visualization
Data models
Annotations
Visual grounding
curriculum learning
pseudo-language label
and vision-language models
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments
期刊论文
OAI收割
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-16
作者:
Dong An
;
Hanqing Wang
;
Wenguan Wang
;
Zun Wang
;
Yan Huang
  |  
收藏
  |  
浏览/下载:36/0
  |  
提交时间:2024/05/27
Vision-Language Navigation
Topological Map
Obstacle Avoidance
Memory-Adaptive Vision-and-Language Navigation
期刊论文
OAI收割
Pattern Recognition, 2024, 卷号: 153, 页码: 110511
作者:
Keji He
;
Ya Jing
;
Yan Huang
;
Zhihe Lu
;
Dong An
  |  
收藏
  |  
浏览/下载:39/0
  |  
提交时间:2024/06/26
Vision-and-Language Navigation
Memory bank
History noises
Memory-Adaptive Model
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification
期刊论文
OAI收割
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3469-3480
作者:
Peng, Fang
;
Yang, Xiaoshan
;
Xiao, Linhui
;
Wang, Yaowei
;
Xu, Changsheng
  |  
收藏
  |  
浏览/下载:36/0
  |  
提交时间:2024/07/03
Few-shot
image classification
vision-language models
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
期刊论文
OAI收割
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6906-6916
作者:
Wang, Wenxuan
;
He, Xingjian
;
Zhang, Yisi
;
Guo, Longteng
;
Shen, Jiachen
  |  
收藏
  |  
浏览/下载:50/0
  |  
提交时间:2024/07/03
Referring image segmentation
cross-modality guidance
masked self-distillation
vision and language