中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
GeoKnowledgeFusion: A Platform for Multimodal Data Compilation from Geoscience Literature

文献类型:期刊论文

作者Guo, Zhixin3; Wang, Chaoyang2; Zhou, Jianping3; Zheng, Guanjie3; Wang, Xinbing3; Zhou, Chenghu1,3
刊名REMOTE SENSING
出版日期2024-05-01
卷号16期号:9页码:19
关键词multimodal data compilation data extraction data fusion scientific database
DOI10.3390/rs16091484
英文摘要With the advent of big data science, the field of geoscience has undergone a paradigm shift toward data-driven scientific discovery. However, the abundance of geoscience data distributed across multiple sources poses significant challenges to researchers in terms of data compilation, which includes data collection, collation, and database construction. To streamline the data compilation process, we present GeoKnowledgeFusion, a publicly accessible platform for the fusion of text, visual, and tabular knowledge extracted from the geoscience literature. GeoKnowledgeFusion leverages a powerful network of models that provide a joint multimodal understanding of text, image, and tabular data, enabling researchers to efficiently curate and continuously update their databases. To demonstrate the practical applications of GeoKnowledgeFusion, we present two scenarios: the compilation of Sm-Nd isotope data for constructing a domain-specific database and geographic analysis, and the data extraction process for debris flow disasters. The data compilation process for these use cases encompasses various tasks, including PDF pre-processing, target element recognition, human-in-the-loop annotation, and joint multimodal knowledge understanding. The findings consistently reveal patterns that align with manually compiled data, thus affirming the credibility and dependability of our automated data processing tool. To date, GeoKnowledgeFusion has supported forty geoscience research teams within the program by processing over 40,000 documents uploaded by geoscientists.
WOS关键词NAMED ENTITY RECOGNITION
资助项目NSF China
WOS研究方向Environmental Sciences & Ecology ; Geology ; Remote Sensing ; Imaging Science & Photographic Technology
语种英语
WOS记录号WOS:001219877600001
出版者MDPI
资助机构NSF China
源URL[http://ir.igsnrr.ac.cn/handle/311030/205791]  
专题资源与环境信息系统国家重点实验室_外文论文
通讯作者Zheng, Guanjie
作者单位1.Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China
2.Chinese Acad Geol Sci, Inst Geol, Beijing 100037, Peoples R China
3.Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
推荐引用方式
GB/T 7714
Guo, Zhixin,Wang, Chaoyang,Zhou, Jianping,et al. GeoKnowledgeFusion: A Platform for Multimodal Data Compilation from Geoscience Literature[J]. REMOTE SENSING,2024,16(9):19.
APA Guo, Zhixin,Wang, Chaoyang,Zhou, Jianping,Zheng, Guanjie,Wang, Xinbing,&Zhou, Chenghu.(2024).GeoKnowledgeFusion: A Platform for Multimodal Data Compilation from Geoscience Literature.REMOTE SENSING,16(9),19.
MLA Guo, Zhixin,et al."GeoKnowledgeFusion: A Platform for Multimodal Data Compilation from Geoscience Literature".REMOTE SENSING 16.9(2024):19.

入库方式: OAI收割

来源:地理科学与资源研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。