GeoKnowledgeFusion: A Platform for Multimodal Data Compilation from Geoscience Literature
文献类型:期刊论文
作者 | Guo, Zhixin3; Wang, Chaoyang2; Zhou, Jianping3; Zheng, Guanjie3; Wang, Xinbing3; Zhou, Chenghu1,3 |
刊名 | REMOTE SENSING
![]() |
出版日期 | 2024-05-01 |
卷号 | 16期号:9页码:19 |
关键词 | multimodal data compilation data extraction data fusion scientific database |
DOI | 10.3390/rs16091484 |
英文摘要 | With the advent of big data science, the field of geoscience has undergone a paradigm shift toward data-driven scientific discovery. However, the abundance of geoscience data distributed across multiple sources poses significant challenges to researchers in terms of data compilation, which includes data collection, collation, and database construction. To streamline the data compilation process, we present GeoKnowledgeFusion, a publicly accessible platform for the fusion of text, visual, and tabular knowledge extracted from the geoscience literature. GeoKnowledgeFusion leverages a powerful network of models that provide a joint multimodal understanding of text, image, and tabular data, enabling researchers to efficiently curate and continuously update their databases. To demonstrate the practical applications of GeoKnowledgeFusion, we present two scenarios: the compilation of Sm-Nd isotope data for constructing a domain-specific database and geographic analysis, and the data extraction process for debris flow disasters. The data compilation process for these use cases encompasses various tasks, including PDF pre-processing, target element recognition, human-in-the-loop annotation, and joint multimodal knowledge understanding. The findings consistently reveal patterns that align with manually compiled data, thus affirming the credibility and dependability of our automated data processing tool. To date, GeoKnowledgeFusion has supported forty geoscience research teams within the program by processing over 40,000 documents uploaded by geoscientists. |
WOS关键词 | NAMED ENTITY RECOGNITION |
资助项目 | NSF China |
WOS研究方向 | Environmental Sciences & Ecology ; Geology ; Remote Sensing ; Imaging Science & Photographic Technology |
语种 | 英语 |
WOS记录号 | WOS:001219877600001 |
出版者 | MDPI |
资助机构 | NSF China |
源URL | [http://ir.igsnrr.ac.cn/handle/311030/205791] ![]() |
专题 | 资源与环境信息系统国家重点实验室_外文论文 |
通讯作者 | Zheng, Guanjie |
作者单位 | 1.Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China 2.Chinese Acad Geol Sci, Inst Geol, Beijing 100037, Peoples R China 3.Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China |
推荐引用方式 GB/T 7714 | Guo, Zhixin,Wang, Chaoyang,Zhou, Jianping,et al. GeoKnowledgeFusion: A Platform for Multimodal Data Compilation from Geoscience Literature[J]. REMOTE SENSING,2024,16(9):19. |
APA | Guo, Zhixin,Wang, Chaoyang,Zhou, Jianping,Zheng, Guanjie,Wang, Xinbing,&Zhou, Chenghu.(2024).GeoKnowledgeFusion: A Platform for Multimodal Data Compilation from Geoscience Literature.REMOTE SENSING,16(9),19. |
MLA | Guo, Zhixin,et al."GeoKnowledgeFusion: A Platform for Multimodal Data Compilation from Geoscience Literature".REMOTE SENSING 16.9(2024):19. |
入库方式: OAI收割
来源:地理科学与资源研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。