中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Integrating data and analysis technologies within leading environmental research infrastructures: Challenges and approaches

文献类型:期刊论文

作者Huber, Robert2; D'Onofrio, Claudio3; Devaraju, Anusuriya4; Klump, Jens1; Loescher, Henry W.5; Kindermann, Stephan6; Guru, Siddeswara4; Grant, Mark4; Morris, Beryl4; Wyborn, Lesley7
刊名ECOLOGICAL INFORMATICS
出版日期2021-03-01
卷号61页码:11
ISSN号1574-9541
关键词Scientific data analysis Research infrastructures Data service providers Data analysis environments
DOI10.1016/j.ecoinf.2021.101245
通讯作者Huber, Robert(rhuber@uni-bremen.de)
英文摘要When researchers analyze data, it typically requires significant effort in data preparation to make the data analysis ready. This often involves cleaning, pre-processing, harmonizing, or integrating data from one or multiple sources and placing them into a computational environment in a form suitable for analysis. Research infrastructures and their data repositories host data and make them available to researchers, but rarely offer a computational environment for data analysis. Published data are often persistently identified, but such identifiers resolve onto landing pages that must be (manually) navigated to identify how data are accessed. This navigation is typically challenging or impossible for machines. This paper surveys existing approaches for improving environmental data access to facilitate more rapid data analyses in computational environments, and thus contribute to a more seamless integration of data and analysis. By analysing current state-of-the-art approaches and solutions being implemented by world?leading environmental research infrastructures, we highlight the existing practices to interface data repositories with computational environments and the challenges moving forward. We found that while the level of standardization has improved during recent years, it still is challenging for machines to discover and access data based on persistent identifiers. This is problematic in regard to the emerging requirements for FAIR (Findable, Accessible, Interoperable, and Reusable) data, in general, and problematic for seamless integration of data and analysis, in particular. There are a number of promising approaches that would improve the state-of-the-art. A key approach presented here involves software libraries that streamline reading data and metadata into computational environments. We describe this approach in detail for two research infrastructures. We argue that the development and maintenance of specialized libraries for each RI and a range of programming languages used in data analysis does not scale well.
资助项目European Union's Horizon 2020 research and innovation program[824068] ; European Union's Horizon 2020 research and innovation program[831558] ; National Science Foundation (NSF) ; National Collaborative Research Infrastructure Strategy (NCRIS), an Australian Government Initiative ; [EF-1029808]
WOS研究方向Environmental Sciences & Ecology
语种英语
出版者ELSEVIER
WOS记录号WOS:000632605900011
资助机构European Union's Horizon 2020 research and innovation program ; National Science Foundation (NSF) ; National Collaborative Research Infrastructure Strategy (NCRIS), an Australian Government Initiative
源URL[http://ir.igsnrr.ac.cn/handle/311030/161982]  
专题中国科学院地理科学与资源研究所
通讯作者Huber, Robert
作者单位1.SIRO, 26 Dick Perry Ave, Kensington, WA, Australia
2.Univ Bremen, MARUM Ctr Marine Environm Sci, Leobener Str 8,POB 330440, D-28359 Bremen, Germany
3.Lund Univ, Dept Phys Geog & Ecosyst Sci, ICOS Carbon Portal, Solvegatan12, SE-22362 Lund, Sweden
4.Univ Queensland, TERN Australia, Brisbane, Qld, Australia
5.Natl Ecol Observ Network NEON, Battelle, Boulder, CO USA
6.DKRZ Deutsch Klimarechenzentrum GmbH, Hamburg, Germany
7.Australian Natl Univ, Natl Computat Infrastruct NCI, Canberra, ACT, Australia
8.Environm Agcy Austria, Spittelauer Lande 5, A-1090 Vienna, Austria
9.Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Key Lab Ecosyst Network Observat & Modeling, Beijing 100101, Peoples R China
10.TIB Leibniz Informat Ctr Sci & Technol, Hannover, Germany
推荐引用方式
GB/T 7714
Huber, Robert,D'Onofrio, Claudio,Devaraju, Anusuriya,et al. Integrating data and analysis technologies within leading environmental research infrastructures: Challenges and approaches[J]. ECOLOGICAL INFORMATICS,2021,61:11.
APA Huber, Robert.,D'Onofrio, Claudio.,Devaraju, Anusuriya.,Klump, Jens.,Loescher, Henry W..,...&Stocker, Markus.(2021).Integrating data and analysis technologies within leading environmental research infrastructures: Challenges and approaches.ECOLOGICAL INFORMATICS,61,11.
MLA Huber, Robert,et al."Integrating data and analysis technologies within leading environmental research infrastructures: Challenges and approaches".ECOLOGICAL INFORMATICS 61(2021):11.

入库方式: OAI收割

来源:地理科学与资源研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。