Integrating data and analysis technologies within leading environmental research infrastructures: Challenges and approaches
Document type: Journal article
Authors | Huber, Robert2; D'Onofrio, Claudio3; Devaraju, Anusuriya4; Klump, Jens1; Loescher, Henry W.5; Kindermann, Stephan6; Guru, Siddeswara4; Grant, Mark4; Morris, Beryl4; Wyborn, Lesley7 |
Journal | ECOLOGICAL INFORMATICS |
Publication date | 2021-03-01 |
Volume | 61; Pages: 11 |
ISSN | 1574-9541 |
Keywords | Scientific data analysis; Research infrastructures; Data service providers; Data analysis environments |
DOI | 10.1016/j.ecoinf.2021.101245 |
Corresponding author | Huber, Robert(rhuber@uni-bremen.de) |
Abstract | When researchers analyze data, it typically requires significant effort in data preparation to make the data analysis ready. This often involves cleaning, pre-processing, harmonizing, or integrating data from one or multiple sources and placing them into a computational environment in a form suitable for analysis. Research infrastructures and their data repositories host data and make them available to researchers, but rarely offer a computational environment for data analysis. Published data are often persistently identified, but such identifiers resolve onto landing pages that must be (manually) navigated to identify how data are accessed. This navigation is typically challenging or impossible for machines. This paper surveys existing approaches for improving environmental data access to facilitate more rapid data analyses in computational environments, and thus contribute to a more seamless integration of data and analysis. By analysing current state-of-the-art approaches and solutions being implemented by world-leading environmental research infrastructures, we highlight the existing practices to interface data repositories with computational environments and the challenges moving forward. We found that while the level of standardization has improved during recent years, it still is challenging for machines to discover and access data based on persistent identifiers. This is problematic in regard to the emerging requirements for FAIR (Findable, Accessible, Interoperable, and Reusable) data, in general, and problematic for seamless integration of data and analysis, in particular. There are a number of promising approaches that would improve the state-of-the-art. A key approach presented here involves software libraries that streamline reading data and metadata into computational environments. We describe this approach in detail for two research infrastructures. We argue that the development and maintenance of specialized libraries for each RI and a range of programming languages used in data analysis does not scale well. |
Funding projects | European Union's Horizon 2020 research and innovation program[824068] ; European Union's Horizon 2020 research and innovation program[831558] ; National Science Foundation (NSF) ; National Collaborative Research Infrastructure Strategy (NCRIS), an Australian Government Initiative ; [EF-1029808] |
WOS research area | Environmental Sciences & Ecology |
Language | English |
Publisher | ELSEVIER |
WOS accession number | WOS:000632605900011 |
Funding agencies | European Union's Horizon 2020 research and innovation program ; National Science Foundation (NSF) ; National Collaborative Research Infrastructure Strategy (NCRIS), an Australian Government Initiative |
Source URL | [http://ir.igsnrr.ac.cn/handle/311030/161982] |
Collection | Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences |
Author affiliations | 1. CSIRO, 26 Dick Perry Ave, Kensington, WA, Australia; 2. Univ Bremen, MARUM Ctr Marine Environm Sci, Leobener Str 8, POB 330440, D-28359 Bremen, Germany; 3. Lund Univ, Dept Phys Geog & Ecosyst Sci, ICOS Carbon Portal, Solvegatan 12, SE-22362 Lund, Sweden; 4. Univ Queensland, TERN Australia, Brisbane, Qld, Australia; 5. Natl Ecol Observ Network NEON, Battelle, Boulder, CO USA; 6. DKRZ Deutsch Klimarechenzentrum GmbH, Hamburg, Germany; 7. Australian Natl Univ, Natl Computat Infrastruct NCI, Canberra, ACT, Australia; 8. Environm Agcy Austria, Spittelauer Lande 5, A-1090 Vienna, Austria; 9. Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Key Lab Ecosyst Network Observat & Modeling, Beijing 100101, Peoples R China; 10. TIB Leibniz Informat Ctr Sci & Technol, Hannover, Germany |
Recommended citation, GB/T 7714 | Huber, Robert, D'Onofrio, Claudio, Devaraju, Anusuriya, et al. Integrating data and analysis technologies within leading environmental research infrastructures: Challenges and approaches[J]. ECOLOGICAL INFORMATICS, 2021, 61: 11. |
APA | Huber, Robert, D'Onofrio, Claudio, Devaraju, Anusuriya, Klump, Jens, Loescher, Henry W., ... & Stocker, Markus. (2021). Integrating data and analysis technologies within leading environmental research infrastructures: Challenges and approaches. ECOLOGICAL INFORMATICS, 61, 11. |
MLA | Huber, Robert, et al. "Integrating data and analysis technologies within leading environmental research infrastructures: Challenges and approaches". ECOLOGICAL INFORMATICS 61 (2021): 11. |