Browsing by Author "Profeta, Lucia"
Now showing 1 - 18 of 18
- Results Per Page
- Sort Options
- Some of the metrics are blocked by yourconsent settingsAligning, combining and extending vocabularies for the AstroMat, DIGIS and EarthChem geo- and cosmochemical data systems(ARDC, 2023-11)
; ;Profeta, Lucia ;Richard, Stephen ;Johansson, Annika; ;Lehnert, Kerstin ;Luzi-Helbing, Manja ;Sarbas, Bärbel ;Sweets, HannahSynthesis databases, i.e. compilations of data published over decades by different sources, provide the foundation for many data analytics and machine learning techniques in modern geochemical data science. The GEOROC and PetDB databases are leading, open-access sources of geochemical and isotopic datasets of terrestrial igneous and metamorphic rocks and minerals. The primary purpose of these geochemistry databases is to support and facilitate new research projects using previously published data. Established simultaneously 24 years ago, they currently provide access to curated compilations of rock and mineral compositions from thousands of publications, ranging from the late 19th century to today and totalling >40 million single data values.Ensuring consistent data quality across such a broad range of data sources and analytical techniques remains a challenge. The DIGIS initiative for GEOROC 2.0 and EarthChem for PetDB are now combining their vocabularies, following a modular approach to develop shared terminologies that will enable the data systems to converge towards interoperability.The combined system of vocabularies describes samples and analytical procedures linked to geochemical analyses, including, among others, a comprehensive list of mineral names or a description and categorisation of analytical methods and instruments. These vocabularies are aligned with external vocabularies by recognised authorities wherever possible (e.g. the International Mineralogical Association's 'List of Minerals', IUPAC chemical terminologies, DRUM). For remaining terms, local profiles are published in Research Vocabularies Australia, subsetting terms from multiple recognised international vocabularies.In aligning their vocabularies, DIGIS and EarthChem closely collaborate with the Astromaterials Data System that publishes and compiles analytical data on astromaterials samples. The 'Analytical methods for geochemistry and cosmochemistry' vocabulary 1 was developed jointly with the Geo.X network of laboratories and instruments. The long-term aim of this collaboration is easier data integration between the data systems - enabling more ambitious data science applications in geochemistry research. References 1. Analytical methods for geochemistry and cosmochemistry, 2023. Research VocabulariesAustralia, Version 1.2, https://vocabs.ardc.edu.au/viewById/650. - Some of the metrics are blocked by yourconsent settingsChallenging the Status Quo: What Databases/Data Compilations are You Using for Your Data-Driven Geochemical Research? Will they still be accessible in 2030? Are your data compilations ethical?(2022-12-14)
;Wyborn, Lesley A.; ;Lehnert, Kerstin ;Prent, Alexander ;Farrington, Rebecca ;Klump, Jens F. ;Elger, Kirsten ;Hezel, Dominik ;ter Maat, GeertjeProfeta, LuciaThe status quo of data-driven geochemical research is dominated by multiple, highly specialised and unsustainable data compilations to feed specific research questions: the future requires persistent FAIR geochemical data repositories that make data both human and machine-readable. New, improved and automated instrumentation in the field of geochemistry mean that geochemical and isotopic data are being generated at faster rates and in greater volumes than have hitherto been possible. Data-driven analysis of existing geochemical datasets is growing as a research endeavour as new tools and new capabilities such as artificial intelligence (AI) and Machine Learning (ML) increase in popularity. Availability of cloud resources enable researchers to work with ever larger datasets. Combined, these factors mean that large datasets suitable for these analyses are growing in availability. As it is near impossible for a single researcher to collect, analyse and then create a single dataset, researchers increasingly rely on open-access curated global databases (eg, EarthChem, GEOROC). Additional national databases are emerging to capture all analytical data produced in laboratories (eg, EPOS MSL, AusGeochem). Today it is commonplace for project-based databases to be created by combining multiple datasets from the literature or downloading data from online databases. These compilation datasets rarely cite the individual researcher who created the dataset. Further, these compilations rarely have sustainable funding and are only maintained for as long as the creator has the capability. This leads to duplication of datasets with multiple conflicting versions that are rarely curated and kept up-to-date. As well as threatening research validity, this also raises issues of ethics: in a research paper, the ideas of others are always referenced and credited. In these data compilations, even if permissive licensing allow data to be copied, the best scientific research method would require citation of and credit be given to all sources of any analysis. Increasingly funders are asking for evidence of return of investment on where they have funded sample collection, analytical equipment, laboratories, etc. But if a researcher has no idea of where their analysis is used in data compilations how can they report to their funders? - Some of the metrics are blocked by yourconsent settingsCommunity recommendations for geochemical data, services and analytical capabilities in the 21st century(2023-04)
; ;Wyborn, Lesley ;Lehnert, Kerstin ;Ware, Bryant ;Prent, Alexander ;Profeta, Lucia ;Kohlmann, Fabian ;Noble, Wayne ;Bruno, Ian ;Lambart, Sarah ;Ananuer, Halimulati ;Barber, Nicholas ;Becker, Harry ;Brodbeck, Maurice ;Deng, Hang ;Deng, Kai ;Elger, Kirsten ;de Souza Franco, Gabriel ;Gao, Yajie ;Ghasera, Khalid ;Hezel, Dominik ;Huang, Jingyi ;Kerswell, Buchanan ;Koch, Hilde ;Lanati, Anthony ;ter Maat, Geertje ;Martínez-Villegas, Nadia ;Nana Yobo, Lucien ;Redaa, Ahmad ;Schäfer, Wiebke ;Swing, Megan ;Taylor, Richard ;Traun, Marie Katrine ;Whelan, JoZhou, TengfeiThe majority of geochemical and cosmochemical research is based upon observations and,in particular, upon the acquisition, processing and interpretation of analytical data fromphysical samples. The exponential increase in volumes and rates of data acquisition over thelast century, combined with advances in instruments, analytical methods and an increasingvariety of data types analysed, has necessitated the development of new ways of data cura-tion, access and sharing. Together with novel data processing methods, these changes haveenabled new scientific insights and are driving innovation in Earth and Planetary Scienceresearch. Yet, as approaches to data-intensive research develop and evolve, new challengesemerge. As large and often global data compilations increasingly form the basis for newresearch studies, institutional and methodological differences in data reporting are provingto be significant hurdles in synthesising data from multiple sources. Consistent data for-mats and data acquisition descriptions are becoming crucial to enable quality assessment,reusability and integration of results fostering confidence in available data for reuse. Here,we explore the key challenges faced by the geo- and cosmochemistry community and, bydrawing comparisons from other communities, recommend possible approaches to overcome them. The first challenge is bringing together the numerous sub-disciplines withinour community under a common internationally initiative. One key factor for this conver-gence will be gaining endorsement from the international geochemical, cosmochemical andanalytical societies and associations, journals and institutions. Increased education andoutreach, spearheaded by ambassadors recruited from leading scientists across disciplines,will further contribute to raising awareness, and to uniting and mobilising the community.Appropriate incentives, recognition and credit for good data management as well as animproved, user-oriented technical infrastructure will be essential for achieving a culturalchange towards an environment in which the effective use and real-time interchange of largedatasets is common-place. Finally, the development of best practices for standardised datareporting and exchange, driven by expert committees, will be a crucial step towards mak-ing geo- and cosmochemical data more Findable, Accessible, Interoperable and Reusableby both humans and machines (FAIR). - Some of the metrics are blocked by yourconsent settingsDeveloping Vocabularies for the International OneGeochemistry Initiative: Specifying Locally, Harmonising Globally(Zenodo, 2023)
; ;Profeta, Lucia ;Prent, Alexander ;Brownlee, Rowan ;Wyborn, Lesley ;Elger, Kirsten ;Luzi-Helbing, Manja ;Lehnert, KerstinHezel, DominikOver the last century enormous volumes of geochemical data have been acquired and thousands of globally distributed geochemical databases across the academic, government and industry sectors now exist. Unfortunately, there have been few efforts to standardise data or the diverse approaches for processing and modelling them, making it difficult to harness these data assets and contribute to global societal challenges (e.g., UN Sustainable Development Goals).<br> <br> OneGeochemistry is a new initiative that seeks to build and maintain consensus-driven standards to create a global network of findable and accessible geochemical data that is truly interoperable and reusable to both humans and machines. The highest priority is the development of internationally shared FAIR, machine-actionable vocabularies. Progressing towards community-agreed vocabularies, as well as their endorsement and governance, is difficult because geochemistry is spread across four science unions and tens of international/ national societies. Fortunately, geochemistry is one of eleven discipline case studies of the CODATA-led EU H2020 WorldFAIR project, which is exploring implementation of the FAIR principles across these case studies as well as leading the development of a global international framework for interdisciplinary interoperability and reusability of digital research objects.<br> <br> To start this process, four international geochemical databases (Astromat, AusGeochem, EarthChem, GEOROC) are now collaborating on a modular approach to develop shared terminologies. Where possible, terms from existing international vocabularies will be utilised (e.g., MINDAT, IUPAC chemical terminologies, DRUM). For remaining terms, local profiles will be published in Research Vocabularies Australia. The creation of a vocabularies record to be used by each database can converge databases to interoperability, with a vocabulary being a FAIR enabling resource (FER). FAIR Implementation Profiles are a tool to capture each of these FERs, and FER nanopublication URIs can help to enhance the machine readability and interoperability of local databases within a global network of geochemical data resources. - Some of the metrics are blocked by yourconsent settingsGEOROC and EarthChem: Optimizing Data Services for Geochemistry through Collaboration(2022)
; ;Lehnert, Kerstin ;Profeta, Lucia ;Sarbas, Bärbel; ;Cao, Sean ;Figueroa, Juan David; ;Ji, Peng ;Johansson, Annika; ;Möller-McNett, Stefan ;Mukhumova, Mariyam; ;Sturm, Adrian ;Sweets, Hannah; Geochemical data are fundamental to understanding many planetary and environmental processes – yet in the absence of a community-endorsed data culture that adheres to common data standards, the geochemical data landscape is highly fragmented. The GEOROC and PetDB databases are leading, open-access resources for geochemical and isotopic rock and mineral data that have collaborated for nearly 25 years to provide researchers with access to large volumes of curated and harmonized data collections. PetDB is a global synthesis of published chemical, isotopic and mineralogical data for rocks, minerals and melt inclusions with a focus on data for igneous and metamorphic rocks from the ocean floor, ophiolites, xenolith samples from the Earth's mantle and lower crust and tephra, operated by the EarthChem data facility. Its counterpart, GEOROC hosts a collection of published analyses of volcanic and plutonic rocks, minerals and mantle xenoliths, predominantly derived from ocean islands and continental settings. These curated, domain-specific databases are increasingly valuable to data-driven and interdisciplinary research and form the basis of hundreds of new research articles each year across numerous earth data disciplines. Over the last two decades, both GEOROC and EarthChem have invested great efforts into operating data infrastructures for findable, accessible, interoperable and reusable data, while working together to develop and maintain the EarthChem Portal (ECP) as a global open data service to the geochemical, petrological, mineralogical and related communities. The ECP provides a single point of access to >30 million analytical values for >1 million samples, aggregated from independently operated databases (PetDB, NAVDAT, GEOROC, USGS, MetPetDB, DARWIN). Yet one crucial element of FAIR data is still largely missing: interoperability across different data systems, that allows data in separately curated databases, such as GEOROC and PetDB, to be integrated into comprehensive, global geochemical datasets. Both EarthChem and GEOROC have recently embarked on major new developments and upgrades to their systems to improve the interoperability of their data systems. The new Digital Geochemical Data Infrastructure (DIGIS) initiative for GEOROC 2.0 aims to develop a connected platform to meet future challenges of digital data-based research and provide advanced services to the community. EarthChem has been developing an API-driven architecture to align with growing demands for machine-readable, Analysis Ready Data (ARD). This has presented an opportunity to make the two data infrastructures more interoperable and complementary. EarthChem and DIGIS have committed to cooperation on system architecture design, data models, data curation, methodologies, best practices and standards for geochemistry. This cooperation will include: (a) joint research projects; (b) optimized coordination and alignment of technologies, procedures and community engagement; and (c) exchange of personnel, data, technology and information. The EarthChem-DIGIS collaboration integrates with the international OneGeochemistry initiative to create a global geochemical data network that facilitates and promotes discovery and access of geochemical data through coordination and collaboration among international geochemical data providers, in close dialogue with the scientific community and with journal publishers. - Some of the metrics are blocked by yourconsent settingsGEOROC and EarthChem: Optimizing Data Services for Geochemistry through Collaboration(2022-06-08)
; ;Lehnert, Kerstin ;Profeta, Lucia ;Sarbas, Bärbel; ;Cao, Sean ;Figueroa, Juan David; ;Ji, Peng ;Johansson, Annika; ;Möller-McNett, Stefan ;Mukhumova, Mariyam; ;Sturm, Adrian ;Sweets, Hannah; - Some of the metrics are blocked by yourconsent settingsInnovating and Networking Global Geochemical Data Resources Through OneGeochemistry(2023)
;Prent, Alexander M. ;Hezel, Dominik C.; ;Wyborn, Lesley ;Farrington, Rebecca ;Elger, Kirsten ;Profeta, Lucia ;Nixon, Angus L.Lehnert, Kerstin - Some of the metrics are blocked by yourconsent settingsIntroducing OneGeochemistry, landscape of data resources and standards: showcases growing list of publications on data reporting standards etc(Zenodo, 2022)
;Wyborn, Lesley ;Lehnert, Kerstin ;Prent, Alexander; ;Profeta, Lucia ;Elger, Kirstenter Maat, GeertjeThis presentation was part of the Goldschmidt 2022 workshop <em>“Earth Science meets Data Science: what are our needs for geochemical data, services and analytical capabilities in the 21st century?. </em>It covers uses for geochemical data; the need for FAIR standards for geochemical data; the history of developing geochemistry standards so far; equivalent initiatives in climate, seismology, crystallography and pure and applied chemistry; proposed methods to develop the required standards through 'expert' communities; and an introduction to the CODATA-led WorldFAIR project<strong>: </strong>Global cooperation on FAIR data policy and practice. - Some of the metrics are blocked by yourconsent settingsOneGeochemistry Interim Governance(Zenodo, 2022)
;Lehnert, Kerstin; ;Elger, Kirsten ;Wyborn, Lesley ;Prent, Alexander ;Hezel, Dominik ;Profeta, LuciaNixon, AngusOneGeochemistry is an international collaboration between multiple national and international organisations that support geochemistry capability and data production. This document describes the interim governance of OneGeochemistry that will be valid until the network is formally constituted (planned for mid 2024). - Some of the metrics are blocked by yourconsent settingsOneGeochemistry: a Proposed International Framework to Enable Online Interchange of Globally Distributed Geochemical Data(Zenodo, 2021)
;Wyborn, Lesley ;Lehnert, Kerstin ;Prent, Alexander; ;Klump, Jens ;ter Maat, Geertje ;Elger, KirstenProfeta, LuciaGeochemical data are fundamental to understanding processes in natural systems and have been collected for more than a century. They could now be a vital input into many of the UN Sustainable Development Goals (SDG), in particular SDG#6 (Clean Water and Sanitation); SDG#7 (Affordable and Clean Energy); SDG#8 (Decent Work and Economic Growth); SDG#9 (Industry, Innovation and Infrastructure); SDG#13 (Climate Action) and SDG#15 (Life on Land). Unfortunately, it is near impossible to reuse the vast existing amounts of geochemical data: they are currently globally fragmented over thousands of databases and are located in either personal, institutional or national silos. Very little is accessible online and where it is, the lack of agreed international standards for metadata/data make it near impossible to reuse without considerable human effort in data wrangling and cleaning. A mapping of the global landscape identified some major national geochemical data ‘Systems’ (GeoRoc, EarthChem, Deep-time Digital Earth, AuScope Geochemistry Network, EPOS): each deals with various parts of the geochemical ecosystem ranging from collection /description of samples in the field, through laboratory analysis, to publication of the results and their longer term accessibility in online databases. Although each ‘System’ has a different driver, funding and context, there are common elements within each that can be leveraged into a OneGeochemistry ‘Framework’ (e.g., target analytes are based on the Periodic Table; all require standard units of measure; many use rock or mineral names). The overall goal is to create a FAIR global network of interoperable distributed geochemical databases and data systems. - Some of the metrics are blocked by yourconsent settingsOneGeochemistry: Progress on the Path toward Shared, Reusable, and Interoperable Geochemical Data(2022-12-15)
;Lehnert, Kerstin ;Wyborn, Lesley A.; ;Prent, Alexander ;Hezel, Dominik C. ;ter Maat, GeertjeProfeta, LuciaThe OneGeochemistry initiative aims to advance a global network of geochemical data resources through coordination and collaboration among international geochemical, cosmochemical, and isotopic data providers. OneGeochemistry is driven by the urgent need 1) to transform geochemical data for use in new scientific, data-driven methodologies to help solve the grand challenges of humanity in the 21st century; 2) to connect complementary data systems within an increasingly fragmented landscape of geochemical data; and 3) to establish data standards and their governance in geochemistry to enable the full implementation of the FAIR principles, which requires community standards for metadata and vocabularies as well as technical protocols to structure the data for machine-to-machine data exchange and integration. Over the past three years, the founders of OneGeochemistry have worked to inform and engage a wide range of stakeholders including researchers, funders, publishers, as well as manufacturers of analytical instrumentation. OneGeochemistry has organized community events such as an EGU 2022 Great Debate and workshops at the Goldschmidt Conference 2022 that have provided important insights into the requirements for data standards in geochemistry and revealed challenges especially in changing the data culture in geochemistry. Discussions with manufacturers of analytical instrumentation are opening new opportunities to develop standardized interfaces for data exchange between software tools, data systems, and instruments to create seamless data pipelines from the field to the lab to data repository that will ease the burden of data management for researchers. OneGeochemistry is now part of the WorldFAIR project (separate presentation in session IN009) that enables global cooperation on FAIR data policy and practices across 11 disciplines with funding provided by the European Commission. OneGeochemistry’s next step is to develop its governance structure and plans for its long-term sustainability. Another aim is to extend participation in OneGeochemistry to the Global South to be truly global, inclusive & equitable. - Some of the metrics are blocked by yourconsent settingsOvercoming Fragmentation of Geochemical Data Resources: Collaboration between EarthChem, Astromat, GEOROC, and MetBase(2023)
;Lehnert, Kerstin; ;Hezel, Dominik ;Profeta, Lucia ;Ji, PengSturm, AdrianGlobal geochemical datasets are increasingly valuable for solving research questions in geochemistry, volcanology and beyond. To support new research, open sharing and access of geochemical data needs to be easy for researchers so they can take full advantage of the rapidly growing volume of data generated in laboratories across the globe, and to comply with the principles of Open Science. Instead, the fragmented landscape of geochemical data systems makes it difficult for researchers to find, access, and contribute their data: Geochemical data are curated and published in a range of thematic, institutional, and programmatic data systems that differ in architecture, metadata schemas, terminology, and data output formats. Researchers have to figure out where to obtain the data they need; learn to use different search applications; retrieve data from multiple databases and painstakingly reformat the datasets they obtained from different systems to integrate them. They need to select an appropriate repository for their data, and potentially work with different submission systems and templates. Collaboration among geochemical data systems is a critical step to overcome this fragmentation and facilitate geochemical data management and access for the research community by coordinating, aligning, and integrating their systems. Through collaboration, data repositories and databases can also leverage each other’s expertise and resources to operate their services more effectively and efficiently. We here report about new collaborative efforts among four geochemical data systems that aim to harmonize and integrate their data holdings and software ecosystem for the benefit of the research community and to improve their sustainability: EarthChem (https://earthchem.org/), GEOROC (https://georoc.eu/), MetBase (https://metbase.org/), and the Astromaterials Data System (https://www.astromat.org/). Building on the long-term collaboration between EarthChem and GEOROC, this collaboration leverages the new development of the Astromaterials Data System with modern technology and two new projects funded to overhaul the infrastructure of the GEOROC and MetBase databases as an opportunity to jointly develop a more resilient, sustainable platform for data exchange. Results of the collaboration so far include: a) alignment of the Astronaut and MetBase data models b) migration of the MetBase data holdings into the Astromat synthesis database; c) alignment of the EarthChem and GEOROC data models; d) new automated synchronization process of GEOROC data to the ECP; e) harmonized vocabularies for chemical variables, analytical methods (Others are in development in alignment with emerging efforts of the OneGeochemstry initiative); f) design of the future shared architecture of EarthChem and GEOROC that includes plans for a joint data entry tool for curators and a single data submission platform for researchers to contribute their data to the affiliated domain repositories. The ultimate goal of this harmonization between EarthChem, Astromat, GEOROC and MetBase is to make it easier for researchers to access and contribute data. We hope to integrate further systems in the future, building on ongoing collaborations with the Australian Geochemistry Network, the US Geological Survey, SAMIS (Sample Analysis Microinformation System), the GFZ Data Services, and the Sparrow software. - Some of the metrics are blocked by yourconsent settingsSynchronizing GEOROC and EarthChem towards Global Geochemical Data Exchange(CODATA, 2022-06-22)
; ;Profeta, Lucia ;Lehnert, Kerstin ;Sarbas, BärbelThe DIGIS Team; The EarthChem TeamGEOROC and EarthChem provide services for open data publication, archiving, and interactive access of geochemical and isotopic data of igneous and metamorphic rocks, minerals and inclusions. The two data providers have collaborated since the early 2000s to deliver consistent and complementary data to the geochemistry community. Curation work for PetDB and GEOROC relies on a strong foundation of consultation with the broader geochemical community, to establish which dataset quality related metadata needs to be included to provide end-users with truly reusable datasets. Data exchange is currently implemented through the EarthChem Portal (ECP). The ECP was developed as a global open data service to the geochemical, petrological, mineralogical, and related communities. It provides a single point of access to >45 million analytical values for >1 million samples, aggregated from independently operated databases (PetDB, NAVDAT, GEOROC, USGS, MetPetDB, DARWIN). In an effort to improve the FAIRness of their data services and the interoperability of their data systems, a new suite of API-driven architecture is being developed by both systems. EarthChem and DIGIS (Digital Geochemical Data Infrastructure initiative for GEOROC 2.0) have committed to cooperation on system architecture design, data models, data curation, methodologies, best practices and standards for their geochemical data infrastructures. This effort will include working towards harmonized vocabularies for the core database concepts as well as dataset quality information. These common vocabularies are being developed as part of the Observations Data Model 2 (ODM2) ecosystem. Versioning of the vocabularies will ensure dynamic response to evolving community requirements. This will allow for versatility for expansions to other data types, after more rounds of community consultations to determine best practices. The vocabularies further build on established concepts from other communities, such as mineralogy and chemistry. These common vocabularies developed between EarthChem and DIGIS will be the foundation of future seamless API-driven data exchange that can act as a prototype for the distributed data framework envisioned by OneGeochemistry. - Some of the metrics are blocked by yourconsent settingsThe OneGeochemistry initiative as a CODATA Working Group; bringing together international geochemical data systems for easy data discovery(2023)
;Prent, Alexander ;Wyborn, Lesley; ;Lehnert, Kerstin ;Elger, Kirsten ;Hezel, Dominik ;Profeta, Lucia ;ter Maat, Geertje ;Farrington, RebeccaRawling, TimAs geochemical data enable understanding of the Earth system and help to address critical societal issues the organisation thereof is important. Questions asked about processes affecting our environment and geological past become more complex and interdisciplinary in nature as well as multidimensional. To help answer these questions within the geochemistry research capabilities and data compilations are required to be comprehensive and both human and machine readable. Various international organisations are building infrastructure to capture and distribute geochemical data in a consistent manner adhering to the FAIR principles. Since May 2021 the OneGeochemistry initiative has officially started efforts towards aligning these organisations’ data frameworks in order to standardise how geochemical data is reported around the globe. In November 2022 the OneGeochemistry initiative applied and was granted to become the OneGeochemistry CODATA Working Group as part of the International Science Councils Committee on Data. The initiative has now also been endorsed by the Geochemical Society, the European Association of Geochemistry and the Working Group has been endorsed by the IUGS Commission on Global Geochemical Baselines. Coordination of the OneGeochemistry initiative is funded through the WorldFAIR project where it is one of the work packages in the larger ‘WorldFAIR: Global cooperation on FAIR data policy and practice’ project. A FAIR Implementation Profile analyses of the geochemistry communities of Australia (AusGeochem), USA (EarthChem, AstroMat) and Europe (GEOROC-DIGIS, EPOS-MSL, NFDI4EARTH) resulted in recognition of the need for common vocabularies for geochemistry data reporting as one of the most important actions to undertake towards international geochemistry data interoperability. A task adopted by EarthChem-DIGIS(GEOROC)-GFZ(DataSystems) collaboration and Research Vocabularies Australia. Here we will present an overview of the current OneGeochemistry initiative and its preliminary outcomes with regards to FAIR Implementation Profiles and processes that will help enable geochemical data interoperability between various stakeholders. - Some of the metrics are blocked by yourconsent settingsThe OneGeochemistry Initiative: Mobilising a Global Network of FAIR Geochemical Data to Support Research into the Grand Challenge of an Environmentally Sustainable Future(Zenodo, 2021)
;Wyborn, Lesley ;Elger, Kirsten ;Prent, Alexander ;Lehnert, Kerstin ;Bruno, Ian; ;Klump, Jens ;Profeta, Lucia ;Quinn, Daven P. ;Ramdeen, Sarahter Maat, GeertjeGeochemical data are fundamental for understanding past, present, and future processes in natural systems, from the interior of the Earth to its surface environments on land, in the oceans and in the air, to the entire solar system. Currently, despite the pervasive acquisition and analysis of geochemical data in the last century, it is hard to harness this wealth of data as existing practices have resulted in geochemical databases that are located in either personal, institutional, national, or programmatic silos. Due to lack of standards that are especially challenging to develop in long-tail communities, like geochemistry, much of this existing data is not interoperable and reusable: very little is open and accessible online. To create a global network of reusable geochemical data the International Geochemistry Community needs to come together to define the required, globally-agreed standards and best practices that will enable world-wide interoperability, reuse and open sharing of geochemical data. The OneGeochemistry initiative is proposed as a new initiative to rally geochemists around the world to come together to help develop the required international standards and define the best practices to enable the creation of a FAIR (Findable, Accessible, Interoperable and Reusable) global network of interoperable distributed geochemical databases. This cannot be done in isolation of standards that are being developed elsewhere, in particular liaison with the International Union of Pure and Applied Chemistry (IUPAC) is required so that the Geochemistry Community can leverage relevant standards based on the periodic table that are already developed within the chemistry community. The objective of this session is to consolidate ideas for a viable and sustainable FAIR global geochemistry network to support research grand challenges of today and meet those of the future. The session will be organised around the Why, What, Where, Who and How. The session will start with some ignition talks to set the scene on the ‘Why’ and the ‘What’ of establishing the OneGeochemistry Initiative and will be followed by a set of 4 minute lightning talks that raise awareness of ‘Where’ data systems are being built. The presentations will be followed by a roundtable discussion to help determine the key steps on the ‘How’ and to decide ‘Who’ needs to be involved to enable OneGeochemistry Initiative to move forward and become a reality. Speakers A.‘Why” we need OneGeochemistry Kerstin Lehnert - The Drivers for OneGeochemistry B. Lightning talks on “What” building blocks are required. Sarah Ramdeen - Assigning Globally Unique Persistent Identifiers at Birth to Materials Used in Geochemical Analysis. Kirsten Elger - Perspectives from Journals and Domain Repositories on the incorporation of Geochemical Data in Publications. Ian Bruno - The IUPAC Digital Chemistry Initiative and the Relevance to OneGeochemistry Initiative. C Lighting Talks on the ‘Where’ of existing data systems around the world Marthe Kloecking - The Geochemistry of Rocks of the Oceans and Continents (GEOROC) Data System. Kerstin Lehnert - EarthChem. Lucia Profeta - The Astromaterials Data System. Geertje ter Maat - The European Plate Observing System (EPOS) Multiscale Laboratories Data System. Alexander Prent - Australian AuScope Geochemistry Network. Daven Quinn - The EarthCube Sparrow Geochemical Data System. Lesley Wyborn - Using the FAIMS (Field Acquired Information Management System) app to support incorporation geochemical metadata standards at initial sample acquisition in the field. - Some of the metrics are blocked by yourconsent settingsThe Path to Creating Open Global Networks of All Types of Geochemical Data from Multiple Countries, Repositories and Researchers: what variables and vocabularies are YOU using to describe your data?(2023-12)
;Wyborn, Lesley A; ;Prent, Alexander ;Nixon, Angus ;Lehnert, Kerstin ;Profeta, Lucia ;Richard, Stephen M ;Hezel, Dominik ;Elger, Kirsten ;Farrington, RebeccaLuzi-Helbing, ManjaGeochemical datasets are notoriously heterogeneous and are collected by thousands of researchers/research groups on a diversity of samples (rocks, minerals, meteorites, fluids, gases, etc) using hundreds of analytical techniques across multiple disciplines around the globe. Even where there is consensus on the minimum variables needed to be reported for a particular analytical method, the vocabularies used for each variable are rarely consistent, making automated integration of equivalent data types extremely difficult. New, data-driven research questions are emerging that require globally aggregated geochemical datasets to enhance our understanding of planetary system processes, for which AI and ML tools are becoming increasingly employed. Combined, these applications are exacerbating the need to make geochemical data Findable, Accessible, Interoperable and Reusable (FAIR) thus driving a demand for standardisation of geochemical variables and vocabularies. Unfortunately there is presently little coordination and multiple vocabularies are emerging that often replicate the same concepts. The OneGeochemistry Initiative (http://onegeochemistry.org/) is working towards enabling a Open Global Network of all types of geochemical data from multiple countries, repositories and researchers. To achieve a balance between meeting increasing demands for local FAIR vocabularies versus time required to reach international agreement, OneGeochemistry developed a three-tiered approach (local, community, international): 1) Encouraging data providers with locally defined vocabularies to make them available online and ensure each term has a persistent identifier; 2) Encouraging groups with similar topics to begin harmonising on multiple locally-derived concepts/definitions and publish these as community resources; 3) Raising awareness of groups harmonising and making semantic resources FAIR-compliant at an international level, particularly those with endorsement from International authoritative groups (eg Scientific Unions/Societies). As convergence takes place towards internationally-agreed terms, the size of the community able to share (meta)data machine to machine grows. As better defined vocabularies become available they should be easy to translate into multiple languages. - Some of the metrics are blocked by yourconsent settingsUnlocking the Power of Mineral Data in Global Geochemical Databases(2022-07-20)
; ;Lehnert, Kerstin ;Profeta, Lucia; The DIGIS Team; The EarthChem TeamNovel methods of data analysis have led to numerous innovations in mineralogical research in recent years, requiring increasingly larger precompiled datasets. Whilst dedicated, high-quality expert compilations are invaluable for addressing specific research questions, comprehensive and curated global databases, such as GEOROC and PetDB, are essential for realising the full scientific potential of mineralogical data. Since the late 90s, the GEOROC and PetDB databases have collated, archived and curated over a century’s worth of geochemical analyses of natural minerals. The databases currently contain 17,531,666 and 2,664,189 individual mineral data values, respectively, which makes up over half of their total data holdings. These data are compiled together with carefully selected metadata that capture the diversity of mineral geochemical data, while also enabling targeted data quality assessment. To continue to obtain new insights from the accumulated data collected by mineralogists worldwide, our community needs to come together to ensure the continued improvement and sustainability of such curated databases. - Some of the metrics are blocked by yourconsent settingsWorldFAIR Project (MS6) Geochemistry Scientific Content Component(Zenodo, 2023)
;Prent, Alexander ;Wyborn, Lesley ;Farrington, Rebecca ;Lehnert, Kerstin; ;Elger, Kirsten ;Hezel, Dominik ;ter Maat, Geertje ;Profeta, LuciaRawling, TimWorldFAIR Milestone 6, reported here, specifies work done and being undertaken for Deliverable 5.2 (due month 20), ‘Geochemistry Methodology and Outreach’, which has the following description: “This deliverable will outline the methodology used to develop and update FIPs and promulgate knowledge of them, including publishers to ensure the quality, interoperability and reusability of data in publications”. As geochemical data is collected on a diversity of natural and synthetic samples (rocks, sediments, minerals, fossils, meteorites, cosmic dust, fluids, gases, etc), from the Earth or other planetary bodies, there is an incredible range of analytical instruments used and hundreds of analytical techniques applied. This results in a community with many subdisciplines that produce typically ‘long tail’ data - data that are highly specific and small in volume. The community and the data produced are heterogeneous and overlaps of common minimum variables are scarce. We conclude that developing a single FAIR Implementation Profile (FIP) for all geochemical data will not be possible; rather, there will need to be multiple linked FIPs for geochemistry subdisciplines and at multiple levels of granularity. As a FIP is underpinned by FAIR Enabling Resources (FERs), many such FERs need to be publicly available or need to be published. By specifying any FER(s) that accompany each FAIR principle within the individual FIP, users of any geochemical dataset/database will have accurate documentation for each FAIR Principles, and thus enhance machine readability. This Milestone describes progress towards developing a methodology designed to assist in defining the individual FERs required to fully describe the minimum scientific and technical variables used to describe any geochemical analysis. These FERs will enable the generation of multiple FIPs, facilitating published results to be reproduced and shared globally with sufficient metadata to make any geochemical resource FAIR for both humans and machines. This Milestone report then discusses how the components of this methodology are being executed in the community, discusses resulting progress towards minimum common variables of samples, discusses how to make best practices for geochemical methods available online and specifies a set of vocabularies published to describe methodologies. Visit WorldFAIR online at http://worldfair-project.eu. WorldFAIR is funded by the EC HORIZON-WIDERA-2021-ERA-01-41 Coordination and Support Action under Grant Agreement No. 101058393.