Data Integration Initiative: Planning Document (original) (raw)
Related papers
Community for Data Integration 2015 annual report
Open-File Report, 2016
The Community for Data Integration (CDI) continued to experience success in fiscal year 2015. The CDI community members have been sharing, learning, and collaborating through monthly forums, workshops, working groups, and funded projects. In fiscal year 2015, CDI coordinated 10 monthly forums with 16 different speakers from the U.S. Geological Survey and external partners; funded 11 collaborative projects; and hosted an in-person, four-day workshop, which attracted 168 (134 in-person and 34 remote) data practitioners, data providers, and data consumers from across the USGS, academia, industry, and other government agencies. The Citizen Science, Connected Devices, Data Management, Semantic Web, and Tech Stack Working Groups continued to accomplish great things in fiscal year 2015. These working groups were major stakeholders in planning the 2015 CDI Workshop; they continued developing solutions to pressing challenges, and they brought in speakers throughout the year for more focused presentations and discussions. Additionally, a new working group formed during the 2015 CDI Workshop-the Earth-Science Themes Working Group.
Community for Data Integration 2016 annual report
Open-File Report
The Community for Data Integration (CDI) represents a dynamic community of practice focused on advancing science data and information management and integration capabilities across the U.S. Geological Survey and the CDI community. This annual report describes the various presentations, activities, and outcomes of the CDI monthly forums, working groups, virtual training series, and other CDI-sponsored events in fiscal year 2016. The report also describes the objectives and accomplishments of the 13 CDI-funded projects in fiscal year 2016.
D12.2 – Mid-term report on data integration
2021
This deliverable describes the activities carried out and the results achieved during the first two years of the ARIADNEplus project within four tasks of Work Package 12 (WP12). The objectives are to develop, deliver and maintain the components of the ARIADNEplus infrastructure that support the integration and interoperability of the data provided by the members of the consortium. The catalogue data integrated by the ADI (the Aggregative Data Infrastructure developed in T12.2) are made available as RDF records compliant to the AO-Cat model to the ARIADNEplus portal (T12.3) and the pilots developed in WP16 via two services: (1) the ARIADNEplus AC (the data and knowledge cloud developed in T12.1) , which exposes a SPARQL API, and (2) an Elasticsearch server, which provides a full-text index of the content of the AC. The deeper integration of item level data (item-level integration) is investigated in task 12.4, in order to develop support for research questions that require informatio...
A scoping review on data integration in the field of infectious diseases, 2009-2018
International Journal of One health, 2021
Background and Aim: Little is known about data integration in public health research and its impact. This study aimed to summarize known collaboration information, the characteristics of the datasets used, the methods of data integration, and knowledge gaps. Materials and Methods: We reviewed papers on infectious diseases from two or more datasets published during 2009- 2018, before the coronavirus disease pandemic. Two independent researchers searched the Medline and Global Health databases using predetermined criteria. Results: Of the 2375 items retrieved, 2272 titles and abstracts were reviewed. Of these, 164 were secondary reviews. Full-text reviews identified 153 relevant articles; we excluded 11 papers that did not meet our inclusion criteria. Of the 153 papers, 150 were single-country studies. Most papers were from North America (n=47). Viral diseases were the most commonly researched diseases (n=66), and many studies sought to define infection rates (n=62). Data integration usually employed unique national identifiers (n=37) or address-based identifiers (n=30). Two data sources were combined (n=121), and at least one data source typically included routine surveillance information. Conclusion: We found a growing usage of data integration in infectious diseases, emphasizing the advantages of data integration and linkage analysis, and reiterating its importance in public health emergency preparedness and response.
2018
In June 2017, the International Council for Science (ICSU) and its Committee on Data for Science and Technology (CODATA) brought together international scientific unions and associations of ICSU and the International Social Science Council (ISSC) that have made major strides in this area of work, as well as other organisations that curate standards and vocabularies for particular disciplines. The objective of the meeting was to develop an action plan to realise the full potential of the data science, technologies, and infrastructures currently being created by specific disciplinary groups and expand those efforts on an inter- and trans-disciplinary basis. The meeting identified key opportunities of the digital revolution and how they can be achieved. Priorities for action include: the need for examples of the benefits that have already been realised by specific disciplinary groups and inter- and trans-disciplinary projects; the need to extend activities to disciplinary fields that h...
2013
Established and emerging European research infrastructures are holding or will in the near future hold immense quantities of data. Power lies not only in storing and managing these data, but especially also in making them available and accessible to a wider audience, across national borders, scientific communities and disciplines, and by integrating datasets so that more complex scientific questions can be solved. This has challenges, many of which are shared between different scientific communities. To exchange existing expertise and address obstacles, the BioMedBridges, CRISP, DASISH and ENVRI projects-covering the biomedical sciences, physics, social science and humanities, and environmental sciences-have come together to identify cross-cutting topics, discuss current approaches and develop recommendations for future actions needed to solve them.