Research Directions for Principles of Data Management (Abridged) (original) (raw)

Research Directions for Principles of Data Management

ACM SIGMOD Record, 2017

In April 2016, a community of researchers working in the area of Principles of Data Management (PDM) joined in a workshop at the Dagstuhl Castle in Germany. The workshop was organized jointly by the Executive Committee of the ACM Symposium on Principles of Database Systems (PODS) and the Council of the International Conference on Database Theory (ICDT). The mission of the workshop was to identify and explore some of the most important research directions that have high relevance to society and to Computer Science today, and where the PDM community has the potential to make significant contributions. This article presents a summary of the report created by the workshop [4]. That report describes the family of research directions that the workshop focused on from three perspectives: potential practical relevance, results already obtained, and research questions that appear surmountable in the short and medium term. The report organizes the identified research challenges for PDM around seven core themes, namely Managing Data at Scale, Multi-model Data, Uncertain Information, Knowledge-enriched Data, Data Management and Machine Learning, Process and Data, and Ethics and Data Management. Since new challenges in PDM arise all the time, we note that this list of themes is not intended to be exclusive.

Manifesto from Dagstuhl Perspectives Workshop 16151 Research Directions for Principles of Data Management

2018

The area of Principles of Data Management (PDM) has made crucial contributions to the development of formal frameworks for understanding and managing data and knowledge. This work has involved a rich cross-fertilization between PDM and other disciplines in mathematics and computer science, including logic, complexity theory, and knowledge representation. We anticipate on-going expansion of PDM research as the technology and applications involving data management continue to grow and evolve. In particular, the lifecycle of Big Data Analytics raises a wealth of challenge areas that PDM can help with. In this report we identify some of the most important research directions where the PDM community has the potential to make significant contributions. This is done from three perspectives: potential practical relevance, results already obtained, and research questions that appear surmountable in the short and medium term. Perspectives Workshop April 10–15, 2016 – http://www.dagstuhl.de/16...

Where is the human in the data? A guide to ethical data use

GigaScience, 2018

Being asked to write about the ethics of big data is a bit like being asked to write about the ethics of life. Big data is now integral to so many aspects of our daily lives-communication, social interaction, medicine, access to government services, shopping, and navigation. Given this diversity, there is no one-size-fits-all framework for how to ethically manage your data. With that in mind, I present seven ethical values for responsible data use.

Ethics-aware Data Governance (Vision Paper)

2018

The number of datasets available to legal practitioners, policy makers, scientists, and many other categories of citizens is growing at an unprecedented rate. Ethics-aware data processing has become a pressing need, considering that data are often used within critical decision processes (e.g., staff evaluation, college admission, criminal sentencing). The goal of this paper is to propose a vision for the injection of ethical principles (fairness, non-discrimination, transparency, data protection, diversity, and human interpretability of results) into the data analysis lifecycle (source selection, data integration, and knowledge extraction) so as to make them first-class requirements. In our vision, a comprehensive checklist of ethical desiderata for data protection and processing needs to be developed, along with methods and techniques to ensure and verify that these ethically motivated requirements and related legal norms are fulfilled throughout the data selection and exploration ...

Data Processing: Reflections on Ethics

2019

Ethics-related aspects are becoming prominent in data management, thus the current processes for searching, querying, or analyzing data should be designed is such a way as to take into account the social problems their outcomes could bring about. In this paper we provide reflections on the unavoidable ethical facets entailed by all the steps of the information life-cycle, including source selection, knowledge extraction, data integration and data analysis. Such reflections motivated us to organize the First International Workshop on Processing Information Ethically (PIE).

Privacy Value Modeling: A Gateway to Ethical Big Data Handling (full paper)

2020

EU through General Data Protection Regulation, GDPR, stipulates to safeguard EU citizens fundamental rights by ensuring ethical, uninterrupted, big data sharing within and outside EU. Healthcare data is no exception to this. While dealing with big data, healthcare providers, Big Data Analysts(BDAs) and government bodies have collectively realized that patients values are to be prioritized for patients optimal value care and for an efficient healthcare system at large. To ensure patients value care, privacy, inter alia, is incorporated both by design within each domain’s data base and by policy via international, pan-European and national laws and regulations. This also became viable by standardizing the Information Security Management System (ISMS) indicators for healthcare providers and regulators alike. Lack of standard respective metrics for each privacy assuring parameter, constrains privacy from becoming an objective value object for each value actor. Still, privacy can be seen...