Data curation, as defined by The University of Illinois’ Graduate School of Library and Information Science: “is the active and ongoing management of data through its life cycle of interest and usefulness.” Sayeed Choudhury, Associate Dean for Research Data Management at Johns Hopkins University (JHU) and leader of the Data Conservancy, further breaks down Data Curation iterative activities to:
- Preserving: Collecting and taking care of research data.
- Sharing: Revealing data’s potential across domains
- Discovering: Promoting the re-use and new combinations of data
According to Alation:
“In practice, data curation is more concerned with maintaining and managing the metadata rather than the database itself and, to that end, a large part of the process of data curation revolves around ingesting metadata such as schema, table and column popularity, usage popularity, top joins/filters/queries. Data curators not only create, manage, and maintain data, but may also be involved in determining best practices for working with that data. Data curators often present the data in a visual format such as a chart, dashboard or report.”