Checklist for Data Management Plan peer review
Reviewers of USDA Data Management Plans may use the following questions to guide their evaluation. If you have suggestions or comments on this review checklist, please contact https://www.nal.usda.gov/main/ask-question and choose “Ag Data Commons” as your topic.
- Does the DMP cover the full life cycle of the data?
- Are best practices in the scientific discipline available and if so are they referenced and/or followed?
- Does the budget adequately cover activities in the DMP?
- Is the DMP thoughtful and specific such that it is apparent that the project has personnel with appropriate knowledge and experience to manage the data?
Expected Data Types
- Is there a description of each data type and how they will be generated and/or sourced? Are these data types expected for this type of research? Is there any data to be expected that is not mentioned? For example, will data be born digital or recorded on paper? If born digital, will it be generated by measurements from a human or by equipment like a sensor or DNA sequencer?
- Does the DMP mention documentation or metadata accompanying each file, for understandability, validation, and reuse?
- Does the author account for the estimated volume of each data type? (May also be included in Data Storage section)
- Does the author include any code or multimedia resources that will be generated?
- Are the data types using the appropriate, research community-supported formats?
- Are these data formats non-proprietary? If not, is there a plan for generating non-proprietary data?
- If the data types are non-digital, does the author include plans for digitizing the data?
- Does the author indicate which metadata and data standards they will use? Are these appropriate for the discipline? Is/Are these formats machine readable? Does metadata standard match chosen repository guidelines?
- If there is no metadata standard mentioned, does the author explain what information will be contained within their metadata and where it can be found? For example, will certain metadata be in a README file, .txt format, or embedded in a spreadsheet tab?
- Does the author describe software needed to work with data and the metadata?
Data Storage and Preservation
- Does the author describe the hardware, storage environment, and local or external services to be used?
- Is the storage environment appropriate for the described data types?
- Does the author describe who will have access to working data and how access will be managed before and after the grant period?
- Does the author explain how the data will be transferred and shared between collaborators during the award period?
- Does the author explain the back-up plan for data throughout the award period?
- Does the author describe how long the data will be retained or preserved and why?
- Does the author describe hardware or campus or commercial services to be used to assure data preservation? Is this preservation sufficient for the nature of the data?
Data Sharing and Public Access
- Are there any privacy issues for the data types being shared? If so, does the author plan appropriate measures to protect privacy (e.g. measures taken to anonymize data, disposition of data including personally identifiable information)? If there is private or proprietary data, will it be explicitly stated in the data publication or release language?
- Does the author indicate when data collection will be completed?
- Is it clear at which stage the data will be shared, processed or raw?
- If there are legal and ethical requirements that may preclude sharing of any of the data types, does the author describe these circumstances?
- If the data types have commercialization potential that may delay or preclude data sharing, is this described?
- Does the author acknowledge when the data will be shared? Is this within the maximum 30 months of the data collection end date?
- Does the author state at what point the data will be shared, if it can be shared?
- Is there a location from which the data will be made publically available (e.g. a recognized repository capable of public access to, and long-term preservation of the data: data repositories, disciplinary repositories, institutional repositories that follow the FAIR guidelines re3data, fairsharing.org)? Is this the best place for the data given the disciplinary best practices?
- Does the author discuss how users will discover the data (e.g. a specific repository, data citation, project website, Internet search engines, Ag Data Commons catalog). Note that a project website is an excellent assist in discovery of data, but is not considered a viable archival solution.
- Is there a mention of a persistent identifier (e.g. DOI, Handle) to allow for permanent access to the dataset?
- Is the intellectual property of the datasets clear, i.e. who has copyright if applicable
- Does the author describe conditions for reuse of the data by others including any licenses that will be applied (e.g. Creative Commons)? Are these appropriate for the funding program and discipline?
- Does the author acknowledge any data types acquired from third parties and under what conditions they are able to share this data?
Roles and Responsibilities
- Is the person(s) with the primary responsibility for implementing the data management plan outlined? Is this person qualified?
- Is there a plan for transfer of responsibility if key personnel depart from the project?
- Does the author include sufficient budget to cover costs for data management in proposal budget, if those are allowed costs?
Monitoring and Reporting
- Does the author indicate the plan for monitoring and reporting on the implementation of the DMP during and after the award period?