Data Quality

Datasets available on the National Environmental Data Centre follow best practice data management standards, including the New Zealand Data and Information Management Principles, the FAIR Principles, and the CARE Principles for Indigenous Data Governance.

Datasets will aim to display a ranking across eight criteria to help communicate overarching completeness and fit for purpose. See below for the data quality criteria, ranking, and their descriptions.

Some of the datasets may also have links to standards-compliant metadata and conform to international and national domain standards. They may also have information on their national importance and applicability.

To guide you on data quality and how a dataset has met the criteria for quality we have developed a visual data quality score.

 

Data Quality criteria and descriptions

  Good Medium Poor
Accessible (AC) Data are accessible from a public facing server Data are accessible via personal contact or intermediate application Data are not readily accessible
Common Format (CF) Data are available in a commonly used format (syntax) Data are available in a proprietary format that requires specific software Data are in a format that requires software that is not readily available
Data Lineage Statement (DL) Data are well described in terms of their origins, modifications and processing history Data are inadequately described in terms of their origins, modifications and processing history Data are not described in terms of their origins, modifications or processing history
Data Quality Statement (DQ) Data are well described in terms of their content quality, completeness and consistency Data are not well established in terms of their content quality, completeness and consistency Data lack any description of their content quality, completeness or consistency
Governance (GO) The contributing organisation (CRI) has ownership or recognised stewardship of the data The contributing organisation (CRI) has ambiguous ownership or recognised stewardship of the data The contributing organisation (CRI) does not state ownership or recognised stewardship of the data
Identifiers (ID) Both metadata and the data are described with a unique identifier Only one of the metadata or the data are described with a unique identifier Neither the metadata nor the data are described with a unique identifier
Terms of use (TU) A data licence is easily accessible and understandable A data license does not exist but conditions of use are explained to some extent A data licence does not exist nor conditions of use explained
Last Updated (LU) A last updated statement is available The last updated statement is ambiguous A last updated statement is unavailable