K has 6 defined data quality dimensions in the Data Quality collection.
Data quality dimensions are measurement attributes of data that can help you assess the quality of your data and identify opportunities to improve data trust.
Accuracy
Accuracy is the degree to which data correctly reflects a real world object or an event being described. Accurate data should be verified against an authentic source.
Key questions include: Are there incorrect spellings of product or person names, addresses, and even untimely or not current data?
Completeness
Completeness is defined as the user's expected comprehensiveness. Data can be complete even if optional data is missing. As long as the data meets expectations, then the data is considered complete.
Key questions include: Is all the requisite information available? Do any data values have missing elements? Or are they in an unusable state?
Consistency
Consistency means data across all systems reflects the same information and are in sync with each other across the enterprise (e.g. customer address). Data consistency is often associated with data accuracy, and any data set scoring high on both will be a high-quality data set.
Key questions include: Are data values the same across the data sets? Are there any distinct occurrences of the same data instances that provide conflicting information?
Integrity
Integrity indicates that the attributes are maintained correctly, even as data gets stored and used in diverse systems. Data integrity ensures that all enterprise data can be traced and connected to other data.
Key questions include: Is there any data missing important relationship linkages?
Timeliness
Timeliness references whether information is available when it is expected and needed based on the user's expectation.
Key questions include: Is the data available when you need it?
Uniqueness
Uniqueness is the most critical dimension for ensuring no duplication or overlaps. Data uniqueness is measured against all records within a data set or across data sets and indicates if it is a single recorded instance in the data set used.
Key questions include: Are there any duplicates within your data?
You can add additional dimensions by going to the Data Quality Dimension collection page and clicking Add.