Ensuring data quality: Which rules should you follow?
Data quality rules are requirements that every organization must implement to ensure data quality and integrity. These requirements aim to meet two interdependent objectives:- To define the format to which the data must conform and the relationships that must exist between the data
- To provide a reference for the company to verify and measure its data quality against these requirements
1. Involve the managers of the different departments of the company
The different departments and services of a company have priorities that are not necessarily the same. To develop effective data quality rules, it is essential to define them with the requirements of the managers of each department, in agreement with the various data people.2. Have a reasonable number of rules
Quality rules are a great help in moving towards better data management, but it is crucial to create a reasonable number of them. Searching for a data item with ten rules is not the same as searching for one with 100 rules; the solution should not become a problem! Therefore, you must find the right balance between consistent data quality control and a certain measure of rule implementation.3. Encourage a step-by-step approach
It is not necessary to create rules covering all data right away. Similarly, an organization that is just starting to implement its data governance strategy should not solve the quality rules issue with a snap of the fingers. For good quality management, it is best to identify critical data that needs immediate attention.4. Create rules taking into account each type of data
Many data quality characteristics will allow you to establish rules according to the domain to which a given data belongs. For example, if the "employee's full name" data is critical, indispensable information, whereas the "employee's contact number" data is not necessarily as important. These two data will not have to meet the same quality requirements. While the first data will have to meet the requirements of completeness, uniqueness, and accuracy, the second data will have to be accurate and orderly. These requirements will have to be reflected in the quality rules, for example:- The employee's full name must not be N / Y (for completeness)
- Only one "Employee's full name" must correspond to one "Social Security Number" (to ensure uniqueness)
- The employee's full name must have at least one space, and contain only letters, no numbers or other permitted characters (to ensure accuracy and completeness)
- The employee's phone number must be numeric only (to ensure accuracy and sequence)