If you want high data quality you must have highly accurate data. To get that you need to be proactive. You need a dedicated, focused
You need to focus on data accuracy. This means you need an organization that is dedicated to improving data accuracy. You also need trained staff
You need to use technology heavily. Achieving high levels of data accuracy requires looking at data and acting on what you see. You need to do a lot of data profiling. You need to have
You need to treat information about your data as of equal or greater importance than the data itself. You must install and maintain a
You need to educate other corporate
Business users of data need to be sensitized to quality issues.
Business analysts must become experts on data quality concepts and play an active role in data quality projects.
Developers need to be taught best practices for database and application design to ensure improved data accuracy.
Data administrators need to be taught the importance of accuracy and how they can help improve it.
All employees who generate data need to be
The executive team needs to understand the value of improved data accuracy and the impact it has on improved information quality.
You need to make quality assurance a part of all data projects. Data quality assurance activities need to be planned along with all of the other activities of the information systems department. Assisting a new project in achieving its data quality goals is of equal or higher value than conducting assessment projects in isolation. The more integrated data quality assurance is with the entire information system function, the more value is realized. And finally, everyone needs to work well together to accomplish the quality goals of the corporation.
Data quality investigations are all designed to surface problems with the data. This is true whether the problems come from stand-alone assessments or through data profiling services to projects. It also does not matter whether assessments reveal problems from an inside-out or an outside-in method. The output of all these efforts is a collection of facts that get consolidated into issues. An issue is a problem with the database that calls for action. In the context of data quality assurance, it is derived from a collection of information that defines a problem that has a single root cause or can be grouped to describe a single course of action.
That is clearly not the end of the data quality effort. Just identifying issues does nothing to improve things. The issues need to drive changes that will improve the quality of the data for the eventual users.
It is important to have a formal process for moving issues from information to action. It is also important to track the progress of issues as they go through this process. The disposition of issues and the results obtained from implementing changes as a result of those issues are the true documentation of the work done and value of the data quality assurance department.
{% if main.adsdop %}{% include 'adsenceinline.tpl' %}{% endif %}
Figure 5.1 shows the phases for managing issues after they are created. It does not matter who
Figure 5.1:
Issue management phases.
An issue management system should be used to
The collection of issues and the management process can