identified costs, 111
identifier columns, 194-195
impacts
  already happening, 86
  assessing, 85-87
  documenting, 87
  not yet happening, 87
  See also issues
inaccurate data
  acceptance of, 8-9
  blame for, 9-10
  cause investigation, 87-94
  clustering of, 35
  cost, 9, 15
  distribution of, 32-34
  facts, recording, 207-209
  finding, 35-40
  impact, 12-14
  impact assessment, 85-87
  increase in, 8
  reasons for, 34
  remedy development, 94-99
  tolerance levels, 40-41
inaccurate data sources, 43-64
  areas, 43
  data accuracy decay, 50-52
  initial data entry, 44-49
  moving/restructuring data, 52-62
  problem scope, 63
  using data, 62-63
inclusive relationship, 203
inconsistencies
  actual point, finding, 167
  change-induced, 30
  finding with programmatic methods, 167
  object-level, 32
  reasons for, 167
information gathering (column property analysis), 153
information gathering (structure analysis), 189-191
  commonsense speculation, 191
  data models, 189-190
  database definitions, 190
  metadata repositories, 189
information products, conversion to, 93-94
information systems
  complexity, 7
  continuous evolution of, 5-8
  importance, 105
information technology
  evolution, 10
  management, 11
inside-out method, 72-73
  analysis, 72
  defined, 72
  inaccurate data evidence, 73
  outside-in method comparison, 74-75
  problem catching, 74
  rule set, 72
  See also data quality assurance methods
integration, 7, 62
intended use decisions, 169
investigations, 80, 81
  cause, 87-94
  facts, 81
  outside-in, 86
issue collection, 81-85
  metrics, 82-85
  output, 85
issue management, 80-102
  cause investigation, 87-94
  impact assessment, 85-87
  phases, 81
  post-implementation monitoring, 99-101
  remedy development, 94-99
  remedy implementation, 99
  summary, 101-102
issues
  crystallizing, 87
  data profiling repository, 278
  defined, 85
  impact assessment of, 85-87
  life-span, 101
  recording, 85
  tracking, 101