same-table synonyms, 205-207
  with data not in first-normal form, 206
  primary/foreign key, 205-206
  See also synonyms
sampling, 126
schema definition, 272
second-normal form, 181
simple data rule analysis, 134-135, 215-236
  consolidations, 231-232
  data gathering, 221-224
  decision support systems, 232
  definitions, 216-220
  mapping with other applications, 230-232
  migrating to new applications and, 230-231
  output validation, 225
  paralysis, 235
  process, 220-225
  process illustration, 221
  summary, 235-236
  testing, 224-225
  See also complex data rule analysis; data rule analysis
simple data rules
  checked during data entry, 233
  checked during transaction processing, 233
  checkers for periodic checks, 234
  checkers for transactions, 234
  dates, 226-227
  deferred for execution, 233
  derived-value, 229
  durations, 227
  evaluation, 232-234
  example, 269-270
  execution, 225
  multiple-row/same columns, 230
  no need to be checked, 234
  object subgrouping columns, 227-228
  remedies, 232-235
  types of, 226-230
  work flow, 228-229
single-source tables, 210-211
skip-over rules, 162
soft data rules, 218, 238
software support, 139-140
software tools, 18-22
  data cleansing, 21-22
  data monitoring, 20-21
  data profiling, 20
  DBMS, 22
  emergence of, 18
  metadata repositories, 19
  See also data quality assurance
source code scavenging, 221-222, 239
source databases, 54
  denormalization, 59
  fixed, 212
  improving, 169-170
  matching to target databases, 56-57
special domains, 164-165
  benefit, 164-165
  examples, 164
  See also column properties
speculation
  functional dependencies, 193-194
  gathering complex data rules through, 240
  gathering simple data rules through, 224
  in structure analysis, 191
split table
  form, 184
  relationship, 207
stand-alone assessments, 77
storage properties, 157-160
  length, 159
  mapping and, 168
  physical data type, 157-159
  precision, 159-160
structural matching, 57
structure analysis, 38, 132-134, 173-214
  data discovery, 191-192
  defined, 38, 132
  definitions, 173-187
  elements, 133
  extraction and, 187-188
  functional dependencies, 174-176
  information gathering, 189-191
  issues, 173
  keys, 176-181
  normal forms, 181-184
  process, 188-193
  process illustration, 189
  results verification, 192-193
  summary, 213-214
  synonyms, 184-187
  uses, 39, 134
  violations, 134
  See also data profiling
structure rules, 173, 193-210
  consequences, 173
  example, 266-268
structure-level remedies, 212-213
subgrouping rules, 227-228
subsetting, 220
substitution correction, 21
synonyms, 184-187
  analysis, 184
  analyzing, in same table, 205-207
  candidate, determining, 199-202
  characteristics, determining, 205
  data profiling repository, 276
  defined, 184
  degree of overlap, 203
  domain, 186-187
  finding, 199-209
  inclusive relationship, 203
  merge, 187
  value correspondence, 220-223
  violations, 208
system problems, 48-49