Index_S

S

same-table synonyms, 205-207

with data not in first-normal form, 206

primary/foreign key, 205-206

See also synonyms

sampling, 126

schema definition, 272

second-normal form, 181

simple data rule analysis, 134-135, 215-236

consolidations, 231-232

data gathering, 221-224

decision support systems, 232

definitions, 216-220

mapping with other applications, 230-232

migrating to new applications and, 230-231

output validation, 225

paralysis, 235

process, 220-225

process illustration, 221

summary, 235-236

testing, 224-225

See also complex data rule analysis; data rule analysis

simple data rules

checked during data entry, 233

checked during transaction processing, 233

checkers for periodic checks, 234

checkers for transactions, 234

dates, 226-227

deferred for execution, 233

derived-value, 229

durations, 227

evaluation, 232-234

example, 269-270

execution, 225

multiple-row/same columns, 230

no need to be checked, 234

object subgrouping columns, 227-228

remedies, 232-235

types of, 226-230

work flow, 228-229

single-source tables, 210-211

skip-over rules, 162

soft data rules, 218, 238

software support, 139-140

software tools, 18-22

data cleansing, 21-22

data monitoring, 20-21

data profiling, 20

DBMS, 22

emergence of, 18

metadata repositories, 19

See also data quality assurance

source code scavenging, 221-222, 239

source databases, 54

denormalization, 59

fixed, 212

improving, 169-170

matching to target databases, 56-57

special domains, 164-165

benefit, 164-165

examples, 164

See also column properties

speculation

functional dependencies, 193-194

gathering complex data rules through, 240

gathering simple data rules through, 224

in structure analysis, 191

split table

form, 184

relationship, 207

stand-alone assessments, 77

storage properties, 157-160

length, 159

mapping and, 168

physical data type, 157-159

precision, 159-160

structural matching, 57

structure analysis, 38, 132-134, 173-214

data discovery, 191-192

defined, 38, 132

definitions, 173-187

elements, 133

extraction and, 187-188

functional dependencies, 174-176

information gathering, 189-191

issues, 173

keys, 176-181

normal forms, 181-184

process, 188-193

process illustration, 189

results verification, 192-193

summary, 213-214

synonyms, 184-187

uses, 39, 134

violations, 134

See also data profiling

structure rules, 173, 193-210

consequences, 173

example, 266-268

structure-level remedies, 212-213

subgrouping rules, 227-228

subsetting, 220

substitution correction, 21

synonyms, 184-187

analysis, 184

analyzing, in same table, 205-207

candidate, determining, 199-202

characteristics, determining, 205

data profiling repository, 276

defined, 184

degree of overlap, 203

domain, 186-187

finding, 199-209

inclusive relationship, 203

merge, 187

value correspondence, 220-223

violations, 208

system problems, 48-49



Data Quality(c) The Accuracy Dimension
Data Quality: The Accuracy Dimension (The Morgan Kaufmann Series in Data Management Systems)
ISBN: 1558608915
EAN: 2147483647
Year: 2003
Pages: 133
Authors: Jack E. Olson

flylib.com © 2008-2017.
If you may any questions please contact us: flylib@qtcs.net