Objectifs : • List the common data quality contaminants
• Describe each of the following processes:
- Investigation
- Standardization
- Match
- Survivorship
• Describe QualityStage architecture
• Describe QualityStage clients and their functions
• Import metadata
• Build and run DataStage/QualityStage jobs, review results
• Build Investigate jobs
• Use Character Discrete, Concatenate, and Word Investigations to analyze data fields
• Describe the Standardize stage
• Identify Rule Sets
• Build jobs using the Standardize stage
• Interpret standardization results
• Investigate unhandled data and patterns
• Build a QualityStage job to identify matching records
• Apply multiple Match passes to increase efficiency
• Interpret and improve match results
• Build a QualityStage Survive job that will consolidate matched records into a single master record
• Build a single job to match data using a Two-Source match