IBM InfoSphere QualityStage Essentials v11.5 KM213G

Duration: 
4 days
Codes: 
KM213G
Versions: 
V11.5

Overview

This course teaches how to build QualityStage parallel jobs that investigate, standardize, match, and consolidate data records. Students will gain experience by building an application that combines customer data from three source systems into a single master customer record.

Audience

  • Data Analysts responsible for data quality using QualityStage
  • Data Quality Architects

Data Cleansing Developers

Skills Gained

List the common data quality contaminants

Describe each of the following processes:

§Investigation

§Standardization

§Match

§Survivorship

Describe QualityStage architecture

Describe QualityStage clients and their functions

Import metadata

Build and run DataStage/QualityStage jobs, review results

Build Investigate jobs

Use Character Discrete, Concatenate, and Word Investigations to analyze data fields

Describe the Standardize stage

Identify Rule Sets

Build jobs using the Standardize stage

Interpret standardization results

Investigate unhandled data and patterns

Build a QualityStage job to identify matching records

Apply multiple Match passes to increase efficiency

Interpret and improve match results

Build a QualityStage Survive job that will consolidate matched records into a single master record

Build a single job to match data using a Two-Source match

Prerequisites

Participants should have:

  • Familiarity with the Windows operating system

Familiarity with a text editor

Helpful, but not required, would be some understanding of elementary statistics principles such as weighted averages and probability.

Course Outline

1. Data Quality Issues

  • Listing the common data quality contaminants
  • Describing data quality processes 2. QualityStage Overview
  • Describing QualityStage architecture
  • Describing QualityStage clients and their functions 3. Developing with QualityStage
  • Importing metadata
  • Building DataStage/QualityStage Jobs
  • Running jobs
  • Reviewing results 4. Investigate
  • Building Investigate jobs
  • Using Character Discrete, Concatenate, and Word Investigations to analyze data fields
  • Reviewing results 5. Standardize
  • Describing the Standardize stage
  • Identifying Rule Sets
  • Building jobs using the Standardize stage
  • Interpreting standardize results
  • Investigating unhandled data and patterns 6. Match
  • Building a QualityStage job to identify matching records
  • Applying multiple Match passes to increase efficiency
  • Interpreting and improving Match results 7. Survive
  • Building a QualityStage survive job that will consolidate matched records into a single master record 8. Two-Source Match

Building a QualityStage job to match data using a reference match

Related Courses

 

Thinking about Onsite?

If you need training for 3 or more people, you should ask us about onsite training. Putting aside the obvious location benefit, content can be customised to better meet your business objectives and more can be covered than in a public classroom. It's a cost effective option.

Submit an enquiry from any page on this site, and let us know you are interested in the requirements box, or simply mention it when we contact you.

ITILv3, RESILIA, PRINCE2, PRINCE2 Agile, AgileSHIFT, MSP, M_o_R, P3M3, P3O, MoP, MoV courses on this page are offered by QA, ATO of AXELOS Limited. ITIL, RESILIA, PRINCE2, PRINCE2 Agile, AgileSHIFT, MSP, M_o_R, P3M3, P3O,MoP, MoV are registered trademarks of AXELOS Limited. All rights reserved.

Upcoming Dates

  • GREEN This class is Guaranteed To Run.
  • SPVC - Self-Paced Virtual Class.
  • Click a Date to Enroll.
Course Location Days Cost Date
Live Virtual Live Virtual4 3000 £3000 2019-09-23
Live Virtual Live Virtual4 3000 £3000 2019-10-29
Live Virtual Live Virtual4 3000 £3000 2019-11-25
Live Virtual Live Virtual4 3000 £3000 2019-12-16
Live Virtual Live Virtual4 3000 £3000 2020-01-28