IBM Cloud Pak for Data Course

Course Overview



Preparation and transformation of data with IBM Data Refinery is the third course in the learning path for professional data scientists who work with the IBM Cloud Pak for Data platform. The course aims to familiarize data scientists with the data cleansing and data shaping capabilities of the Data Refinery tool. Data Refinery saves preparation time by quickly transforming large amounts of raw data into consumable, high-quality information that's ready for analytics.

Learners follow the story of Sara (the data scientist), Muneiza (the data engineer), Liam (the data steward), and Tim (the data quality analyst) working in the Data Analytics department of a large health products company. The company plans a marketing campaign around coupons that are issued to customers and wants to better understand customer behavior. But they first need to access and prepare the relevant data for analytics. The team will mainly use IBM Data Refinery for this task.

Learners follow Sara, Muneiza, Liam, and Tim's story as they create a data set that is ready for analytics. They verify their acquired knowledge by completing several hands-on lab exercises in a remote classroom environment (provided to each learner during the course introduction).


Data Engineers, Data Scientists, Data Quality Analysts, Business Analysts

Skills Gained

After completing this course, you should be able to:

  • Outline the role of the Data Refinery tool in the ModelOps process.
  • Use Data Refinery to profile data.
  • Construct various data visualizations.
  • Use Data Refinery to analyze and transform data into consumable, high-quality information ready for analytics and machine learning.
  • Implement Data Refinery management tasks.


The prerequisite skills and knowledge include:

  • Basic knowledge of data wrangling and the ModelOps process
  • Experience working with tables and databases
  • Ability to navigate the Cloud Pak for Data and Watson Studio graphical user interfaces
  • Practical experience with routine data management tasks


  • Introduction to Data Refinery
  • Connect to your data
  • Profile and visualize your data
  • Analyze and transform your data
  • Manage Data Refinery flows


Thinking about Onsite?

If you need training for 3 or more people, you should ask us about onsite training. Putting aside the obvious location benefit, content can be customised to better meet your business objectives, and more can be covered than in a public classroom. It's a cost-effective option. One-on-one training can be delivered too, at reasonable rates.

Submit an enquiry from any page on this site and use the requirements box to let us know you are interested, or simply mention it when we contact you.

All $ prices are in USD unless it’s a NZ or AU date

SPVC = Self Paced Virtual Class

LVC = Live Virtual Class

Please Note: All courses are available as Live Virtual Classes

Trusted by over half a million students in 15 countries

Our clients have included prestigious national organisations such as Oxford University Press, multi-national private corporations such as JP Morgan and HSBC, as well as public sector institutions such as the Department of Defence and the Department of Health.