Data Pipelines

Learn how to create a Flow whose output is a dataset that can be shared with other projects or used outside of Dataiku.

About this course

If you have learned how to manipulate data in the Basics courses, you’re ready to build a more complex data pipeline.

In this course, you will create a Flow whose output is a dataset that can be shared with other projects or used outside of Dataiku. After making changes to recipes in the Flow, you will learn how to propagate the resulting new columns to the end of the Flow.

Curriculum

  • Data Pipelines
  • Update the Prepare Recipe
  • Create a New Branch in the Flow
  • Using the Window Recipe to Compute Customer Ranks by Year
  • Pivot Ranks from Rows into Columns
  • Join Pivoted Data to Customer Data
  • Propagate Schema Changes
  • Build the Final Dataset
  • Build Datasets
