Data Preparation (video)

Summary

Summary

The goal of the Data Preparation phase is to wrangle and enrich data as input for model building.

A DSS recipe is a repeatable set of actions to perform on one or more input datasets, resulting in one or more output datasets.

Visual recipes provide a simple UI for accomplishing the most common data transformations.

Code recipes satisfy needs that are more customized than what a visual recipe can provide.

Plugin recipes allow users to create reusable components that wrap additional functionality into a visual UI, thereby extending the capabilities of DSS to a wider audience. 

Actions in DSS, such as running a recipe (whether it may be visual, code or plugin) or training a model, generate a job.  Wherever possible, DSS pushes down the computation to the underlying location of the data.