Data Cleaning

Excel supports a number of data cleaning operations, including:

  • Renaming columns
  • Removing columns
  • Assessing data quality
  • Finding and replacing invalid values

Each of these operations, and more, are handled as individual steps in Dataiku’s Prepare recipe. The Prepare recipe’s script provides a history of the cleaning and enrichment actions taken, and allows you to quickly re-apply the data preparation when new data arrives. See the Prepare Recipe: Basics video below for an introduction to handling these tasks in Dataiku.

For more information on data preparation in Dataiku, please consult the reference documentation.