Data Transformation

Ha Khanh Nguyen (hknguyen)

1. Removing Duplicates


2. Transforming Data Using a Function or Mapping


3. Replacing Values


4. Discretization and Binning

4.1 Specified Bins

4.2 Number of bins

4.3 Cut using quantiles


5. Detecting and Filtering Outliers

This lecture notes reference materials from Chapter 7 of Wes McKinney's Python for Data Analysis 2nd Ed.