Data Cleaning¶
These transformers can be used to reduce the number of columns during the feature selection step.
Base data_cleaning transformer¶
Base data cleaning transformer. |
Off-line data cleaning¶
Drop the categorical columns having a large cardinality. |
|
Drop the columns having a large NaN values ratio. |
|
Drop the categorical columns having a low cardinality. |
Realtime data cleaning¶
Set the datatype of the selected columns to a given datatype. |
|
Drop the columns given by the user. |
|
Drop the columns belonging to a given datatype. |
|
Drop the columns which are not given by the user. |
|
Replace the categorical values by the ones given by the user. |