39813c572463211d05f35a0dcac56ba7bd07c725,examples/plot_investigating_dirty_categories.py,,,#,17

Before Change


//     "Employee Position Title"].sample(n=n_observations)?
categorical_encoder = CategoricalEncoder(encoding="onehot-dense")
one_hot_encoded = categorical_encoder.fit_transform(
    sorted_values[:n_obs].reshape(-1, 1))
f3, ax3 = plt.subplots(figsize=(6, 6))
cax3 = ax3.matshow(one_hot_encoded[:n_obs, :n_categories])
f3.colorbar(cax3)

After Change



// encoding simply a subset of the observations
n_obs = 20
employee_position_titles = values["Employee Position Title"].head(
    n_obs).to_frame()
categorical_encoder = CategoricalEncoder(encoding="onehot-dense")
one_hot_encoded = categorical_encoder.fit_transform(employee_position_titles)
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 2

Instances


Project Name: dirty-cat/dirty_cat
Commit Name: 39813c572463211d05f35a0dcac56ba7bd07c725
Time: 2018-06-07
Author: pierreglaser@msn.com
File Name: examples/plot_investigating_dirty_categories.py
Class Name:
Method Name:


Project Name: cesium-ml/cesium
Commit Name: 2ab1119596b420cc696a783271d617562ccf7e28
Time: 2017-03-29
Author: brettnaul@gmail.com
File Name: examples/plot_EEG_Example.py
Class Name:
Method Name:


Project Name: CNuge/kaggle-code
Commit Name: b7990885d8b26b9404fd9ce952b0b2f005019594
Time: 2018-01-12
Author: nugentc@uoguelph.ca
File Name: california_housing/feature_engineering.py
Class Name:
Method Name: