39813c572463211d05f35a0dcac56ba7bd07c725,examples/plot_investigating_dirty_categories.py,,,#,17

Before Change


one_hot_encoded = categorical_encoder.fit_transform(
    sorted_values[:n_obs].reshape(-1, 1))
f3, ax3 = plt.subplots(figsize=(6, 6))
cax3 = ax3.matshow(one_hot_encoded[:n_obs, :n_categories])
f3.colorbar(cax3)
f3.suptitle("Employee Position Title values, one-hot encoded")
ax3.xaxis.tick_bottom()

After Change


// similarities:

f4, ax4 = plt.subplots(figsize=(6, 6))
similarity_encoded = similarity_encoder.fit_transform(employee_position_titles)
cax4 = ax4.matshow(similarity_encoded)
f4.colorbar(cax4)
f4.suptitle("Employee Position Title values, similarity encoded")
ax4.xaxis.tick_bottom()
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 3

Instances


Project Name: dirty-cat/dirty_cat
Commit Name: 39813c572463211d05f35a0dcac56ba7bd07c725
Time: 2018-06-07
Author: pierreglaser@msn.com
File Name: examples/plot_investigating_dirty_categories.py
Class Name:
Method Name:


Project Name: logpai/loglizer
Commit Name: d990f23b72c2409084a799bdf49109a996a02256
Time: 2019-02-17
Author: zhujm.home@gmail.com
File Name: demo/PCA_demo.py
Class Name:
Method Name:


Project Name: scikit-learn-contrib/categorical-encoding
Commit Name: 9e2385f00975bcba7926396c6563eb8488d778f6
Time: 2018-09-02
Author: jan@motl.us
File Name: examples/benchmarking_large/util.py
Class Name:
Method Name: train_encoder