ce493b09494c007b27c6fbc5944636136fedd6e3,Orange/preprocess/preprocess.py,RemoveSparse,__call__,#RemoveSparse#Any#,593

Before Change


                sparsness = data.X.shape[0] - np.count_nonzero(data.X, axis=0)
        else: // filter by nans
            if sp.issparse(data.X):
                sparsness = np.sum(np.isnan(data.X.data), axis=0)
            else:
                sparsness = np.sum(np.isnan(data.X), axis=0)
        att = [a for a, s in zip(data.domain.attributes, sparsness) if s <= tailored_threshold]
        domain = Orange.data.Domain(att, data.domain.class_vars,

After Change


        else: // filter by nans
            if sp.issparse(data.X):
                data_csc = sp.csc_matrix(data.X)
                sparsness = [np.sum(np.isnan(data.X[:, i].data)) for i in range(data_csc.shape[1])]
            else:
                sparsness = np.sum(np.isnan(data.X), axis=0)
        att = [a for a, s in zip(data.domain.attributes, sparsness) if s <= tailored_threshold]
        domain = Orange.data.Domain(att, data.domain.class_vars,
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 4

Instances


Project Name: biolab/orange3
Commit Name: ce493b09494c007b27c6fbc5944636136fedd6e3
Time: 2019-12-13
Author: kovacic.andreja@gmail.com
File Name: Orange/preprocess/preprocess.py
Class Name: RemoveSparse
Method Name: __call__


Project Name: janfreyberg/superintendent
Commit Name: cb266bbe1ce215ec56cc54ee009ea3a2a055b6d8
Time: 2018-05-31
Author: janfreyberg@users.noreply.github.com
File Name: superintendent/semisupervisor.py
Class Name: SemiSupervisor
Method Name: annotate


Project Name: biolab/orange3
Commit Name: b39db6e2bf0c806554578b5c9895d9b7ef6bd48c
Time: 2017-09-09
Author: pavlin.g.p@gmail.com
File Name: Orange/statistics/util.py
Class Name:
Method Name: _count_nans_per_row_sparse