d78a9d2692be1e7a32d157cf870126fe7abe6643,pmlb/pmlb.py,,fetch_data,#,198

Before Change


        if return_X_y == True: A tuple of NumPy arrays containing (features, labels)

    
    if dataset_name not in dataset_names:
        raise ValueError("Data set not found in PMLB.")

    dataset_url = "https://github.com/EpistasisLab/penn-ml-benchmarks/raw/master/datasets/{DATASET_NAME}/{DATASET_NAME}.csv.gz".format(DATASET_NAME=dataset_name)

    if local_cache_dir is None:
        dataset = pd.read_csv(dataset_url, sep="\t", compression="gzip")

After Change


        if return_X_y == True: A tuple of NumPy arrays containing (features, labels)

    
    if dataset_name in classification_dataset_names:
        data_type = "classification"
    elif dataset_name in regression_dataset_names:
        data_type = "regression"
    else:
        raise ValueError("Data set not found in PMLB.")

    dataset_url = "{GITHUB_URL}/{DATA_TYPE}/{DATASET_NAME}/{DATASET_NAME}{SUFFIX}".format(GITHUB_URL=GITHUB_URL,
                                DATA_TYPE=data_type,
                                DATASET_NAME=dataset_name,
                                SUFFIX=suffix
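
The After Change snippet is cut off before the closing parenthesis of the `format` call. As a rough, self-contained sketch of the same lookup-then-build-URL pattern, the logic might look like the following. The function name `build_dataset_url`, the value of `GITHUB_URL`, the default suffix, and the dataset name lists are all placeholders assumed for illustration; none of them are taken from the commit.

```python
# Placeholder values: the real constants are not shown in the snippet above.
GITHUB_URL = "https://example.invalid/datasets"
classification_dataset_names = ["iris", "adult"]
regression_dataset_names = ["1027_ESL"]


def build_dataset_url(dataset_name, suffix=".tsv.gz"):
    """Hypothetical helper: choose the data type sub-directory, then build the URL."""
    if dataset_name in classification_dataset_names:
        data_type = "classification"
    elif dataset_name in regression_dataset_names:
        data_type = "regression"
    else:
        # Same error message as in the snippet above.
        raise ValueError("Data set not found in PMLB.")

    return "{GITHUB_URL}/{DATA_TYPE}/{DATASET_NAME}/{DATASET_NAME}{SUFFIX}".format(
        GITHUB_URL=GITHUB_URL,
        DATA_TYPE=data_type,
        DATASET_NAME=dataset_name,
        SUFFIX=suffix,
    )


print(build_dataset_url("iris"))
```

Compared with the Before Change version, the membership test now serves two purposes: it validates the name and it selects the sub-directory that goes into the URL.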
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 7

Instances


Project Name: EpistasisLab/penn-ml-benchmarks
Commit Name: d78a9d2692be1e7a32d157cf870126fe7abe6643
Time: 2017-12-27
Author: weixuanf@pennmedicine.upenn.edu
File Name: pmlb/pmlb.py
Class Name:
Method Name: fetch_data


Project Name: BVLC/caffe
Commit Name: 96c2fe1de80c9752b992c4578a3ce46028d21fc5
Time: 2015-07-23
Author: jonlong@cs.berkeley.edu
File Name: python/caffe/net_spec.py
Class Name: Function
Method Name: _get_name


Project Name: nicodv/kmodes
Commit Name: 2bc7fcee8799b6cb67f1f88eee50b5a033572359
Time: 2019-06-05
Author: nico.devos@auto-grid.com
File Name: kmodes/tests/test_common.py
Class Name:
Method Name: test_non_meta_estimators