d78a9d2692be1e7a32d157cf870126fe7abe6643,pmlb/pmlb.py,,fetch_data,#,198
Before Change
if return_X_y == True: A tuple of NumPy arrays containing (features, labels)
if dataset_name not in dataset_names:
raise ValueError ("Data set not found in PMLB." )
dataset_url = "https://github.com/EpistasisLab/penn-ml-benchmarks/raw/master/datasets/{DATASET_NAME}/{DATASET_NAME}.csv.gz" .format(DATASET_NAME=dataset_name)
if local_cache_dir is None:
dataset = pd.read_csv(dataset_url, sep="\t" , compression="gzip" )
After Change
if return_X_y == True: A tuple of NumPy arrays containing (features, labels)
if dataset_name in classification_dataset_names:
data_type = "classification"
elif dataset_name in regression_dataset_names:
data_type = "regression"
else :
raise ValueError ("Data set not found in PMLB." )
dataset_url = "{GITHUB_URL}/{DATA_TYPE}/{DATASET_NAME}/{DATASET_NAME}{SUFFIX}" .format(GITHUB_URL=GITHUB_URL,
DATA_TYPE=data_type,
DATASET_NAME=dataset_name,
SUFFIX=suffix
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 7
Instances Project Name: EpistasisLab/penn-ml-benchmarks
Commit Name: d78a9d2692be1e7a32d157cf870126fe7abe6643
Time: 2017-12-27
Author: weixuanf@pennmedicine.upenn.edu
File Name: pmlb/pmlb.py
Class Name:
Method Name: fetch_data
Project Name: BVLC/caffe
Commit Name: 96c2fe1de80c9752b992c4578a3ce46028d21fc5
Time: 2015-07-23
Author: jonlong@cs.berkeley.edu
File Name: python/caffe/net_spec.py
Class Name: Function
Method Name: _get_name
Project Name: nicodv/kmodes
Commit Name: 2bc7fcee8799b6cb67f1f88eee50b5a033572359
Time: 2019-06-05
Author: nico.devos@auto-grid.com
File Name: kmodes/tests/test_common.py
Class Name:
Method Name: test_non_meta_estimators