c3031320afca012e143e2df46f9fe4ed3f3d18d3,ludwig/datasets/mixins/process.py,MultifileJoinProcessMixin,process_downloaded_dataset,#MultifileJoinProcessMixin#,54

Before Change


            all_files.append(file_df)

        concat_df = pd.concat(all_files, ignore_index=True)
        if not os.path.exists(self.processed_dataset_path):
            os.makedirs(self.processed_dataset_path)
        concat_df.to_csv(
            os.path.join(self.processed_dataset_path, self.csv_filename),
            index=False)

After Change


            elif split_name == "test_file":
                file_df["split"] = 2
            else:
                raise ValueError(f"Unrecognized split name: {split_name}")
            all_files.append(file_df)

        concat_df = pd.concat(all_files, ignore_index=True)
        os.makedirs(self.processed_dataset_path, exist_ok=True)
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 5

Instances


Project Name: uber/ludwig
Commit Name: c3031320afca012e143e2df46f9fe4ed3f3d18d3
Time: 2021-02-21
Author: tgaddair@gmail.com
File Name: ludwig/datasets/mixins/process.py
Class Name: MultifileJoinProcessMixin
Method Name: process_downloaded_dataset


Project Name: tensorflow/datasets
Commit Name: 1f65deb60665a460edf5e9238a70a2c597b3a12c
Time: 2020-09-24
Author: epot@google.com
File Name: tensorflow_datasets/core/load.py
Class Name:
Method Name: find_builder_dir


Project Name: deepmipt/DeepPavlov
Commit Name: 7d7c9cfcb200722f256f67337a6a6a827e7b4540
Time: 2019-08-08
Author: ignatov.fedor@gmail.com
File Name: deeppavlov/utils/socket/socket.py
Class Name: SocketServer
Method Name: __init__