7247571ab448f9ccf6b392a26df0b8b40b1085eb,2018-asr-attention/librispeech/full-setup-attention/tools/collect-train-text.py,,,#,4

Before Change


zip_files = ["%s/%s" % (zip_dir, fn) for fn in zip_files]
assert all([os.path])

for fn in sorted(glob("train-*/*/*/*.trans.txt")):
    for l in open(fn).read().splitlines():
        seq_name, txt = l.split(" ", 1)
        print(txt)

After Change


    assert isinstance(info, ZipInfo)
    path = info.filename.split("/")
    assert path[0] == "LibriSpeech", "does not expect %r (%r)" % (info, info.filename)
    if path[1].startswith("train-"):
      subdir = path[1]  // e.g. "train-clean-100"
      if path[-1].endswith(".trans.txt"):
        for l in zip_file.read(info).decode("utf8").splitlines():
          seq_name, txt = l.split(" ", 1)
          print(txt)

In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 5

Instances

Link

Project Name: rwth-i6/returnn-experiments

Commit Name: 7247571ab448f9ccf6b392a26df0b8b40b1085eb

Time: 2018-05-16

Author: zeyer@i6.informatik.rwth-aachen.de

File Name: 2018-asr-attention/librispeech/full-setup-attention/tools/collect-train-text.py

Class Name:

Method Name:

Link

Project Name: ina-foss/inaSpeechSegmenter

Commit Name: 688e56c09d028c0a956d1434ee16c60aea820de1

Time: 2018-04-19

Author: eliottlechapt@yahoo.fr

File Name: scripts/ina_speech_segmenter.py

Class Name:

Method Name:

Link

Project Name: GoogleCloudPlatform/healthcare

Commit Name: db2a9d3a62e36db12066c980d180377fffb31fcc

Time: 2018-11-29

Author: noreply@google.com

File Name: deploy/rule_generator/rule_generator.py

Class Name:

Method Name: run