4e144c9f842d7415d8be5bdbb5912d88ae32cced,pycorrector/seq2seq/corpus_reader.py,CGEDReader,read_tokens,#CGEDReader#,96
Before Change
childNodes[0].data.strip()
else:
// Input the correct text
sentence = doc.getElementsByTagName("CORRECTION")[0]. \
childNodes[0].data.strip()
yield segment(sentence, cut_type="char")
After Change
def read_tokens(self, path, is_infer=False):
i = 0
with open(path, "r", encoding="utf-8") as f:
for line in f:
// Input the correct text, which start with 0
if i % 2 == 1:
if line and len(line) > 5:
yield line.lower()[5:].strip().split()
i += 1
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 5
Instances
Project Name: shibing624/pycorrector
Commit Name: 4e144c9f842d7415d8be5bdbb5912d88ae32cced
Time: 2018-04-16
Author: 507153809@qq.com
File Name: pycorrector/seq2seq/corpus_reader.py
Class Name: CGEDReader
Method Name: read_tokens
Project Name: janfreyberg/superintendent
Commit Name: b43b46e835c403a752286ca2c612891d345045db
Time: 2018-05-09
Author: jan.freyberg@gmail.com
File Name: superintendent/iterator_functions.py
Class Name:
Method Name: _default_data_iterator
Project Name: coala/coala-bears
Commit Name: bfd61fb7a0c4456ce812a227f3b1962b2c727879
Time: 2016-09-03
Author: abdealikothari@gmail.com
File Name: bears/general/KeywordBear.py
Class Name: KeywordBear
Method Name: run