bf03e74b9b0858f2e827e6da4b0ca8d960b3b5b6,soynlp/pos/_news_pos.py,NewsPOSExtractor,_extract_compound_nouns,#NewsPOSExtractor#,303

Before Change


                if word in words:
                    words[word] = max(0, words.get(word, 0) - count)

        words = {word:count for word, count in words.items()
                 if (not (word in nouns_)) and (count > 0)}

        return nouns_, words

After Change



        tokenizer = MaxScoreTokenizer(scores = {noun:1 for noun in nouns if len(noun) > 1})

        compounds, removals = {}, set()
        for word, count in eojeols.items():
            // format: [(word, begin, end, score, length)]
            tokens = tokenizer.tokenize(word, flatten=False)[0]
            noun = parse_compound(tokens)
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 4

Instances


Project Name: lovit/soynlp
Commit Name: bf03e74b9b0858f2e827e6da4b0ca8d960b3b5b6
Time: 2018-11-13
Author: soy.lovit@gmail.com
File Name: soynlp/pos/_news_pos.py
Class Name: NewsPOSExtractor
Method Name: _extract_compound_nouns


Project Name: IDSIA/sacred
Commit Name: 82d573a8775982e3c601d690bdf934c54cc07d54
Time: 2019-10-10
Author: gabrieldemarmiesse@gmail.com
File Name: sacred/experiment.py
Class Name: Experiment
Method Name: get_default_options


Project Name: explosion/thinc
Commit Name: 3d12c536e66ff4414860b8de1467fe56ede54063
Time: 2019-11-28
Author: adrianeboyd@gmail.com
File Name: thinc/neural/_classes/model.py
Class Name: Model
Method Name: define_operators