4794be6f7e3827228b6e0dc9b1cfe432a3ecdeb3,beginner_source/text_sentiment_ngrams_tutorial.py,,,#,44

Before Change


import os
if not os.path.isdir("./.data"):
	os.mkdir("./.data")
train_dataset, test_dataset = text_classification.DATASETS["AG_NEWS"](
    root="./.data", ngrams=NGRAMS, vocab=None)
BATCH_SIZE = 16
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

After Change



tokenizer = get_tokenizer("basic_english")
train_iter = AG_NEWS(split="train")
counter = Counter()
for (label, line) in train_iter:
    counter.update(tokenizer(line))
vocab = Vocab(counter, min_freq=1)


////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
// The vocabulary block converts a list of tokens into integers.

In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 4

Instances

Link

Project Name: pytorch/tutorials

Commit Name: 4794be6f7e3827228b6e0dc9b1cfe432a3ecdeb3

Time: 2021-03-04

Author: brianjo@fb.com

File Name: beginner_source/text_sentiment_ngrams_tutorial.py

Class Name:

Method Name:

Link

Project Name: RaRe-Technologies/gensim

Commit Name: 680de8d4f35325e7486c07c4e06422929e826b57

Time: 2019-01-10

Author: __Singleton__@hackerdom.ru

File Name: gensim/corpora/lowcorpus.py

Class Name: LowCorpus

Method Name: line2doc

Link

Project Name: pytorch/tutorials

Commit Name: 4794be6f7e3827228b6e0dc9b1cfe432a3ecdeb3

Time: 2021-03-04

Author: brianjo@fb.com

File Name: beginner_source/transformer_tutorial.py

Class Name:

Method Name: