Quiz: Tokens, N-Grams, tf-idf, and Stemming

Test your knowledge of document search strategies, tokenization, n-grams, stemming, and tf-idf.

Natural Language Tools

1

ngrams( ) has two arguments. The first is the document to be broken into n-grams. What is the second?

A)

n = is the length of n-grams in tokens.

B)

width = is the maximum number of letters in the n-gram.

C)

width = is the minimum number of letters in the n-gram.

D)

width = is the minimum number of tokens in the resulting n-grams.

Question 1 of 40 attempted

Get hands-on with 1200+ tech skills courses.