All functions

as_sbo_dictionary()

Coerce to dictionary

babble()

Babble! Generate random text from a Stupid Back-off language model

eval_sbo_predictor()

Evaluate Stupid Back-off next-word predictions

kgram_freqs() sbo_kgram_freqs() kgram_freqs_fast() sbo_kgram_freqs_fast()

k-gram frequency tables

plot(<word_coverage>)

Plot method for word_coverage objects

predict(<sbo_kgram_freqs>)

Predict method for k-gram frequency tables

predict(<sbo_predictor>)

Predict method for Stupid Back-off text predictor

preprocess()

Preprocess text corpus

prune()

Prune k-gram objects

sbo_dictionary() dictionary()

Dictionaries

sbo_predictor() predictor() sbo_predtable() predtable()

Stupid Back-off text predictions

tokenize_sentences()

Sentence tokenizer

twitter_dict

Dictionary of the 1000 most frequent words in the Twitter training set

twitter_freqs

k-gram frequencies from the Twitter training set

twitter_predtable

Next-word prediction tables from a 3-gram model trained on the Twitter training set

twitter_test

Twitter test set

twitter_train

Twitter training set

word_coverage()

Word coverage fraction
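
Several entries above refer to Stupid Back-off scoring built on k-gram frequency tables. As a rough illustration of the idea only, and not of this package's implementation or API, the scoring rule can be sketched in a few lines of Python. Every name below (kgram_counts, sbo_score, predict_next) is hypothetical, and the back-off penalty of 0.4 is an assumed default, the value commonly quoted for this scheme.

```python
from collections import Counter

LAMBDA = 0.4  # assumed back-off penalty, not read from any package default


def kgram_counts(sentences, order):
    """Count every k-gram (k = 1..order) in a list of tokenized sentences."""
    counts = Counter()
    for tokens in sentences:
        for k in range(1, order + 1):
            for i in range(len(tokens) - k + 1):
                counts[tuple(tokens[i:i + k])] += 1
    return counts


def sbo_score(word, context, counts, total_words, lam=LAMBDA):
    """Score `word` after the tuple `context`, backing off to shorter contexts."""
    penalty = 1.0
    while context:
        seen = counts[context + (word,)]
        if seen > 0:
            # The k-gram was observed: use its relative frequency.
            return penalty * seen / counts[context]
        # Otherwise drop the oldest context word and apply the penalty.
        context = context[1:]
        penalty *= lam
    # No context left: fall back to the unigram relative frequency.
    return penalty * counts[(word,)] / total_words


def predict_next(context, counts, vocab, total_words, order, top=3):
    """Return the `top` highest-scoring words, ties broken alphabetically."""
    # Only the last (order - 1) words can match a stored k-gram.
    context = tuple(context)[-(order - 1):] if order > 1 else ()
    scores = {w: sbo_score(w, context, counts, total_words) for w in vocab}
    return sorted(vocab, key=lambda w: (-scores[w], w))[:top]


# Toy corpus, already split into sentences and tokens.
corpus = [["i", "love", "you"], ["i", "love", "cats"], ["you", "love", "cats"]]
counts = kgram_counts(corpus, order=3)
vocab = sorted(g[0] for g in counts if len(g) == 1)
total_words = sum(c for g, c in counts.items() if len(g) == 1)
print(predict_next(["you", "love"], counts, vocab, total_words, order=3, top=2))
```

The scores are relative frequencies scaled by a penalty, not normalized probabilities, which is what makes the scheme cheap to compute. A real predictor would also need preprocessing, sentence tokenization, and handling of out-of-dictionary words, which this sketch leaves out.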