Plot method for word_coverage objects — plot.word

Plot cumulative corpus coverage fraction of a dictionary.

# S3 method for word_coverage
plot(
  x,
  include_EOS = FALSE,
  show_limit = TRUE,
  type = "l",
  xlim = c(0, length(x)),
  ylim = c(0, 1),
  xticks = seq(from = 0, to = length(x), by = length(x)/5),
  yticks = seq(from = 0, to = 1, by = 0.25),
  xlab = "Rank",
  ylab = "Covered fraction",
  title = "Cumulative corpus coverage fraction of dictionary",
  subtitle = "_default_",
  ...
)

Arguments

x	a `word_coverage` object.
include_EOS	length one logical. Should End-Of-Sentence tokens be considered in the computation of coverage fraction?
show_limit	length one logical. If `TRUE`, plots an horizontal line corresponding to the total coverage fraction.
type	what type of plot should be drawn, as detailed in `?plot`.
xlim	length two numeric. Extremes of the x-range.
ylim	length two numeric. Extremes of the y-range.
xticks	numeric vector. position of the x-axis ticks.
yticks	numeric vector. position of the y-axis ticks.
xlab	length one character. The x-axis label.
ylab	length one character. The y-axis label.
title	length one character. Plot title.
subtitle	length one character. Plot subtitle; if "default", prints dictionary length and total covered fraction.
...	further arguments passed to or from other methods.

Details

This function generates nice plots of cumulative corpus coverage fractions. The x coordinate in the resulting plot is the word rank in the underlying dictionary; the y coordinate at x is the cumulative coverage fraction for rank <= x.

Author

Valerio Gherardi

Examples

# \donttest{
c <- word_coverage(twitter_dict, twitter_test)
plot(c)
# }