Plot cumulative corpus coverage fraction of a dictionary.
# S3 method for word_coverage plot( x, include_EOS = FALSE, show_limit = TRUE, type = "l", xlim = c(0, length(x)), ylim = c(0, 1), xticks = seq(from = 0, to = length(x), by = length(x)/5), yticks = seq(from = 0, to = 1, by = 0.25), xlab = "Rank", ylab = "Covered fraction", title = "Cumulative corpus coverage fraction of dictionary", subtitle = "_default_", ... )
x | a |
---|---|
include_EOS | length one logical. Should End-Of-Sentence tokens be considered in the computation of coverage fraction? |
show_limit | length one logical. If |
type | what type of plot should be drawn, as detailed in |
xlim | length two numeric. Extremes of the x-range. |
ylim | length two numeric. Extremes of the y-range. |
xticks | numeric vector. position of the x-axis ticks. |
yticks | numeric vector. position of the y-axis ticks. |
xlab | length one character. The x-axis label. |
ylab | length one character. The y-axis label. |
title | length one character. Plot title. |
subtitle | length one character. Plot subtitle; if "default", prints dictionary length and total covered fraction. |
... | further arguments passed to or from other methods. |
This function generates nice plots of cumulative corpus coverage
fractions. The x
coordinate in the resulting plot is the word rank in the
underlying dictionary; the y
coordinate at
x
is the cumulative coverage fraction for rank <= x
.
Valerio Gherardi
# }