Get sentence tokens from text
tokenize_sentences(input, EOS = ".?!:;")
input | a character vector. |
---|---|
EOS | a length one character vector listing all (single character) end-of-sentence tokens. |
a character vector, each entry of which corresponds to a single sentence.
Valerio Gherardi
tokenize_sentences("Hi there! I'm using `sbo`.")#> [1] "Hi there" "I'm using `sbo`"