failure: length > 1 in coercion to logical
issue
in n_gram_merge()
, in commit 4336ee6.stringdist
C API, and
calling C functions in place of using
stringdist::stringdistmatrix()
. This change results in
speed improvements in function n_gram_merge()
, and requires
that stringdist
v0.9.5.1 or greater be installed.n_gram_merge()
, renamed arg
edit_dist_weights
to weight
. The only purpose
of this arg is to be passed along to function
stringdistmatrix
from the stringdist
package (which uses the name weight
, so this change is
simply to match that).Fixed issue in which input strings that contained accent marks
were not being properly handled/clustered (#9). The fix
involved adding stringi to
Imports
and using
stringi::stri_trans_general()
.
Fixed issue in n_gram_merge()
in which incorrect
values were being return when input arg ignore_strings
was
not NULL
, and arg bus_suffix = FALSE
(#7).
Fixed issue in which input strings that contained punctuation that was NOT surrounded by spaces was returning incorrect values (#6).
Fixed issue in which the edit value assigned to a cluster was sometimes not the most frequent string in that cluster (#5).
std::unordered_map()
, resulting in a substantial speed
improvement when passing large character vectors (length 100,000+) to
either of the exported functions (#8).