fuzzystring: Fast Fuzzy String Joins for Data Frames

Perform fuzzy joins on data frames using approximate string matching. Implements inner, left, right, full, semi, and anti joins with string distance metrics from the 'stringdist' package, including Optimal String Alignment, Levenshtein, Damerau-Levenshtein, Jaro-Winkler, q-gram, cosine, Jaccard, and Soundex. Uses a 'data.table' backend plus compiled 'C++' result assembly to reduce overhead in large joins, while adaptive candidate planning avoids unnecessary distance evaluations in single-column string joins. Suitable for reconciling misspellings, inconsistent labels, and other near-match identifiers while optionally returning the computed distance for each match.
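The join semantics described above can be illustrated with a small, language-neutral sketch. The Python below is not the fuzzystring API and does not use 'stringdist' or 'data.table'. The names `levenshtein`, `fuzzy_inner_join`, `max_dist`, and `distance_col` are hypothetical, chosen only for this illustration. It shows an inner join on one string column using Levenshtein distance, with a simple length-difference filter standing in for the idea of skipping distance evaluations that cannot succeed. The package's actual candidate planning is not documented here and may work differently.

```python
from typing import Dict, List, Optional


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,           # deletion
                            curr[j - 1] + 1,       # insertion
                            prev[j - 1] + cost))   # substitution or match
        prev = curr
    return prev[-1]


def fuzzy_inner_join(left: List[Dict], right: List[Dict], key: str,
                     max_dist: int = 2,
                     distance_col: Optional[str] = None) -> List[Dict]:
    """Pair every left row with every right row whose key is within max_dist edits."""
    out = []
    for lrow in left:
        for rrow in right:
            # Assumed pruning step: a length gap larger than max_dist already
            # guarantees the Levenshtein distance exceeds max_dist.
            if abs(len(lrow[key]) - len(rrow[key])) > max_dist:
                continue
            d = levenshtein(lrow[key], rrow[key])
            if d <= max_dist:
                # Suffix column names so both sides survive in the result.
                row = {f"{k}.x": v for k, v in lrow.items()}
                row.update({f"{k}.y": v for k, v in rrow.items()})
                if distance_col is not None:
                    row[distance_col] = d  # optionally report the distance
                out.append(row)
    return out


left = [{"name": "colour", "id": 1}, {"name": "centre", "id": 2}]
right = [{"name": "color", "code": "A"},
         {"name": "center", "code": "B"},
         {"name": "flavor", "code": "C"}]

matches = fuzzy_inner_join(left, right, key="name", max_dist=2,
                           distance_col="dist")
for m in matches:
    print(m["name.x"], m["name.y"], m["dist"])
```

In this sketch "colour" pairs with "color" and "centre" with "center", while "flavor" finds no partner. A left, anti, or semi join would differ only in how unmatched or matched left rows are kept.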

Version: 0.0.5
Depends: R (≥ 4.1)
Imports: data.table, Rcpp, stringdist
LinkingTo: Rcpp
Suggests: dplyr, ggplot2, knitr, qdapDictionaries, readr, rmarkdown, rvest, stringr, testthat (≥ 3.0.0), tibble, tidyr
Published: 2026-03-28
DOI: 10.32614/CRAN.package.fuzzystring
Author: Paul E. Santos Andrade [aut, cre], David Robinson [ctb] (aut of fuzzyjoin)
Maintainer: Paul E. Santos Andrade <paulefrens at gmail.com>
BugReports: https://github.com/PaulESantos/fuzzystring/issues
License: MIT + file LICENSE
URL: https://github.com/PaulESantos/fuzzystring, https://paulesantos.github.io/fuzzystring/
NeedsCompilation: yes
Materials: README, NEWS
CRAN checks: fuzzystring results

Documentation:

Reference manual: fuzzystring.html, fuzzystring.pdf
Vignettes: Benchmarking fuzzystring against fuzzyjoin (source, R code); Getting Started with fuzzystring (source, R code)

Downloads:

Package source: fuzzystring_0.0.5.tar.gz
Windows binaries: r-devel: fuzzystring_0.0.5.zip, r-release: fuzzystring_0.0.5.zip, r-oldrel: fuzzystring_0.0.5.zip
macOS binaries: r-release (arm64): fuzzystring_0.0.5.tgz, r-oldrel (arm64): fuzzystring_0.0.1.tgz, r-release (x86_64): fuzzystring_0.0.5.tgz, r-oldrel (x86_64): fuzzystring_0.0.5.tgz
Old sources: fuzzystring archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=fuzzystring to link to this page.