It brings together several aspects of biodiversity
data cleaning in one place. 'bdc' is organized in thematic modules
related to different biodiversity dimensions, including 1) Merge
datasets: standardization and integration of different datasets; 2)
Pre-filter: flagging and removal of invalid or non-interpretable
information, followed by data amendments; 3) Taxonomy: cleaning,
parsing, and harmonization of scientific names from several taxonomic
groups against locally stored taxonomic databases, using exact and
partial matching algorithms; 4) Space:
flagging of erroneous, suspect, and low-precision geographic
coordinates; and 5) Time: flagging and, whenever possible, correction
of inconsistent collection dates. In addition, it contains
features to visualize, document, and report data quality, which is
essential for making data quality assessment transparent and
reproducible. The reference for the methodology is Bruno et al. (2022)
<doi:10.1111/2041-210X.13868>.
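
The flagging steps of the Pre-filter, Space, and Time modules can be sketched as below. This is a minimal illustration, not the authors' reference workflow: the toy data frame is invented for the example, and the function names, arguments, and flag-column names are assumptions that should be checked against the reference manual of the installed 'bdc' version.

```r
# install.packages("bdc")  # run once if the package is not yet installed
library(bdc)

# Hypothetical toy records using Darwin Core style column names (invented data)
records <- data.frame(
  scientificName   = c("Ocotea odorifera", NA, "Panthera onca", ""),
  decimalLatitude  = c(-23.5, -43.3, NA, 95.2),
  decimalLongitude = c(-46.6, -39.6, -20.5, -44.1),
  eventDate        = c("1981-03-10", "1800-01-01", "2011-03-09", NA),
  stringsAsFactors = FALSE
)

# Pre-filter: flag records without a scientific name
flagged <- bdc_scientificName_empty(data = records, sci_names = "scientificName")

# Pre-filter: flag records with missing coordinates
flagged <- bdc_coordinates_empty(
  data = flagged, lat = "decimalLatitude", lon = "decimalLongitude"
)

# Pre-filter: flag coordinates outside the valid latitude/longitude range
flagged <- bdc_coordinates_outOfRange(
  data = flagged, lat = "decimalLatitude", lon = "decimalLongitude"
)

# Time: flag collection years that are implausible (threshold assumed here: 1900)
flagged <- bdc_year_outOfRange(
  data = flagged, eventDate = "eventDate", year_threshold = 1900
)

# Each step is expected to append one logical flag column whose name starts with "."
print(names(flagged))
```

Each call returns the input data with one additional flag column, so records are marked rather than silently dropped; removal can then be a separate, documented decision.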
Version: 1.1.5
Imports: CoordinateCleaner, doParallel, dplyr, DT, foreach, fs, ggplot2, here, magrittr, purrr, qs, readr, rgnparser, rnaturalearth, sf (≥ 1.0.5), stringdist, stringi, stringr, taxadb (≥ 0.1.3), tibble, tidyselect
Suggests: contentid (≥ 0.0.15), covr, cowplot, DBI, duckdb (≥ 0.3.2), knitr (≥ 1.31), maps, markdown, rappdirs, raster, remotes, rlang (≥ 1.0.1), rmarkdown, rnaturalearthdata, sp, rvest, xml2, testthat (≥ 3.0.0)
Published: 2024-12-17
DOI: 10.32614/CRAN.package.bdc
Author: Bruno Ribeiro [aut, cre], Santiago Velazco [aut], Karlo Guidoni-Martins [aut], Geiziane Tessarolo [aut], Lucas Jardim [aut], Steven Bachman [ctb], Rafael Loyola [ctb]
Maintainer: Bruno Ribeiro <ribeiro.brr at gmail.com>
BugReports: https://github.com/brunobrr/bdc/issues
License: GPL (≥ 3)
URL: https://brunobrr.github.io/bdc/ (website), https://github.com/brunobrr/bdc
NeedsCompilation: no
Language: en-gb
Materials: README, NEWS
CRAN checks: bdc results