match_name()
gains a new argument in its signature,
sector_classification
, which is placed before the
...
argument. Where users have used ...
to
pass additional unnamed arguments by position, they will need to update
their code to pass these arguments by name so that they are not confused
as an input to sector_classfication
. The new argument
sector_classification
is optional and defaults to
r2dii.data::sector_classifications
. Explicitly passing a
sector_classification
to match_name()
, enables
users to use their own sector classification systems to match loan books
to the abcd, assuming the sector_classification
is mapped
appropriately to PACTA sectors and has the same format as
r2dii.data::sector_classifications
. Own sector
classifications can no longer be passed to match_name()
via
options.Complete deprecation of ald
in favour of
abcd
(#399).
match_name
gains argument join_id
allowing an optional perfect join based on a mutual ID column between
loanbook
and abcd
inputs, prior to attempting
fuzzy matching (#135).
to_alias
can now handle strange encodings without error
(#425, @kalashsinghal @Tilmon).r2dii.match
has transferred to a new organization
https://github.com/RMI-PACTA/.abcd
of match_name()
supersedes the argument ald
(#399).With
options(r2dii.match.sector_classifications = own)
users can
inject their own
sector_classififications
instead of the default r2dii.data::sector_classifications
.
With this feature, user may or may not choose to request their
sector_classifications
to be added to r2dii.data. This
feature is experimental and may be dropped and/or become a new argument
to match_name()
(#356 @georgeharris2deg @daisy-pacheco)`.
match_name()
now errors if the column
id_loan
of the input loanbook has any duplicated value
id_loan
(@georgeharris2deg #349).
match_name()
gains ...
to pass additional
arguments to stringdist::stringsim()
(@evgeniadimi
#310).prioritize()
with 0-row input now returns the input
untouched (#284).match_name()
: Remove dependency on
nest_by()
from dplyr 1.0.0 (#303).match_name()
with large
loanbooks.match_name()
now outputs a new column
borderline
(#258).crucial_lbk()
helps select the minimum loanbook
columns for match_name()
to run (#236).match_name()
now runs faster and uses less memory
(@georgeharris2deg #214).match_name()
now converts ald$sector
to
lower case before matching (@georgeharris2deg #257). It now
returns identical output with, for example, either “POWER” or “power”.
Notice that the input “POWER” in ald$sector
becomes “power”
in the column sector_ald
of the output.match_name()
now errors with a more informative message
if loanbook
has reserved columns – alias
,
rowid
, or sector
(#233).This version includes only internal changes.