NEWS
RecordLinkage 0.4-12 (2020-04-09)
- -adapted factor handling to the new R standard
-switched from sweave to knitr for the vignettes
-fixed some SQL statement issues in RSQLite
-fixed the Pointer Mismatch (const char *, const unsigned char *) in sqlite_extensions.c
-removed the phonetic code function due to its encoding-dependent mapping tables
-made the package independent from the now orphaned ffbase package
-Replace class(x)==y statementy by is(x,y) statements
RecordLinkage 0.4-11 (2019-03-14)
- -added missing S3 method exports
-fixed broken/non-canonical URLs
-fixed some logical expressions to deal with non-scalar values
(contributed by Kurt Hornik)
RecordLinkage 0.4-10 (2016-07-27)
- -added SQLite header files to be compatible with upcoming RSQLite package,
which does not export them anymore
RecordLinkage 0.4-9 (2016-05-02)
- -added import declarations to pass R 3.3.0 package checks
RecordLinkage 0.4-8 (2015-05-28)
- -fixed import declaration for ffbase imports
-translated source code omments
-removed non-ASCII characters
-added S3method-declarations in NAMESPACE
RecordLinkage 0.4-7 (2015-01-15)
- -added unit tests (inst/unitTests) to .Rbuildignore
-accelerated example for epiWeights and epiClassify
RecordLinkage 0.4-6 (2014-12-11)
- -updated maintainer, author and contact information
-fixed notes "no visible binding" in R >= 2.15.1
-moved packages in "Depends" to "Imports" where appropriate and added
import statement in NAMESPACE
RecordLinkage 0.4-5
- -fixed truncated lines in PDF manual
RecordLinkage 0.4-4
- -fixed memory access problem in C code
RecordLinkage 0.4-3
- -added check if bringToTop() is availabe in getParetoThreshold
-avoid warning in countpattern
RecordLinkage 0.4-2
- -changed the way keys are set for data tables for compatibility with data.table 1.7.8
RecordLinkage 0.4-1 (2012-01-12)
- -Removed printf statements from compiled code to prevent NOTE in R CMD check
RecordLinkage 0.4-0
- -The classes for linkage of big data sets have been redesigned. The mechanism of creating comparison patterns on the fly from a database is replaced by saving all comparison patterns in a ffdf object (disk-based data frame, see package ff). This approach limits the amount of comparison patterns with respect to available disk space (the database approach was designed to handle even more patterns, at least in theory), but is more efficient regarding execution speed. In addition, some methods that were only usable with the S3 class RecLinkData, can now be used with RLBigData objects, most notably getMinimalTrain(). For the structure of the redesigned classes, see the documention entries for classes "RLBigData", "RLBigDataDedup" and "RLBigDataLinkage".
RecordLinkage 0.3-5 (2011-09-12)
- -bug fixes:
-getExpectedSize failed in rare cases
RecordLinkage 0.3-4 (2011-08-10)
- -added summary method for RLResult objects
-bug fixes:
-summary.RecLinkResult failed for object without known
matching status
RecordLinkage 0.3-3 (2011-07-02)
- -added new classification method "bumping", see ?trainSupv
-bug fixes:
- string comparators crashed for character(0) input
(reported by Dominik Reusser)
- getPairs could fail when the result is emtpy
-various performance improvements
-renamed vignette "Fellegi-Sunter Deduplication" to
"Weight-based deduplication"
-package is now in a namespace
RecordLinkage 0.3-2 (2011-05-11)
- performance improvements in various functions
- bug fixes:
- epiClassify failed for datasets with only one column
(reported by Richard Herron)
- getErrorMeasures returned wrong values in rare cases
RecordLinkage 0.3-1 (2011-03-24)
- fixed bug in emClassify that threw an error during classification
with on-the-fly calculation of weights
- added checks in unit tests
- fixed bug in optimalThreshold which caused wrong results in some
cases when ny was supplied
- added status messages and progress bars to various classification functions
- fixed numerical problem that produced wrong thresholds in some cases
for emClassify (with my / ny supplied)
RecordLinkage 0.3-0 (2011-03-15)
- A framework of S4 methods and classes tailored for processing large
data sets has been added. See the vignette 'BigData' for details.
- The output format of getPairs (existing method for RecLinkData and
RecLinkResult objects) has changed: Column "Weight" is now last instead
of first. Also, for single.rows=FALSE, record pairs are speparated by
a blank line.
RecordLinkage 0.2-6 (2011-02-15)
- added subset operator [ for RecLinkData and RecLinkPairs objects
- improved running time of epiWeights significantly
RecordLinkage 0.2-5
- Fixed a bug in getMinimalTrain which caused the function to fail
RecordLinkage 0.2-4 (2011-01-05)
- Fixed a bug in getMinimalTrain that caused pairs with unknown
matching status to become non-matches in the training set.
- Some examples were modified to reduce execution time.
- Unit tests have been added in inst/unitTests.
- Checking of arguments improved in some functions.
- Small documentation changes.
RecordLinkage 0.2-3 (2010-09-28)
- getParetoThreshold now prompts a message if no data lies in the
interactively selected interval and asks the user to choose again
- Corrected maintainer e-mail address
- Several bug fixes