Package: RecordLinkage 0.4-12.6
RecordLinkage: Record Linkage Functions for Linking and Deduplicating Data Sets
Provides functions for linking and deduplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain. For details, see our paper "The RecordLinkage Package: Detecting Errors in Data" Sariyar M / Borg A (2010) <doi:10.32614/RJ-2010-017>.
Authors:
RecordLinkage_0.4-12.6.tar.gz
RecordLinkage_0.4-12.6.zip(r-4.7)RecordLinkage_0.4-12.6.zip(r-4.6)RecordLinkage_0.4-12.6.zip(r-4.5)
RecordLinkage_0.4-12.6.tgz(r-4.6-x86_64)RecordLinkage_0.4-12.6.tgz(r-4.6-arm64)RecordLinkage_0.4-12.6.tgz(r-4.5-x86_64)RecordLinkage_0.4-12.6.tgz(r-4.5-arm64)
RecordLinkage_0.4-12.6.tar.gz(r-4.7-arm64)RecordLinkage_0.4-12.6.tar.gz(r-4.7-x86_64)RecordLinkage_0.4-12.6.tar.gz(r-4.6-arm64)RecordLinkage_0.4-12.6.tar.gz(r-4.6-x86_64)
RecordLinkage_0.4-12.6.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
RecordLinkage/json (API)
| # Install 'RecordLinkage' in R: |
| install.packages('RecordLinkage', repos = c('https://sym33.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/sym33/recordlinkage/issues
- identity.RLdata10000 - Test data for Record Linkage
- identity.RLdata500 - Test data for Record Linkage
- RLdata10000 - Test data for Record Linkage
- RLdata500 - Test data for Record Linkage
Last updated from:fb8c03fae4. Checks:4 ERROR, 2 OK, 7 NOTE. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-arm64 | ERROR | 197 | ||
| linux-devel-x86_64 | ERROR | 167 | ||
| source / vignettes | OK | 205 | ||
| linux-release-arm64 | ERROR | 174 | ||
| linux-release-x86_64 | ERROR | 140 | ||
| macos-release-arm64 | NOTE | 150 | ||
| macos-release-x86_64 | NOTE | 328 | ||
| macos-oldrel-arm64 | NOTE | 170 | ||
| macos-oldrel-x86_64 | NOTE | 264 | ||
| windows-devel | NOTE | 135 | ||
| windows-release | NOTE | 143 | ||
| windows-oldrel | NOTE | 126 | ||
| wasm-release | OK | 150 |
Exports:[.RecLinkData[.RecLinkResult[.RLBigData[.RLResult%append%beginblockfldfunclassifySupvclassifyUnsupclearclonecompare.dedupcompare.linkagecountpatterndeleteNULLseditMatchemClassifyemWeightsepiClassifyepiWeightserrorMeasuresfsClassifyfsWeightsgenSamplesgetColumnNamesgetErrorMeasuresgetExpectedSizegetFalsegetFalseNeggetFalsePosgetFrequenciesgetMatchCountgetMinimalTraingetNACountgetNonMatchCountgetPairsgetPairsBackendgetParetoThresholdgetPatternCountsgetSQLStatementgetTablegetThresholdsgpdEsthasWeightsinit_sqlite_extensionsisFALSEjarowinklerlevenshteinDistlevenshteinSimloadRLObjectmakeBlockingPairsmrlmygllmnextPairsoptimalThresholdplotMRLprint.summaryRLBigDataDedupprint.summaryRLBigDataLinkageprint.summaryRLResultresampleRLBigDataDedupRLBigDataLinkagesaveRLObjectsoundexsplitDatasummary.RecLinkDatasummary.RecLinkResultsummary.RLBigDataDedupsummary.RLBigDataLinkagesummary.RLResulttexSummarytrainSupvunorderedPairs
Dependencies:bitbit64blobcachemclassclicodetoolscpp11data.tableDBIdiagramdigeste1071evdfarverfastmapfffuturefuture.applyggplot2globalsgluegtableipredisobandKernSmoothlabelinglatticelavalifecyclelistenvMASSMatrixmemoisennetnumDerivparallellypkgconfigprodlimprogressrproxyR6RColorBrewerRcpprlangrpartRSQLiteS7scalesshapeSQUAREMsurvivalvctrsviridisLitewithrxtable
Last update: 2026-01-24
Started: 2026-01-24
Last update: 2026-01-24
Started: 2026-01-24
Last update: 2026-01-24
Started: 2026-01-24
Last update: 2026-01-24
Started: 2026-01-24
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Concatenate comparison patterns or classification results | %append% %append%,RecLinkData,RecLinkData-method %append%,RecLinkResult,RecLinkResult-method %append%-methods |
| Supervised Classification | classifySupv classifySupv,RecLinkClassif,RecLinkData-method classifySupv,RecLinkClassif,RLBigData-method classifySupv-methods |
| Unsupervised Classification | classifyUnsup |
| Serialization of record linkage object. | clone clone,RLBigData-method clone,RLResult-method clone-methods loadRLObject saveRLObject saveRLObject,RLBigData-method saveRLObject,RLResult-method saveRLObject-methods |
| Compare Records | compare.dedup compare.linkage |
| Remove NULL Values | deleteNULLs |
| Edit Matching Status | editMatch editMatch,RecLinkData-method editMatch,RLBigData-method editMatch-methods |
| Weight-based Classification of Data Pairs | emClassify emClassify,RecLinkData,ANY,ANY-method emClassify,RecLinkData,missing,missing-method emClassify,RLBigData,ANY,ANY-method emClassify,RLBigData,missing,missing-method emClassify,RLBigData-method |
| Calculate weights | emWeights emWeights,RecLinkData-method emWeights,RLBigData-method emWeights-methods |
| Classify record pairs with EpiLink weights | epiClassify epiClassify,RecLinkData-method epiClassify,RLBigData-method epiClassify-methods |
| Calculate EpiLink weights | epiWeights epiWeights,RecLinkData-method epiWeights,RLBigData-method epiWeights-methods |
| Class '"ff_vector"' | ff_vector-class |
| Class '"ffdf"' | ffdf-class |
| Generate Training Set | genSamples |
| Calculate Error Measures | errorMeasures getErrorMeasures getErrorMeasures,RecLinkResult-method getErrorMeasures,RLResult-method getErrorMeasures-methods |
| Estimate number of record pairs. | getExpectedSize getExpectedSize,data.frame-method getExpectedSize,RLBigDataDedup-method getExpectedSize,RLBigDataLinkage-method getExpectedSize-methods |
| Get attribute frequencies | getFrequencies getFrequencies,RLBigData-method getFrequencies-methods |
| Create a minimal training set | getMinimalTrain getMinimalTrain,RecLinkData-method getMinimalTrain,RLBigData-method getMinimalTrain-methods |
| Extract Record Pairs | getFalse getFalseNeg getFalsePos getPairs getPairs,RecLinkData-method getPairs,RecLinkResult-method getPairs,RLBigData-method getPairs,RLResult-method getPairs-methods |
| Estimate Threshold from Pareto Distribution | getParetoThreshold getParetoThreshold,RecLinkData-method getParetoThreshold,RLBigData-method getParetoThreshold-methods |
| Build contingency table | getTable getTable,RecLinkResult-method getTable,RLResult-method getTable-methods |
| Estimate Threshold from Pareto Distribution | gpdEst |
| Check for FALSE | isFALSE |
| Generalized Log-Linear Fitting | mygllm |
| Optimal Threshold for Record Linkage | optimalThreshold optimalThreshold,RecLinkData-method optimalThreshold,RLBigData-method optimalThreshold-methods |
| Phonetic Code | phonetics soundex |
| Class "RecLinkClassif" | RecLinkClassif RecLinkClassif-class |
| Class "RecLinkData" | RecLinkData-class |
| Record Linkage Data Object | RecLinkData RecLinkData.object |
| Class "RecLinkResult" | RecLinkResult-class |
| Record Linkage Result Object | RecLinkResult RecLinkResult |
| Record Linkage Result Object | RecLinkResult RecLinkResult |
| Safe Sampling | resample |
| Class "RLBigData" | RLBigData-class |
| Constructors for big data objects. | RLBigDataDedup RLBigDataLinkage |
| Class "RLBigDataDedup" | RLBigDataDedup-class |
| Class "RLBigDataLinkage" | RLBigDataLinkage-class |
| Test data for Record Linkage | identity.RLdata10000 identity.RLdata500 RLdata10000 RLdata500 |
| Class "RLResult" | RLResult-class |
| Show a RLBigData object | show show,RLBigData-method |
| Split Data | splitData |
| Stochastic record linkage. | fsClassify fsClassify,RecLinkData-method fsClassify,RLBigData-method fsClassify-methods fsWeights fsWeights,RecLinkData-method fsWeights,RLBigData-method fsWeights-methods |
| String Metrics | jaro jarowinkler levenshtein levenshteinDist levenshteinSim strcmp winkler |
| Subset operator for record linkage objects | [.RecLinkData [.RecLinkResult [.RLBigData [.RLResult |
| Print Summary of Record Linkage Data | summary.RecLinkData summary.RecLinkResult |
| summary methods for '"RLBigData"' objects. | print.summaryRLBigDataDedup print.summaryRLBigDataLinkage summary.RLBigData summary.RLBigDataDedup summary.RLBigDataLinkage |
| Summary method for '"RLResult"' objects. | print.summaryRLResult summary,RLResult-method summary.RLResult |
| Train a Classifier | trainSupv |
| Create Unordered Pairs | unorderedPairs |
