Integrating data from multiple sources with the aim to identify records that correspond to the same entity is required in many real-world applications including healthcare, national security, and businesses. However, privacy and confidentiality concerns impede the sharing of personal identifying values to conduct linkage across different organizations. Privacy-preserving record linkage (PPRL) techniques have been developed to tackle this problem by performing clustering based on the similarity between encoded record values, such that each cluster contains (similar) records corresponding to one single entity. When employing PPRL on databases from multiple parties, one major challenge is the prohibitively large number of similarity comparisons required for clustering, especially when the number and size of databases are large. While there have been several private blocking methods proposed to reduce the number of comparisons, they fall short in providing an efficient and effective solution for linking multiple large databases. Further, all of these methods are largely dependent on data. In this paper, we propose a novel private blocking method for efficiently linking multiple databases by exploiting the data characteristics in the form of probabilistic signatures and introduce a local blocking evaluation step for validating blocking methods without knowing the ground-truth. Experimental results show the efficacy of our method in comparison to several state-of-the-art methods.
Contributors:Marc Schulder, Yury Bakanouski
ATC-Anno is an annotation tool for the transcription and semantic annotation of air traffic control utterances.
It was developed at the Spoken Language Systems (LSV) group at Saarland University.
The latest version of the tool can always be found on the LSV GitHub account.
If you use the tool in your research, please cite the associated paper:
Marc Schulder, Johannah O'Mahony, Yury Bakanouski, Dietrich Klakow (2020). ATC-Anno: Semantic Annotation for Air Traffic Control with Assistive Auto-Annotation. In Proceedings of the International Conference on Language Resources and Evaluation (LREC), Marseilles, France.
Contributors:Bullen, Jay C
MATLAB codes used to model arsenic(III) remediation using a composite TiO2-Fe2O3 sorbent in batch and continuous-flow systems, using a modified form of the pseudo-second order (PSO) adsorption kinetic model.
This data supports the manuscript provisionally titled 'A kinetic adsorption model to inform the design of arsenic(III) treatment plants using photocatalyst-sorbent materials'
Contributors:Júlio Hoffimann, Fredrik Ekre, Martijn Visser, Tony Kelman, Morten Piibeleht, M. A. Siddique, Durand D'souza, Anshul Singhvi
Diff since v0.10.2
Update URLs everywhere in the codebase (#47)
Remove dependency on MLJBase.jl (#53)
Merged pull requests:
MassInstallAction: Install the CompatHelper workflow on this repository (#50) (@juliohm)
MassInstallAction: Install the TagBot workflow on this repository (#51) (@juliohm)
Contributors:Philipp Rudiger, Jean-Luc Stevens, James A. Bednar, Bas Nijholt, Andrew, Chris B, Achim Randelhoff, Vasco Tenner, Jon Mease, maxalbert, Markus Kaiser, ea42gh, stonebig, Jordan Samuels, henriqueribeiro, John Bampton, Scott Lowe, Florian LB, Daniel Stephan, Andrew Tolmie, arabidopsis, Yuval Langer, Lukas Barth, Leopold Talirz, Justin Bois, Julia Signell, Irv Lustig, Benjamin W. Portner, Anthony Monthe, Anar Z. Yusifov
This is a minor patch release fixing a number of regressions introduced as part of the 1.13.x releases. Many thanks to the contributors including @eddienko, @poplarShift, @wuyuani135, @maximlt and the maintainer @philippjfr.
Add PressUp and PanEnd streams (#4334)
Fix regression in single node Sankey computation (#4337)
Fix color and alpha option on bokeh Arrow plot (#4338)
Fix undefined JS varaibles in various bokeh links (#4341)
Fix matplotlib >=3.2.1 deprecation warnings (#4335)
Fix handling of document in server mode (#4355)
Contributors:Juniper L. Simonis
Tools for interacting with the publicly available California Delta Fish Salvage Database, including continuous deployment of data access, analysis, and presentation.
Diff since v1.0.4
Register v1.0.4 (#4)
Merged pull requests:
CompatHelper: bump compat for "PrettyTables" to "0.8" (#6) (@github-actions[bot])
CompatHelper: bump compat for "Requires" to "1.0" (#7) (@github-actions[bot])
CompatHelper: bump compat for "Compose" to "0.8" (#8) (@github-actions[bot])
Contributors:Rob J Goedman, Richard Torkar, alecloudenback, Tamas K. Papp
Diff since v2.1.3
Merged pull requests:
CompatHelper: bump compat for "Distributions" to "0.23" (#87) (@github-actions[bot])
CompatHelper: bump compat for "StatsBase" to "0.33" (#88) (@github-actions[bot])
CompatHelper: bump compat for "CSV" to "0.6" (#89) (@github-actions[bot])