Basketball, P. (2000). From inside the P. Ball, H. F. Spirer, & L. Spirer (Eds.), Putting some Situation: Investigating Large scale Peoples Legal rights Violations Having fun with Advice Options and Study Data. AAAS.
Belin, T. Roentgen., & Rubin, D. B. (1995). A strategy having calibrating not the case-meets cost in the list linkage. Diary of your own Western Mathematical Organization, 90(430), 694–707.
Bilenko, Yards., & Mooney, Roentgen. J. (2003). Transformative Duplicate Detection Having fun with Learnable Sequence Resemblance Measures. During the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automatic Record Linkage Having fun with Seeded Nearby Neighbour and you will Support Vector Machine Group. Inside KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A survey away from indexing methods for scalable list linkage and you may deduplication. IEEE Purchases towards Studies and you can Analysis Systems, 24(9), 1537–1555.
Cohen, W., Raviku). An assessment off string metrics having complimentary names and info. Inside the KDD working area on data cleaning and you may object consolidation (Vol. 3, pp. 73–78).
Copas, J., & Hilton, F. (1990). Number linkage: Analytical patterns to have coordinating computer facts. Log of one’s Royal Analytical Area, Series An effective, 153(3), 287–320.
Dai, A good. M., & Storkey https://internationalwomen.net/fi/israelilaiset-naiset/, A. J. (2011). The newest categorized writer-point design for unsupervised entity solution. During the Phony neural sites and host studying–icann 2011 (pp. 241–249). Springer.
Fortini, Meters., Liseo, B., Nuccitelli, Good., & Scanu, Meters. (2001). To your Bayesian Number Linkage. Research inside Formal Statistics, 4(1), 185–198.
Gutman, R., Afendulis, C., & Zaslavsky, Good. (2013). An effective bayesian process of document connecting to analyze prevent- of-existence scientific can cost you. Journal of your Western Analytical Connection, 108(501), 34–47.
Hsu, W., Lee, M. L., Liu, B., & Ling, T. W. (2000). Exploration Mining for the Diabetics Database: Results and Results. During the KDD ’00 (pp. 430–436). ACM.
A torn-blend Markov chain Monte Carlo procedure of new Dirichlet processes combination model
Jewell, Letter. P., Spagat, Yards., & Jewell, B. L. (2013). MSE and you will Casualty Counts: Presumptions, Translation, and you may Demands. From inside the T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Counting Civilian Casualties: An overview of Recording and you may Quoting Nonmilitary Fatalities incompatible. Oxford, UK: Oxford University Force.
Larsen, Meters. D. (2002)ments into Hierarchical Bayesian List Linkage. In the Proceedings of your own shared analytical conferences, section toward questionnaire research strategies (pp. 1995–2000). The new Western Statistical Association.
Steorts, R
Larsen, Meters. D. (2005). Advances from inside the Record Linkage Theory: Hierarchical Bayesian Checklist Linkage Idea. Inside Procedures of your own joint analytical group meetings, point with the survey look strategies (pp. 3277–3284). The Western Analytical Organization.
Larsen, Meters. D., & Rubin, D. B. (2001). Iterative automatic listing linkage playing with mix models. Diary of Western Statistical Connection, 96(453), 32–41.
Lum, K., Speed, Yards. Elizabeth., & Banks, D. (2013). Software out-of Multiple Options Estimate into the Human Legal rights Search. The new American Statistician, 67(4), 191–2 hundred.
Marchant, N. Grams., C., Kaplan, Good., Rubinstein, B. I. P., & Elazar, D. Letter. (2019). D-blink: Marketed avoid-to-avoid bayesian organization solution.
McCallum, A good., & Wellner, B. (2004). Conditional Types of Label Uncertainty with Application so you’re able to Noun Coreference. Inside the Advances from inside the neural suggestions control expertise (nips ’04) (pp. 905–912). MIT Press.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A website-Particular Equipment into Deduplication from Vaccination Records Details inside Youthfulness Immunization Registriesputers and you may Biomedical Search, 33(2), 126–143.
Murphy, J., Brackbill, R. M., Thalji, L., Dolan, Meters., Pulliam, P., & Walker, D. J. (2007). Computing and you will Increasing Visibility worldwide Trading Center Health Registry. Statistics for the Treatments, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic list linkage and you may deduplication shortly after indexing, blocking, and you can selection. Log away from Confidentiality and you can Privacy, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. Meters., Axford, S. J., & James, A. P. (1959). Automatic linkage away from vital records computers are often used to extract” follow-up” statistics from families regarding documents out of regime information. Technology, 130(3381), 954–959.
Sadinle, M. (2014). Detecting Duplicates in a homicide Registry Playing with a beneficial Bayesian Partitioning Approach. Annals off Used Analytics, 8(4), 2404–2434.
Sariyar, M., Borg, A., & Pommerening, K. (2012). Productive Discovering Tips for the fresh new Deduplication from Electronic Diligent Analysis Using Classification Woods. Log out-of Biomedical Informatics, 45(5), 893–900.
C., Hall, Roentgen., & Fienberg, S. Age. (2016). An excellent Bayesian Method of Visual Record Linkage and you can Deduplication. Record of the Western Analytical Relationship, 111(516), 1660–1672.
Tancredi, A great., & Liseo, B. (2011). A good hierarchical Bayesian approach to listing linkage and people size issues. Annals of Used Statistics, 5(2B), 1553–1585.
Leave a comment