Literaturverzeichnis
- [Anan03]
R. Ananthakrishna, S. Chaudhuri, V. Ganti (2003): Eliminating Fuzzy Duplicates in Data Warehouses.Hong Kong.
- [Balouch03]
A. Balouch, A. Heuer,H. Meyer (2003): Schema Integration und XQuery Core in Digitalen Bibliotheken. Lehrstuhl Datenbank- und Info-systemen, Rostock.
- [Bhattdmkd04]
Indrajit Bhattacharya, Lise Getoor (2004): Iterative Record Linkage for Cleaning and Integration. .Maryland.
- [Bilen03]
Mikhail Bilenko, Raymond J. Mooney (): Adaptive Duplicate Detection Using Learnable String Similarity Measures. ..
- [Brockhaus99]
Brockhaus(1999): Brockhaus. Die Enzyklopädie in 24 Bänden.. Bibliographisches Institut & F. A. Brockhaus AG., Mannheim.
- [Cai05]
Rita Sharma, David Poole (): Probability and Equality: A Probabilistic Model of Identity Uncertainty. .British Columbia.
- [Callumnigamco]
A. McCallum, K. Nigam, J. Rennie, K. Seymore (1999): A Machine Learning Approach to Building Domain-Specific Search Engines. .Pittsburg.
- [Callumnigamc2]
A. McCallum, K. Nigam, J. Rennie, K. Seymore (2000): Automating the Construction of Internet Portals with Machine Learning. .Pittsburg.
- [CDay]
C. Day (1997): A Checklist for Evaluating Record Linkage Software. .,.
- [CIAC2000]
Yusuke Shibata, Takuya Kida, Shuichi Fukamachi et al. (2000):Speeding Up Pattern Matching by Text Compression. Kyushu University 33.Dept. of Informatics, Fukuoka 812-8581.
- [Cohen03]
W. W. Cohen, P.Ravikumar, St. E. Fienberg (2003): A Comparison of String Distance Metrics for Name-Matching Tasks. American Association for Artificial Intelligence..
- [Cupid01]
E. Rahm, P.A. Bernstein, J. MadhavanGeneric Schema Matching with Cupidt. Proceedings of the 27th VLDB Conference (2001)
- [Doan01]
AnHai Doan, Ying Lu, Yoonkyong Lee, Jiawei Han (2001): Object Matching for Information Integration: A Profiler-Based Approach. .Illinois.
- [DuplDet06]
Tobias Ackermann, Florian WeigoldDuplicate Detection using Machine Learning2006
- [Febrl05]
Peter Christen, Tim Churches(2003): Febrl - freely extensible biomedical record linkage. Technical Manual.
- [Fellegi69]
Fellegi, I. P., Sunter, A. B. (1969): A Theory for Record Linkage. Journal of the American Statistical Association.,Washington.
- [Ferber03]
Reginald Ferber(2003): Information Retrieval - Suchmodelle und Data- Mining.. dpunkt Verlag, Heidelberg.
- [Frakes04]
W.B. Frakes, R. Baeza-Yates(2004): Information Retrieval: Data Structures & Algorithms.. unknown, .
- [Gamma94]
Erich Gamma, Richard Helm, Rslph Johnson, John Vlissides(1994): Design Patterns - Elements of Reusable Object-Oriented Software.. Addison-Wesley, Person, 201 W. 103rd Street, indianapolis, IN 46290.
- [Garcia05]
Hector Garcia-MolinacHandling Data Quality in Entity Resolution2005
- [Giunchiglia04]
F. Giunchiglia, P. Shvaiko (2004): Semantic Matching. University of Trento, Italien.
- [glue]
Matthias Kerzel (2003): GLUE - Learning to map between ontologies on the semantic web.
- [Gravano03]
L. Gravano, P.G. Ipeirotis, N. Koudas, D. Srivastavat (2003): Text Joins in an RDBMS for Web Data Integration. Columbia University..
- [Haskell98]
P. Hudak, J. Peterson, J.H. Fasel (1999): A Gentle Introduction to Haskell 98. Yale University, Department of Computerscience..
- [Hjelm01]
Hjelm, Johan(2001): Creating the semantic Web with RDF.Creating the semantic Web with RDF. Wiley, New York.
- [HS95]
M. Hernandez, S. StolfoThe merge/purge problem for large databases. (1995)
- [Inte03]
M.Bilenko, R.Mooney, W.Cohen, P. Ravikumar, St.Fienberg (2003): Adaptive Name Matching in Information Integration. Information Integration on the Web.,.
- [Ishkur06]
Keneth Taylor(2006): Ishkurs Guide to electronic music. www.di.fm/edmguide/edmguide.html
- [JSM2002]
Marco Fortini, Alessandra Nuccitelli, Brunero Liseo, Mauro Scanu (2002): Modelling Issues in Record Linkage: a Bayesian Perspective. .Rom.
- [Kalfoglou05]
Y. Kalfoglou, M. Schorlemmer (2005): Ontology Mapping: The State of the Arth. Dagstuhl Seminar Proceedingsa..
- [Kdd03]
Rohan Baxter, Peter Christen, Tim Churches (2003): A Comparison of fast Blocking Methods for Record Linkage. .Canberra.
- [Keogh04]
Eamonn Keogh, Stefano Lonardi, Chotirat Ann RatanamahatanaTowards Parameter-Free Data Mining2004
- [Klein]
Michel Klein (2001): Combining and relating ontologies - an analysis of problems and solutions. Vrije Universiteit Amsterdam..
- [Kordon05]
Dr.-ing. Ullrich Kordon (2005): Vorlesung Sprachsynthese. TU Dresden.
- [Lev06]
Die Levenshtein-Distanz (): www.levenshtein.de
- [Meh05]
Dipl.-Wi.-Ing. Marc Ehrig (2005): Ontology Alignment: Bridging the semantical gap.
- [Mehr04]
M.Ehrig, P.Haase, M.Hefke, N.Stojanovic (2004): Similarity for Ontologies - a Comprehensive Framework. Institute AIFB, University of Karlsruhe..
- [Monge97]
A.E.Monge, C.P. Elkan (1997): An efficient domain-indendent algorithm for detecting approximately duplicate database records. University of California, San Diego.
- [Mooney03]
Mikhail Bilenko, Raymond J. Mooney (2003): On Evaluation and Training-Set Construction for Duplicate Detection
- [NahmBilenk02]
Un Yong Nahm, Mikhail Bilenko, Raymond J. MooneyTwo Approaches to Handling Noisy Variation in Text Mining. (2002)
- [Niermann03]
Andrew Nierman H. V. Jagadish (2003): ProTDB: Probabilistic Data in XML. University of Michigan.
- [Nigam01]
Kamal Paul NigamUsing Unlabeled Data to Improve Text Classification2001
- [Nigam2000]
Andrew McCallum, Kamal Nigam, Lyle H. UngarEfficient Clustering of High- Dimensional Data-Set with Application to Refer
- [Noy05]
Natasha NoyOntology Mapping and Alignment2005
- [OLA04]
J. Euzenat, D. Loup, M. Touzani, P. Valtchev (2004): Ontology alignment with OLA. University of Montréal..
- [onto06]
Wikipedia(4.2006): Ontologie (Informatik).
- [openthesaur06]
Daniel Naber(2006): OpenThesaurus - Deutscher Thesaurus. www.openthesaurus.de
- [Portnoy00]
Leonid Portnoy (2000): Intrusion detection with unlabeled data using clustering. Columbia.
- [Rahm01]
E. Rahm, P.A. Bernstein(2001): A survey of approaches to automatic schema matching.
- [Rahm02]
H.-H. Do, S. Melnik, E. Rahm (2002): Comparison of Schema Matching Evaluations. University of Leipzig..
- [Rahm04]
Erhard Rahm, Hong Hai Do (): Data Cleaning: Problems and Current Approaches. .University of Leipzig, Germany.
- [RistadYanilo96]
Eric Sven Ristad, Peter N. Yainilos (): Learning Edit String Distance. .Princeton University.
- [Schaffert04]
S. Schaffert, F. Bry:Querying the Web Reconsidered (2004)
- [Schefels06]
Clemens SchefelsServing Xcerpt to the Web2006
- [Schreiber06]
M. SchreiberNeighbourhood-conscious Record Linkage.(2006)
- [Secstring06]
William W. Cohen, Pradeep Ravikumar, Stephen Fienberg(2006): SecondString. secondstring.sourceforge.net
- [SF06]
(2006): SourceForge. www.sourceforge.net
- [Simmetric06]
Sam Chapman(): Sam's String Metrics. http://www.dcs.shef.ac.uk/~sam/stringmetrics.html
- [Soundex00]
The U.S. National Archives and Records Administration(2000): The Soundex Indexing System. www.archives.gov/genealogy/census/soundex
- [Spire02]
I. Bartolini, P. Ciaccia, M. Patellai (2002): String Matching with Metric Trees. Using an Approximate Distance. University of Bologna, Italien..
- [Stoilos05]
Giorgos Stoilos, Giorgos Stamou, and Stefanos Kollias (2005): A String Metric for Ontology Alignment. Springer-Verlag.Berlin Heidelberg.
- [Studer05]
R. Studer, M. Ehrig, Y. Sure (2005): Automatische Wissensintegration mit Ontologien. Institut AIFB, Universität Karlsruhe..
- [Sven06]
Hans Eric SvenssonExtending Xcerpt with Ontology Queries.(2004)
- [Tailor02]
M. G. Elfeky, V.S.Verykios, A. K. ElmagarmidTAILOR: A Record Linkage Toolbox2002
- [vanReijsenn75]
Cornelis Joost van Rijsbergen(1975): Information retrieval.. Butterworths, London (UK).
- [Vldb01]
Luis Gravano, Panagiotis G. Ipeirotis, H. V. Jagadish, Nick Koudas (2001): Approximate String Joins in a Database (Almost) for Free. AT&T.Columbia University.
- [weka06]
Dr. Eibe Frank et.al.(2006): Weka 3: Data Mining Software in Java. www.cs.waikato.ac.nz/~ml/
- [Wiki06]
(2006): Precision and Recall. de.wikipedia.org/wiki/Precision
- [wikisound]
Wikipedia(2006): Soundex. de.wikipedia.org/wiki/Soundex
- [Wilk06]
H. E. Svensson, E. Wilk (2006): XML Querying Using Ontological Information. Linköping University, Schweden..
- [Winkler00]
William E. Winkler (2000): Frequency-Based Matching in Fellegi-Sunter Model of. .Washington.
- [Winkler03]
William E. Winkler (2003): Data Cleaning Methods. .Washington.
- [Winkler90]
W. E. Winkler (1990): An Application of the FELLEGI-SUNTER Model of Reco. Washington.
- [Winkler91]
William E. Winkler (): Record Linkage Software and Methods for Merging A. Washington.
- [Winkler93]
William E Winkler (1993): Matching and Record Linkage. .Washington.
- [Winkler99]
W.E. Winkler (1999): The State of Record Linkage and Current Research Problems. U. S. Bureau of the Censuso..
- [wordnet06]
George A. Miler(2006): WordNet - a lexical database for the English language. wordnet.princeton.edu
- [Wu03]
Tianhao Wu (2003): Theory and Algorithms for Information Extraction and Classification in Text. .CSE Department, Lehigh University.
- [Xcerpt04]
S.Schaffert (2004): Xcerpt: A Rule-Based Query and Transformation Language for the web.
- [XcerptFact05]
S. Berger, F. Bry, T. Furche, B. Linse, S. SchaffertThe Web and semantic web query language Xcerpt2005
- [XcerptUseCase04]
Sebastian Kraus (2004): Use Cases für Xcerpt - eine positionelle Anfrage und Transformationssprache für das Web
- [Zhu01]
J. Joanne Zhu, Lyle H. Ungar (): String Edit Analysis for Merging Databases. Philadelphia.