Bibliography

[Anan03]

R. Ananthakrishna, S. Chaudhuri, V. Ganti (2003): Eliminating Fuzzy Duplicates in Data Warehouses. Hong Kong.

[Balouch03]

A. Balouch, A. Heuer, H. Meyer (2003): Schema Integration und XQuery Core in Digitalen Bibliotheken. Lehrstuhl Datenbank- und Informationssysteme, Rostock.

[Bhattdmkd04]

Indrajit Bhattacharya, Lise Getoor (2004): Iterative Record Linkage for Cleaning and Integration. Maryland.

[Bilen03]

Mikhail Bilenko, Raymond J. Mooney: Adaptive Duplicate Detection Using Learnable String Similarity Measures.

[Brockhaus99]

Brockhaus (1999): Brockhaus. Die Enzyklopädie in 24 Bänden. Bibliographisches Institut & F. A. Brockhaus AG, Mannheim.

[Cai05]

Rita Sharma, David Poole: Probability and Equality: A Probabilistic Model of Identity Uncertainty. British Columbia.

[Callumnigamco]

A. McCallum, K. Nigam, J. Rennie, K. Seymore (1999): A Machine Learning Approach to Building Domain-Specific Search Engines. Pittsburgh.

[Callumnigamc2]

A. McCallum, K. Nigam, J. Rennie, K. Seymore (2000): Automating the Construction of Internet Portals with Machine Learning. Pittsburgh.

[CDay]

C. Day (1997): A Checklist for Evaluating Record Linkage Software.

[CIAC2000]

Yusuke Shibata, Takuya Kida, Shuichi Fukamachi et al. (2000): Speeding Up Pattern Matching by Text Compression. Kyushu University 33, Dept. of Informatics, Fukuoka 812-8581.

[Cohen03]

W. W. Cohen, P. Ravikumar, S. E. Fienberg (2003): A Comparison of String Distance Metrics for Name-Matching Tasks. American Association for Artificial Intelligence.

[Cupid01]

E. Rahm, P. A. Bernstein, J. Madhavan (2001): Generic Schema Matching with Cupid. Proceedings of the 27th VLDB Conference.

[Doan01]

AnHai Doan, Ying Lu, Yoonkyong Lee, Jiawei Han (2001): Object Matching for Information Integration: A Profiler-Based Approach. Illinois.

[DuplDet06]

Tobias Ackermann, Florian Weigold (2006): Duplicate Detection using Machine Learning.

[Febrl05]

Peter Christen, Tim Churches (2003): Febrl - freely extensible biomedical record linkage. Technical Manual.

[Fellegi69]

I. P. Fellegi, A. B. Sunter (1969): A Theory for Record Linkage. Journal of the American Statistical Association, Washington.

[Ferber03]

Reginald Ferber (2003): Information Retrieval - Suchmodelle und Data-Mining. dpunkt Verlag, Heidelberg.

[Frakes04]

W. B. Frakes, R. Baeza-Yates (2004): Information Retrieval: Data Structures & Algorithms.

[Gamma94]

Erich Gamma, Richard Helm, Ralph Johnson, John Vlissides (1994): Design Patterns - Elements of Reusable Object-Oriented Software. Addison-Wesley, Pearson, 201 W. 103rd Street, Indianapolis, IN 46290.

[Garcia05]

Hector Garcia-Molina (2005): Handling Data Quality in Entity Resolution.

[Giunchiglia04]

F. Giunchiglia, P. Shvaiko (2004): Semantic Matching. University of Trento, Italy.

[glue]

Matthias Kerzel (2003): GLUE - Learning to map between ontologies on the semantic web.

[Gravano03]

L. Gravano, P. G. Ipeirotis, N. Koudas, D. Srivastava (2003): Text Joins in an RDBMS for Web Data Integration. Columbia University.

[Haskell98]

P. Hudak, J. Peterson, J. H. Fasel (1999): A Gentle Introduction to Haskell 98. Yale University, Department of Computer Science.

[Hjelm01]

Johan Hjelm (2001): Creating the Semantic Web with RDF. Wiley, New York.

[HS95]

M. Hernandez, S. Stolfo (1995): The merge/purge problem for large databases.

[Inte03]

M. Bilenko, R. Mooney, W. Cohen, P. Ravikumar, S. Fienberg (2003): Adaptive Name Matching in Information Integration. Information Integration on the Web.

[Ishkur06]

Kenneth Taylor (2006): Ishkur's Guide to Electronic Music. www.di.fm/edmguide/edmguide.html

[JSM2002]

Marco Fortini, Alessandra Nuccitelli, Brunero Liseo, Mauro Scanu (2002): Modelling Issues in Record Linkage: a Bayesian Perspective. Rome.

[Kalfoglou05]

Y. Kalfoglou, M. Schorlemmer (2005): Ontology Mapping: The State of the Art. Dagstuhl Seminar Proceedings.

[Kdd03]

Rohan Baxter, Peter Christen, Tim Churches (2003): A Comparison of Fast Blocking Methods for Record Linkage. Canberra.

[Keogh04]

Eamonn Keogh, Stefano Lonardi, Chotirat Ann Ratanamahatana (2004): Towards Parameter-Free Data Mining.

[Klein]

Michel Klein (2001): Combining and relating ontologies - an analysis of problems and solutions. Vrije Universiteit Amsterdam.

[Kordon05]

Dr.-Ing. Ullrich Kordon (2005): Vorlesung Sprachsynthese. TU Dresden.

[Lev06]

Die Levenshtein-Distanz: www.levenshtein.de

[Meh05]

Dipl.-Wi.-Ing. Marc Ehrig (2005): Ontology Alignment: Bridging the semantical gap.

[Mehr04]

M. Ehrig, P. Haase, M. Hefke, N. Stojanovic (2004): Similarity for Ontologies - a Comprehensive Framework. Institute AIFB, University of Karlsruhe.

[Monge97]

A. E. Monge, C. P. Elkan (1997): An efficient domain-independent algorithm for detecting approximately duplicate database records. University of California, San Diego.

[Mooney03]

Mikhail Bilenko, Raymond J. Mooney (2003): On Evaluation and Training-Set Construction for Duplicate Detection.

[NahmBilenk02]

Un Yong Nahm, Mikhail Bilenko, Raymond J. Mooney (2002): Two Approaches to Handling Noisy Variation in Text Mining.

[Niermann03]

Andrew Nierman, H. V. Jagadish (2003): ProTDB: Probabilistic Data in XML. University of Michigan.

[Nigam01]

Kamal Paul Nigam (2001): Using Unlabeled Data to Improve Text Classification.

[Nigam2000]

Andrew McCallum, Kamal Nigam, Lyle H. Ungar: Efficient Clustering of High-Dimensional Data-Set with Application to Refer

[Noy05]

Natasha Noy (2005): Ontology Mapping and Alignment.

[OLA04]

J. Euzenat, D. Loup, M. Touzani, P. Valtchev (2004): Ontology alignment with OLA. University of Montréal.

[onto06]

Wikipedia (April 2006): Ontologie (Informatik).

[openthesaur06]

Daniel Naber (2006): OpenThesaurus - Deutscher Thesaurus. www.openthesaurus.de

[Portnoy00]

Leonid Portnoy (2000): Intrusion detection with unlabeled data using clustering. Columbia.

[Rahm01]

E. Rahm, P. A. Bernstein (2001): A survey of approaches to automatic schema matching.

[Rahm02]

H.-H. Do, S. Melnik, E. Rahm (2002): Comparison of Schema Matching Evaluations. University of Leipzig.

[Rahm04]

Erhard Rahm, Hong Hai Do: Data Cleaning: Problems and Current Approaches. University of Leipzig, Germany.

[RistadYanilo96]

Eric Sven Ristad, Peter N. Yianilos: Learning String Edit Distance. Princeton University.

[Schaffert04]

S. Schaffert, F. Bry (2004): Querying the Web Reconsidered.

[Schefels06]

Clemens Schefels (2006): Serving Xcerpt to the Web.

[Schreiber06]

M. Schreiber (2006): Neighbourhood-conscious Record Linkage.

[Secstring06]

William W. Cohen, Pradeep Ravikumar, Stephen Fienberg (2006): SecondString. secondstring.sourceforge.net

[SF06]

SourceForge (2006): www.sourceforge.net

[Simmetric06]

Sam Chapman: Sam's String Metrics. http://www.dcs.shef.ac.uk/~sam/stringmetrics.html

[Soundex00]

The U.S. National Archives and Records Administration (2000): The Soundex Indexing System. www.archives.gov/genealogy/census/soundex

[Spire02]

I. Bartolini, P. Ciaccia, M. Patella (2002): String Matching with Metric Trees Using an Approximate Distance. University of Bologna, Italy.

[Stoilos05]

Giorgos Stoilos, Giorgos Stamou, Stefanos Kollias (2005): A String Metric for Ontology Alignment. Springer-Verlag, Berlin Heidelberg.

[Studer05]

R. Studer, M. Ehrig, Y. Sure (2005): Automatische Wissensintegration mit Ontologien. Institut AIFB, Universität Karlsruhe.

[Sven06]

Hans Eric Svensson (2004): Extending Xcerpt with Ontology Queries.

[Tailor02]

M. G. Elfeky, V. S. Verykios, A. K. Elmagarmid (2002): TAILOR: A Record Linkage Toolbox.

[vanReijsenn75]

Cornelis Joost van Rijsbergen (1975): Information Retrieval. Butterworths, London (UK).

[Vldb01]

Luis Gravano, Panagiotis G. Ipeirotis, H. V. Jagadish, Nick Koudas (2001): Approximate String Joins in a Database (Almost) for Free. AT&T, Columbia University.

[weka06]

Dr. Eibe Frank et al. (2006): Weka 3: Data Mining Software in Java. www.cs.waikato.ac.nz/~ml/

[Wiki06]

Wikipedia (2006): Precision and Recall. de.wikipedia.org/wiki/Precision

[wikisound]

Wikipedia (2006): Soundex. de.wikipedia.org/wiki/Soundex

[Wilk06]

H. E. Svensson, E. Wilk (2006): XML Querying Using Ontological Information. Linköping University, Sweden.

[Winkler00]

William E. Winkler (2000): Frequency-Based Matching in Fellegi-Sunter Model of. Washington.

[Winkler03]

William E. Winkler (2003): Data Cleaning Methods. Washington.

[Winkler90]

W. E. Winkler (1990): An Application of the FELLEGI-SUNTER Model of Reco. Washington.

[Winkler91]

William E. Winkler: Record Linkage Software and Methods for Merging A. Washington.

[Winkler93]

William E. Winkler (1993): Matching and Record Linkage. Washington.

[Winkler99]

W. E. Winkler (1999): The State of Record Linkage and Current Research Problems. U.S. Bureau of the Census.

[wordnet06]

George A. Miller (2006): WordNet - a lexical database for the English language. wordnet.princeton.edu

[Wu03]

Tianhao Wu (2003): Theory and Algorithms for Information Extraction and Classification in Text. CSE Department, Lehigh University.

[Xcerpt04]

S. Schaffert (2004): Xcerpt: A Rule-Based Query and Transformation Language for the Web.

[XcerptFact05]

S. Berger, F. Bry, T. Furche, B. Linse, S. Schaffert (2005): The Web and Semantic Web Query Language Xcerpt.

[XcerptUseCase04]

Sebastian Kraus (2004): Use Cases für Xcerpt - eine positionelle Anfrage- und Transformationssprache für das Web.

[Zhu01]

J. Joanne Zhu, Lyle H. Ungar: String Edit Analysis for Merging Databases. Philadelphia.
