Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Machine learning, data mining, and the World Wide Web : design of special-purpose search engines

Thesis (MSc)--Stellenbosch University, 2003.

Saved in:
Bibliographic Details
Main Author: Kruger, Andries F.
Other Authors: Omlin, Christian W.
Format: Thesis
Language:en_ZA
Published: Stellenbosch : Stellenbosch University 2012
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867614139751333888
access_status_str Open Access
author Kruger, Andries F.
author2 Omlin, Christian W.
author_browse Kruger, Andries F.
Omlin, Christian W.
author_facet Omlin, Christian W.
Kruger, Andries F.
author_sort Kruger, Andries F.
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MSc)--Stellenbosch University, 2003.
format Thesis
id oai:scholar.sun.ac.za:10019.1/53492
institution Stellenbosch University (South Africa)
language en_ZA
last_indexed 2026-06-10T12:47:17.937Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2012
publishDateRange 2012
publishDateSort 2012
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/53492 Machine learning, data mining, and the World Wide Web : design of special-purpose search engines Kruger, Andries F. Omlin, Christian W. Stellenbosch University. Faculty of Science. Department of Mathematical Sciences. Search engines Data mining Computer-assisted instruction Web sites -- Abstracting and indexing Machine learning DEADLINER Bayesian framework Thesis (MSc)--Stellenbosch University, 2003. ENGLISH ABSTRACT: We present DEADLINER, a special-purpose search engine that indexes conference and workshop announcements, and which extracts a range of academic information from the Web. SVMs provide an efficient and highly accurate mechanism for obtaining relevant web documents. DEADLINER currently extracts speakers, locations (e.g. countries), dates, paper submission (and other) deadlines, topics, program committees, abstracts, and affiliations. Complex and detailed searches are possible on these fields. The niche search engine was constructed by employing a methodology for rapid implementation of specialised search engines. Bayesian integration of simple extractors provides this methodology, that avoids complex hand-tuned text extraction methods. The simple extractors exploit loose formatting and keyword conventions. The Bayesian framework further produces a search engine where each user can control each fields false alarm rate in an intuitive and rigorous fashion, thus providing easy-to-use metadata. AFRIKAANSE OPSOMMING: Ons stel DEADLINER bekend: 'n soekmasjien wat konferensie en werkvergaderingsaankondigings katalogiseer en wat uiteindelik 'n wye reeks akademiese byeenkomsmateriaal sal monitor en onttrek uit die Web. DEAD LINER herken en onttrek tans sprekers, plekke (bv. landname), datums, o.a. sperdatums vir die inlewering van akademiese verrigtings, onderwerpe, programkomiteë, oorsigte of opsommings, en affiliasies. 'n Grondige soek is moontlik oor en deur hierdie velde. Die nissoekmasjien is gebou deur gebruik te maak van 'n metodologie vir die vinnige oprigting van spesialiteitsoekmasjiene. Die metodologie vermy komplekse instelling m.b.v. hande-arbeid van die teksuittreksels deur gebruik te maak van Bayesiese integrering van eenvoudige ontsluiters. Die ontsluiters buit dan styl- en gewoonte-sleutelwoorde uit. Die Bayesiese raamwerk skep hierdeur 'n soekmasjien wat gebruikers toelaat om elke veld se kans om verkeerd te kies op 'n intuïtiewe en deeglike manier te beheer. 2012-08-27T11:35:30Z 2012-08-27T11:35:30Z 2003-04 Thesis http://hdl.handle.net/10019.1/53492 en_ZA Stellenbosch University 1 v. (various pagings) : illustrations application/pdf Stellenbosch : Stellenbosch University
spellingShingle Search engines
Data mining
Computer-assisted instruction
Web sites -- Abstracting and indexing
Machine learning
DEADLINER
Bayesian framework
Kruger, Andries F.
Machine learning, data mining, and the World Wide Web : design of special-purpose search engines
title Machine learning, data mining, and the World Wide Web : design of special-purpose search engines
title_full Machine learning, data mining, and the World Wide Web : design of special-purpose search engines
title_fullStr Machine learning, data mining, and the World Wide Web : design of special-purpose search engines
title_full_unstemmed Machine learning, data mining, and the World Wide Web : design of special-purpose search engines
title_short Machine learning, data mining, and the World Wide Web : design of special-purpose search engines
title_sort machine learning data mining and the world wide web design of special purpose search engines
topic Search engines
Data mining
Computer-assisted instruction
Web sites -- Abstracting and indexing
Machine learning
DEADLINER
Bayesian framework
url http://hdl.handle.net/10019.1/53492
work_keys_str_mv AT krugerandriesf machinelearningdataminingandtheworldwidewebdesignofspecialpurposesearchengines