Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Collection, evaluation and selection of scientific literature : machine learning, bibliometrics and the World Wide Web

Thesis (MSc)--University of Stellenbosch, 2004.

Saved in:
Bibliographic Details
Main Author: Connan, James
Other Authors: Omlin, Christian W.
Format: Thesis
Language:en_ZA
Published: Stellenbosch : Stellenbosch University 2012
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867614004439941120
access_status_str Open Access
author Connan, James
author2 Omlin, Christian W.
author_browse Connan, James
Omlin, Christian W.
author_facet Omlin, Christian W.
Connan, James
author_sort Connan, James
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (MSc)--University of Stellenbosch, 2004.
format Thesis
id oai:scholar.sun.ac.za:10019.1/49886
institution Stellenbosch University (South Africa)
language en_ZA
last_indexed 2026-06-10T12:45:08.467Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2012
publishDateRange 2012
publishDateSort 2012
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/49886 Collection, evaluation and selection of scientific literature : machine learning, bibliometrics and the World Wide Web Connan, James Omlin, Christian W. Stellenbosch University. Faculty of Science. Dept. of Mathematical Sciences. Machine learning Online bibliographic searching Bibliometrics Markov processes World Wide Web Information storage and retrieval systems -- Science Dissertations -- Computer science Theses -- Computer science Thesis (MSc)--University of Stellenbosch, 2004. ENGLISH ABSTRACT: We present a system that uses statistical machine learning to identify and extract bibliography information from scientific literature. Techniques for finding and gathering useful information from the ever growing volume of knowledge on the World Wide Web (WWW), are investigated. We use hidden Markov models both for recognition of bibliography styles and extraction of bibliographic information with an accuracy of up to 97%. The accuracy with which we are able to extract this information allows us to present a case study in which we apply methods of citation analysis to information extracted from three areas of machine learning. We use this information to identify core sets of papers that have made significant contributions to the fields of hidden Markov models, neural networks and recurrent neural networks. AFRIKAANSE OPSOMMING: Ons bied 'n sisteem aan wat gebruik maak van statistiese masjiene wat leer om bibliografiese inligting uit wetenskaplikke literatuur te identifiseer en ontgin. Tegnieke wat aangewend word vir die verkenning en insameling van nuttige inligting vanaf die snel groeiende kennisbron van die WWW, word ondersoek. Ons gebruik verskuilde Markov modelle vir die herkenning van verwysingsstyl en ontginning van verwysingsinligting met 'n akuraatheidspeil van to 97%. Hierdie hoë ontginningsakuraatheid stelons in staat om 'n toepassing van die tegniek op die veld van masjiene wat leer toe te pas. Ons rapporteer hoe ons die tegnieke gebruik het om literatuur wat beduidende bydraes in die velde van verskuilde Markov modelle, neurale netwerke en terugkerende neurale netwerke, te identifiseer. 2012-08-27T11:33:08Z 2012-08-27T11:33:08Z 2004-12 Thesis http://hdl.handle.net/10019.1/49886 en_ZA Stellenbosch University 63 p. : ill. application/pdf Stellenbosch : Stellenbosch University
spellingShingle Machine learning
Online bibliographic searching
Bibliometrics
Markov processes
World Wide Web
Information storage and retrieval systems -- Science
Dissertations -- Computer science
Theses -- Computer science
Connan, James
Collection, evaluation and selection of scientific literature : machine learning, bibliometrics and the World Wide Web
title Collection, evaluation and selection of scientific literature : machine learning, bibliometrics and the World Wide Web
title_full Collection, evaluation and selection of scientific literature : machine learning, bibliometrics and the World Wide Web
title_fullStr Collection, evaluation and selection of scientific literature : machine learning, bibliometrics and the World Wide Web
title_full_unstemmed Collection, evaluation and selection of scientific literature : machine learning, bibliometrics and the World Wide Web
title_short Collection, evaluation and selection of scientific literature : machine learning, bibliometrics and the World Wide Web
title_sort collection evaluation and selection of scientific literature machine learning bibliometrics and the world wide web
topic Machine learning
Online bibliographic searching
Bibliometrics
Markov processes
World Wide Web
Information storage and retrieval systems -- Science
Dissertations -- Computer science
Theses -- Computer science
url http://hdl.handle.net/10019.1/49886
work_keys_str_mv AT connanjames collectionevaluationandselectionofscientificliteraturemachinelearningbibliometricsandtheworldwideweb