Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Training neural word embeddings for transfer learning and translation

Thesis (D. Phi)--Stellenbosch University, 2016.

Saved in:
Bibliographic Details
Main Author: Gouws, Stephan
Other Authors: Van Rooyen, G-J.
Format: Thesis
Language:en_ZA
Published: Stellenbosch : Stellenbosch University 2016
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613792013123584
access_status_str Open Access
author Gouws, Stephan
author2 Van Rooyen, G-J.
author_browse Gouws, Stephan
Van Rooyen, G-J.
author_facet Van Rooyen, G-J.
Gouws, Stephan
author_sort Gouws, Stephan
collection Thesis
dc_rights_str_mv Stellenbosch University
description Thesis (D. Phi)--Stellenbosch University, 2016.
format Thesis
id oai:scholar.sun.ac.za:10019.1/98758
institution Stellenbosch University (South Africa)
language en_ZA
last_indexed 2026-06-10T12:41:46.341Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2016
publishDateRange 2016
publishDateSort 2016
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/98758 Training neural word embeddings for transfer learning and translation Gouws, Stephan Van Rooyen, G-J. Bengio, Yoshua Hovy, Eduard Stellenbosch University. Faculty of Engineering. Dept. of Electrical and Electronic Engineering. Semantic computing UCTD Neural computers Learning systems (Automatic control) Translators (computer programs) Thesis (D. Phi)--Stellenbosch University, 2016. ENGLISH ABSTRACT: In contrast to only a decade ago, it is now easy to collect large text corpora from theWeb on any topic imaginable. However, in order for information processing systems to perform a useful task, such as answer a user’s queries on the content of the text, the raw text first needs to be parsed into the appropriate linguistic structures, like parts of speech, named-entities or semantic entities. Contemporary natural language processing systems rely predominantly on supervised machine learning techniques for performing this task. However, the supervision required to train these models are expensive to come by, since human annotators need to mark up relevant pieces of text with the required labels of interest. Furthermore, machine learning practitioners need to manually engineer a set of task-specific features which represents a wasteful duplication of efforts for each new task. An alternative approach is to attempt to automatically learn representations from raw text that are useful for predicting a wide variety of linguistic structures. In this dissertation, we hypothesise that neural word embeddings, i.e. representations that use continuous values to represent words in a learned vector space of meaning, are a suitable and efficient approach for learning representations of natural languages that are useful for predicting various aspects related to their meaning. We show experimental results which support this hypothesis, and present several contributions which make inducing word representations faster and applicable for monolingual and various cross-lingual prediction tasks. The first contribution to this end is SimTree, an efficient algorithm for jointly clustering words into semantic classes while training a neural network language model with the hierarchical softmax output layer. The second is an efficient subsampling training technique for speeding up learning while increasing accuracy of word embeddings induced using the hierarchical softmax. The third is BilBOWA, a bilingual word embedding model that can efficiently learn to embed words across multiple languages using only a limited sample of parallel raw text, and unlimited amounts of monolingual raw text. The fourth is Barista, a bilingual word embedding model that efficiently uses additional semantic information about how words map into equivalence classes, such as parts of speech or word translations, and includes this information during the embedding process. In addition, this dissertation provides an in-depth overview of the different neural language model architectures, and a detailed, tutorial-style overview of the available popular techniques for training these models. AFRIKAANSE OPSOMMING: Geen opsomming beskikbaar Doctoral 2016-03-09T14:58:11Z 2016-03-09T14:58:11Z 2016-03 Thesis http://hdl.handle.net/10019.1/98758 en_ZA Stellenbosch University xii, 129 pages application/pdf Stellenbosch : Stellenbosch University
spellingShingle Semantic computing
UCTD
Neural computers
Learning systems (Automatic control)
Translators (computer programs)
Gouws, Stephan
Training neural word embeddings for transfer learning and translation
title Training neural word embeddings for transfer learning and translation
title_full Training neural word embeddings for transfer learning and translation
title_fullStr Training neural word embeddings for transfer learning and translation
title_full_unstemmed Training neural word embeddings for transfer learning and translation
title_short Training neural word embeddings for transfer learning and translation
title_sort training neural word embeddings for transfer learning and translation
topic Semantic computing
UCTD
Neural computers
Learning systems (Automatic control)
Translators (computer programs)
url http://hdl.handle.net/10019.1/98758
work_keys_str_mv AT gouwsstephan trainingneuralwordembeddingsfortransferlearningandtranslation