Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

On supporting K-anonymisation and L-diversity of crime databases with genetic algorithms in a resource constrained environment

The social benefits derived from analysing crime data need to be weighed against issues relating to privacy loss. To facilitate such analysis of crime data Burke and Kayem [7] proposed a framework (MCRF) to enable mobile crime reporting in a developing country. Here crimes are reported via mobile ph...

Full description

Saved in:
Bibliographic Details
Main Author: Verster, Cornelis Thomas
Other Authors: Kayem, Anne
Format: Thesis
Language:English
Published: Department of Computer Science 2016
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613332682309632
access_status_str Open Access
author Verster, Cornelis Thomas
author2 Kayem, Anne
author_browse Kayem, Anne
Verster, Cornelis Thomas
author_facet Kayem, Anne
Verster, Cornelis Thomas
author_sort Verster, Cornelis Thomas
collection Thesis
description The social benefits derived from analysing crime data need to be weighed against issues relating to privacy loss. To facilitate such analysis of crime data Burke and Kayem [7] proposed a framework (MCRF) to enable mobile crime reporting in a developing country. Here crimes are reported via mobile phones and stored in a database owned by a law enforcement agency. The expertise required to perform analysis on the crime data is however unlikely to be available within the law enforcement agency. Burke and Kayem [7] proposed anonymising the data(using manual input parameters) at the law enforcement agency before sending it to a third party for analysis. Whilst analysis of the crime data requires expertise, adequate skill to appropriately anonymise the data is also required. What is lacking in the original MCRF is therefore an automated scheme for the law enforcement agency to adequately anonymise the data before sending it to the third party. This should, however, be done whilst maximising information utility of the anonymised data from the perspective of the third party. In this thesis we introduce a crime severity scale to facilitate the automation of data anonymisation within the MCRF. We consider a modified loss metric to capture information loss incurred during the anonymisation process. This modified loss metric also gives third party users the flexibility to specify attributes of the anonymised data when requesting data from the law enforcement agency. We employ a genetic algorithm(GA) approach called "Crime Genes"(CG) to optimise utility of the anonymised data based on our modified loss metric whilst adhering to notions of privacy denned by k-anonymity and l-diversity. Our CG implementation is modular and can therefore be easily integrated with the original MCRF. We also show how our CG approach is designed to be suitable for implementation in a developing country where particular resource constraints exist.
format Thesis
id oai:open.uct.ac.za:11427/20016
institution University of Cape Town (South Africa)
language eng
last_indexed 2026-06-10T12:34:27.383Z
license_str Not specified — see source repository
provenance_str_mv Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate 2016
publishDateRange 2016
publishDateSort 2016
publisher Department of Computer Science
publisherStr Department of Computer Science
record_format dspace
source_str UCTD — University of Cape Town Open Access Repository
spelling oai:open.uct.ac.za:11427/20016 On supporting K-anonymisation and L-diversity of crime databases with genetic algorithms in a resource constrained environment Verster, Cornelis Thomas Kayem, Anne Information Technology The social benefits derived from analysing crime data need to be weighed against issues relating to privacy loss. To facilitate such analysis of crime data Burke and Kayem [7] proposed a framework (MCRF) to enable mobile crime reporting in a developing country. Here crimes are reported via mobile phones and stored in a database owned by a law enforcement agency. The expertise required to perform analysis on the crime data is however unlikely to be available within the law enforcement agency. Burke and Kayem [7] proposed anonymising the data(using manual input parameters) at the law enforcement agency before sending it to a third party for analysis. Whilst analysis of the crime data requires expertise, adequate skill to appropriately anonymise the data is also required. What is lacking in the original MCRF is therefore an automated scheme for the law enforcement agency to adequately anonymise the data before sending it to the third party. This should, however, be done whilst maximising information utility of the anonymised data from the perspective of the third party. In this thesis we introduce a crime severity scale to facilitate the automation of data anonymisation within the MCRF. We consider a modified loss metric to capture information loss incurred during the anonymisation process. This modified loss metric also gives third party users the flexibility to specify attributes of the anonymised data when requesting data from the law enforcement agency. We employ a genetic algorithm(GA) approach called "Crime Genes"(CG) to optimise utility of the anonymised data based on our modified loss metric whilst adhering to notions of privacy denned by k-anonymity and l-diversity. Our CG implementation is modular and can therefore be easily integrated with the original MCRF. We also show how our CG approach is designed to be suitable for implementation in a developing country where particular resource constraints exist. 2016-06-10T10:55:05Z 2016-06-10T10:55:05Z 2015 Master Thesis Masters MSc http://hdl.handle.net/11427/20016 eng application/pdf Department of Computer Science Faculty of Science University of Cape Town
spellingShingle Information Technology
Verster, Cornelis Thomas
On supporting K-anonymisation and L-diversity of crime databases with genetic algorithms in a resource constrained environment
thesis_degree_str Master's
title On supporting K-anonymisation and L-diversity of crime databases with genetic algorithms in a resource constrained environment
title_full On supporting K-anonymisation and L-diversity of crime databases with genetic algorithms in a resource constrained environment
title_fullStr On supporting K-anonymisation and L-diversity of crime databases with genetic algorithms in a resource constrained environment
title_full_unstemmed On supporting K-anonymisation and L-diversity of crime databases with genetic algorithms in a resource constrained environment
title_short On supporting K-anonymisation and L-diversity of crime databases with genetic algorithms in a resource constrained environment
title_sort on supporting k anonymisation and l diversity of crime databases with genetic algorithms in a resource constrained environment
topic Information Technology
url http://hdl.handle.net/11427/20016
work_keys_str_mv AT verstercornelisthomas onsupportingkanonymisationandldiversityofcrimedatabaseswithgeneticalgorithmsinaresourceconstrainedenvironment