Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

ADR-Miner: An Ant-based data reduction algorithm for classification

Classi cation is a central problem in the elds of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classi er) that can be used to predict the class of new unlabeled instances. Data preparation is crucial to the data mining process, a...

Full description

Saved in:
Bibliographic Details
Main Author: Abdel Salam, Ismail Mohamed Anwar
Format: Thesis
Published: AUC Knowledge Fountain 2015
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613407846334464
access_status_str Open Access
author Abdel Salam, Ismail Mohamed Anwar
author_browse Abdel Salam, Ismail Mohamed Anwar
author_facet Abdel Salam, Ismail Mohamed Anwar
author_sort Abdel Salam, Ismail Mohamed Anwar
collection Thesis
dc_rights_str_mv The author retains all rights with regard to copyright. The author certifies that written permission from the owner(s) of third-party copyrighted matter included in the thesis, dissertation, paper, or record of study has been obtained. The author further certifies that IRB approval has been obtained for this thesis, or that IRB approval is not necessary for this thesis. Insofar as this thesis, dissertation, paper, or record of study is an educational record as defined in the Family Educational Rights and Privacy Act (FERPA) (20 USC 1232g), the author has granted consent to disclosure of it to anyone who requests a copy.
description Classi cation is a central problem in the elds of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classi er) that can be used to predict the class of new unlabeled instances. Data preparation is crucial to the data mining process, and its focus is to improve the tness of the training data for the learning algorithms to produce more e ective classi ers. Two widely applied data preparation methods are feature selection and instance selection, which fall under the umbrella of data reduction. For my research I propose ADR-Miner, a novel data reduction algorithm that utilizes ant colony optimization (ACO). ADR-Miner is designed to perform instance selection to improve the predictive e ectiveness of the constructed classi cation models. Two versions of ADR-Miner are developed: a base version that uses a single classi cation algorithm during both training and testing, and an extended version which uses separate classi cation algorithms for each phase. The base version of the ADR-Miner algorithm is evaluated against 20 data sets using three classi cation algorithms, and the results are compared to a benchmark data reduction algorithm. The non-parametric Wilcoxon signed-ranks test will is employed to gauge the statistical signi cance of the results obtained. The extended version of ADR-Miner is evaluated against 37 data sets using pairings from fi ve classi cation algorithms and these results are benchmarked against the performance of the classi cation algorithms but without reduction applied as pre-processing. Keywords: Ant Colony Optimization (ACO), Data Mining, Classi cation, Data Reduction.
format Thesis
id oai:fount.aucegypt.edu:etds-1129
institution American University in Cairo (Egypt)
last_indexed 2026-06-10T12:35:39.635Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from AUC Knowledge Fountain — bepress
publishDate 2015
publishDateRange 2015
publishDateSort 2015
publisher AUC Knowledge Fountain
publisherStr AUC Knowledge Fountain
record_format dspace
source_str AUC Knowledge Fountain — bepress
spelling oai:fount.aucegypt.edu:etds-1129 ADR-Miner: An Ant-based data reduction algorithm for classification Abdel Salam, Ismail Mohamed Anwar Classi cation is a central problem in the elds of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classi er) that can be used to predict the class of new unlabeled instances. Data preparation is crucial to the data mining process, and its focus is to improve the tness of the training data for the learning algorithms to produce more e ective classi ers. Two widely applied data preparation methods are feature selection and instance selection, which fall under the umbrella of data reduction. For my research I propose ADR-Miner, a novel data reduction algorithm that utilizes ant colony optimization (ACO). ADR-Miner is designed to perform instance selection to improve the predictive e ectiveness of the constructed classi cation models. Two versions of ADR-Miner are developed: a base version that uses a single classi cation algorithm during both training and testing, and an extended version which uses separate classi cation algorithms for each phase. The base version of the ADR-Miner algorithm is evaluated against 20 data sets using three classi cation algorithms, and the results are compared to a benchmark data reduction algorithm. The non-parametric Wilcoxon signed-ranks test will is employed to gauge the statistical signi cance of the results obtained. The extended version of ADR-Miner is evaluated against 37 data sets using pairings from fi ve classi cation algorithms and these results are benchmarked against the performance of the classi cation algorithms but without reduction applied as pre-processing. Keywords: Ant Colony Optimization (ACO), Data Mining, Classi cation, Data Reduction. 2015-06-01T07:00:00Z thesis application/pdf https://fount.aucegypt.edu/etds/130 https://fount.aucegypt.edu/context/etds/article/1129/viewcontent/Thesis__MSc_IsmailAnwar_Final.pdf The author retains all rights with regard to copyright. The author certifies that written permission from the owner(s) of third-party copyrighted matter included in the thesis, dissertation, paper, or record of study has been obtained. The author further certifies that IRB approval has been obtained for this thesis, or that IRB approval is not necessary for this thesis. Insofar as this thesis, dissertation, paper, or record of study is an educational record as defined in the Family Educational Rights and Privacy Act (FERPA) (20 USC 1232g), the author has granted consent to disclosure of it to anyone who requests a copy. Theses and Dissertations AUC Knowledge Fountain Ant Colony Optimization (ACO) Data Mining
spellingShingle Ant Colony Optimization (ACO)
Data Mining
Abdel Salam, Ismail Mohamed Anwar
ADR-Miner: An Ant-based data reduction algorithm for classification
title ADR-Miner: An Ant-based data reduction algorithm for classification
title_full ADR-Miner: An Ant-based data reduction algorithm for classification
title_fullStr ADR-Miner: An Ant-based data reduction algorithm for classification
title_full_unstemmed ADR-Miner: An Ant-based data reduction algorithm for classification
title_short ADR-Miner: An Ant-based data reduction algorithm for classification
title_sort adr miner an ant based data reduction algorithm for classification
topic Ant Colony Optimization (ACO)
Data Mining
url https://fount.aucegypt.edu/etds/130
https://fount.aucegypt.edu/context/etds/article/1129/viewcontent/Thesis__MSc_IsmailAnwar_Final.pdf
work_keys_str_mv AT abdelsalamismailmohamedanwar adrmineranantbaseddatareductionalgorithmforclassification