Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
Classi cation is a central problem in the elds of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classi er) that can be used to predict the class of new unlabeled instances. Data preparation is crucial to the data mining process, a...
| Main Author: | |
|---|---|
| Format: | Thesis |
| Published: |
AUC Knowledge Fountain
2015
|
| Subjects: | |
| Tags: |
No Tags, Be the first to tag this record!
|
| _version_ | 1867613407846334464 |
|---|---|
| access_status_str | Open Access |
| author | Abdel Salam, Ismail Mohamed Anwar |
| author_browse | Abdel Salam, Ismail Mohamed Anwar |
| author_facet | Abdel Salam, Ismail Mohamed Anwar |
| author_sort | Abdel Salam, Ismail Mohamed Anwar |
| collection | Thesis |
| dc_rights_str_mv | The author retains all rights with regard to copyright. The author certifies that written permission from the owner(s) of third-party copyrighted matter included in the thesis, dissertation, paper, or record of study has been obtained. The author further certifies that IRB approval has been obtained for this thesis, or that IRB approval is not necessary for this thesis. Insofar as this thesis, dissertation, paper, or record of study is an educational record as defined in the Family Educational Rights and Privacy Act (FERPA) (20 USC 1232g), the author has granted consent to disclosure of it to anyone who requests a copy. |
| description | Classi cation is a central problem in the elds of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classi er) that can be used to predict the class of new unlabeled instances. Data preparation is crucial to the data mining process, and its focus is to improve the tness of the training data for the learning algorithms to produce more e ective classi ers. Two widely applied data preparation methods are feature selection and instance selection, which fall under the umbrella of data reduction. For my research I propose ADR-Miner, a novel data reduction algorithm that utilizes ant colony optimization (ACO). ADR-Miner is designed to perform instance selection to improve the predictive e ectiveness of the constructed classi cation models. Two versions of ADR-Miner are developed: a base version that uses a single classi cation algorithm during both training and testing, and an extended version which uses separate classi cation algorithms for each phase. The base version of the ADR-Miner algorithm is evaluated against 20 data sets using three classi cation algorithms, and the results are compared to a benchmark data reduction algorithm. The non-parametric Wilcoxon signed-ranks test will is employed to gauge the statistical signi cance of the results obtained. The extended version of ADR-Miner is evaluated against 37 data sets using pairings from fi ve classi cation algorithms and these results are benchmarked against the performance of the classi cation algorithms but without reduction applied as pre-processing. Keywords: Ant Colony Optimization (ACO), Data Mining, Classi cation, Data Reduction. |
| format | Thesis |
| id | oai:fount.aucegypt.edu:etds-1129 |
| institution | American University in Cairo (Egypt) |
| last_indexed | 2026-06-10T12:35:39.635Z |
| license_str | Other — see source repository |
| provenance_str_mv | Harvested via OAI-PMH from AUC Knowledge Fountain — bepress |
| publishDate | 2015 |
| publishDateRange | 2015 |
| publishDateSort | 2015 |
| publisher | AUC Knowledge Fountain |
| publisherStr | AUC Knowledge Fountain |
| record_format | dspace |
| source_str | AUC Knowledge Fountain — bepress |
| spelling | oai:fount.aucegypt.edu:etds-1129 ADR-Miner: An Ant-based data reduction algorithm for classification Abdel Salam, Ismail Mohamed Anwar Classi cation is a central problem in the elds of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classi er) that can be used to predict the class of new unlabeled instances. Data preparation is crucial to the data mining process, and its focus is to improve the tness of the training data for the learning algorithms to produce more e ective classi ers. Two widely applied data preparation methods are feature selection and instance selection, which fall under the umbrella of data reduction. For my research I propose ADR-Miner, a novel data reduction algorithm that utilizes ant colony optimization (ACO). ADR-Miner is designed to perform instance selection to improve the predictive e ectiveness of the constructed classi cation models. Two versions of ADR-Miner are developed: a base version that uses a single classi cation algorithm during both training and testing, and an extended version which uses separate classi cation algorithms for each phase. The base version of the ADR-Miner algorithm is evaluated against 20 data sets using three classi cation algorithms, and the results are compared to a benchmark data reduction algorithm. The non-parametric Wilcoxon signed-ranks test will is employed to gauge the statistical signi cance of the results obtained. The extended version of ADR-Miner is evaluated against 37 data sets using pairings from fi ve classi cation algorithms and these results are benchmarked against the performance of the classi cation algorithms but without reduction applied as pre-processing. Keywords: Ant Colony Optimization (ACO), Data Mining, Classi cation, Data Reduction. 2015-06-01T07:00:00Z thesis application/pdf https://fount.aucegypt.edu/etds/130 https://fount.aucegypt.edu/context/etds/article/1129/viewcontent/Thesis__MSc_IsmailAnwar_Final.pdf The author retains all rights with regard to copyright. The author certifies that written permission from the owner(s) of third-party copyrighted matter included in the thesis, dissertation, paper, or record of study has been obtained. The author further certifies that IRB approval has been obtained for this thesis, or that IRB approval is not necessary for this thesis. Insofar as this thesis, dissertation, paper, or record of study is an educational record as defined in the Family Educational Rights and Privacy Act (FERPA) (20 USC 1232g), the author has granted consent to disclosure of it to anyone who requests a copy. Theses and Dissertations AUC Knowledge Fountain Ant Colony Optimization (ACO) Data Mining |
| spellingShingle | Ant Colony Optimization (ACO) Data Mining Abdel Salam, Ismail Mohamed Anwar ADR-Miner: An Ant-based data reduction algorithm for classification |
| title | ADR-Miner: An Ant-based data reduction algorithm for classification |
| title_full | ADR-Miner: An Ant-based data reduction algorithm for classification |
| title_fullStr | ADR-Miner: An Ant-based data reduction algorithm for classification |
| title_full_unstemmed | ADR-Miner: An Ant-based data reduction algorithm for classification |
| title_short | ADR-Miner: An Ant-based data reduction algorithm for classification |
| title_sort | adr miner an ant based data reduction algorithm for classification |
| topic | Ant Colony Optimization (ACO) Data Mining |
| url | https://fount.aucegypt.edu/etds/130 https://fount.aucegypt.edu/context/etds/article/1129/viewcontent/Thesis__MSc_IsmailAnwar_Final.pdf |
| work_keys_str_mv | AT abdelsalamismailmohamedanwar adrmineranantbaseddatareductionalgorithmforclassification |