Dessureault, J.-S. et Massicotte, D. (2023). DPDR: A novel machine learning method for the decision process for dimensionality reduction. SN Computer Science, 5 (1). p. 124. ISSN 2661-8907 DOI 10.1007/s42979-023-02394-9
PDF
Sous embargo jusqu'au 21 décembre 2024. Télécharger (494kB) |
Résumé
This paper discusses the critical decision process of extracting or selecting the features in a supervised learning context. It is often confusing to find a suitable method to reduce dimensionality. There are pros and cons to deciding between a feature selection and feature extraction according to the data’s nature and the user’s preferences. Indeed, the user may want to emphasize the results toward integrity or interpretability and a specific data resolution. This paper proposes a new method to choose the best dimensionality reduction method in a supervised learning context. It also helps to drop or reconstruct the features until a target resolution is reached. This target resolution can be user defined, or it can be automatically defined by the method. The method applies a regression or a classification, evaluates the results, and gives a diagnosis about the best dimensionality reduction process in this specific supervised learning context. The main algorithms used are the random forest algorithms, the principal component analysis algorithm, and the multilayer perceptron neural network algorithm. Six use cases are presented, and every one is based on some well-known technique to generate synthetic data. This research also discusses each choice that can be made in the process, aiming to clarify the issues about the entire decision process of selecting or extracting the features.
Type de document: | Article |
---|---|
Mots-clés libres: | Feature extraction Feature selection Random forest algorithm PCA algorithm MLP neural network |
Date de dépôt: | 29 janv. 2024 13:43 |
Dernière modification: | 29 janv. 2024 13:43 |
Version du document déposé: | Post-print (version corrigée et acceptée) |
URI: | https://depot-e.uqtr.ca/id/eprint/11093 |
Actions (administrateurs uniquement)
Éditer la notice |