Feature selection (also known as subset selection) is a process commonly used in machine learning, wherein a subset of the features available from the data is selected for application of a learning algorithm. The best subset contains the least number of dimensions that most contribute to accuracy; we discard the remaining, unimportant dimensions. This is an important stage of pre-processing and is one of two ways of avoiding the curse of dimensionality (the other is feature extraction).
There are two approaches:
\begin{description}
\item [forward selection] Start with no variables and add them one by one, at each step adding the one that decreases the error the most, until any further addition does not significantly decrease the error.
\item [backward selection] Start with all the variables and remove them one by one, at each step removing the one that decreases the error the most (or increases it only slightly), until any further removal increases the error significantly.
\end{description}
In either case, to reduce overfitting, the error referred to above should be measured on a validation set distinct from the training set.
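As a concrete (and hedged) illustration, the forward-selection loop can be sketched as follows, assuming a generic error function \texttt{error(subset)} that trains on the training set and returns the error of the resulting model on the validation set; the function name and the tolerance parameter are illustrative, not taken from any of the sources cited below.

\begin{verbatim}
# Greedy forward selection (sketch): add one feature at a time, always the
# one that lowers validation error the most, and stop when no addition helps.
def forward_selection(all_features, error, min_decrease=1e-4):
    """error(subset) trains on the training set and returns the
    validation-set error obtained with that subset of features."""
    selected = []
    best_err = float("inf")
    while True:
        candidates = [f for f in all_features if f not in selected]
        if not candidates:
            break
        # Try each remaining feature and keep the best one-step extension.
        trial = {f: error(selected + [f]) for f in candidates}
        best_f = min(trial, key=trial.get)
        if best_err - trial[best_f] < min_decrease:
            break  # no further addition decreases the error significantly
        selected.append(best_f)
        best_err = trial[best_f]
    return selected, best_err
\end{verbatim}

Backward selection is the mirror image: start from the full feature set and, at each step, remove the feature whose removal decreases (or least increases) the validation error, stopping when any further removal increases the error significantly.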
"There are two main methods for reducing dimensionality: feature selection and feature extraction. In feature selection, we are interested in finding k of the d dimensions that give us the most information and we discard the other (d - k) dimensions. We are going to discuss subset delection as a feature selection method.
[...]
In subset selection, we are interested in finding the best subset of the set of features. The best subset contains the least number of dimensions that most contribute to accuracy. We discard the remaining, unimportant dimensions. Using a suitable error function, this can be used in both regression and classification problems. There are $2^d$ possible subsets of $d$ variables, but we cannot test for all of them unless $d$ is small, and we employ heuristics to get a reasonable (but not optimal) solution in reasonable (polynomial) time.
There are two approaches: In forward selection, we start with no variables and add them one by one, at each step adding the one that decreases the error the most, until any further addition does not decrease the error (or decreases it only sightly (sic)). In backward selection, we start with all variables and remove them one by one, at each step removing the one that decreases the error the most (or increases it only slightly), until any further removal increases the error significantly. In either case, checking the error should be done on a validation set distinct from the training set because we want to test the generalization accuracy. With more features, generally we have lower training error, but not necessarily lower validation error.
[...]"
Alpaydin (2004), p 106
"An important issue that often confronts data miners in practice is the problem of having too many variables. Simply put, not all variables that are measured are likely to be necessary for accurate discrimination and including them in the classification model may in fact lead to a worse model than if they were removed. Consider the simple example of building a system to discriminate between images of male and female faces (a task that humans perform effortlessly and relatively accurately but that is quite challenging for an image classification algorithm), The colors of a person's eyes, hair, or skin are hardly likely to be useful in this discriminative context. These are variables that are easy to measure (and indeed are general characteristics of a person's appearance) but carry little information as to the class identity in this particular case."
Hand, Mannila and Smyth (2001), p 362
"One of the central issues in induction concerns the selection of useful features. Although most learning methods attempt to either select attributes or assign them degrees of importance, both theoretical analyses and experimental studies indicate that many algorithms scale poorly to domains with large numbers of irrelevant features. For example, the number of training cases needed for simple nearest neighbor [...] to reach a given level of accuracy appears to grow exponentially with the number of irrelevant features, independent of the target concept. Even methods for inducing univariate decision trees, which explicitly select some attributes in favor of others, exhibit this behavior for some target concepts. And some techniques, like the naive Bayesian classifier [...], can be very sensitive to domains with correlated attributes. This suggests the need for additional methods to select a useful subset of features when many are available."
Langley (1996), p 233, p 253
"Feature selection, also known as subset selection or variable selection, is a process commonly used in machine learning, wherein a subset of the features available from the data are selected for application of a learning algorithm. Feature selection is necessary either because it is computationally infeasible to use all available features, or because of problems of estimation when limited data samples (but a large number of features) are present. The latter problem is related to the so-called curse of dimensionality."
Wikipedia (2006)
"This paper is a comparative study of feature selection methods in statistical learning of text categorization. The focus is on aggresive dimensionality reduction. Five methods were evaluated, including term selection based on document frequency (DF), information gain (IG), mutual information (MI), a $\chi^2$-test (CHI), and term strength (TS). We found IG and CHI most effective in our experiments.[...]"
@inproceedings{YangPedersen97,
author = {Yiming Yang and Jan O. Pedersen},
title = {A Comparative Study of Feature Selection in Text Categorization},
booktitle = {ICML '97: Proceedings of the Fourteenth International Conference on Machine Learning},
year = {1997},
editor = {},
pages = {412--420},
organization = {},
publisher = {Morgan Kaufmann Publishers Inc.},
address = {San Francisco, CA, USA},
month = {},
note = {},
key = {},
abstract = {This paper is a comparative study of feature selection methods in statistical learning of text categorization. The focus is on aggressive dimensionality reduction. Five methods were evaluated, including term selection based on document frequency (DF), information gain (IG), mutual information (MI), a $\chi^2$-test (CHI), and term strength (TS). We found IG and CHI most effective in our experiments. Using IG thresholding with a k-nearest neighbor classifier on the Reuters corpus, removal of up to 98\% of unique terms actually yielded an improved classification accuracy (measured by average precision). DF thresholding performed similarly. Indeed we found strong correlations between the DF, IG and CHI values of a term. This suggests that DF thresholding, the simplest method with the lowest cost in computation, can be reliably used instead of IG or CHI when the computation of these measures are too expensive. TS compares favorably with the other methods with up to 50\% vocabulary reduction but is not competitive at higher vocabulary reduction levels. In contrast, MI had relatively poor performance due to its bias towards favoring rare terms, and its sensitivity to probability estimation errors.}
}
In a comparative study of feature selection methods in statistical learning of text categorization (with a focus on aggressive dimensionality reduction), \citeasnoun{YangPedersen97} evaluated document frequency (DF), information gain (IG), mutual information (MI), a $\chi^2$-test (CHI) and term strength (TS), and found IG and CHI to be the most effective.
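To make the flavour of these term-scoring criteria concrete, the $\chi^2$ statistic for a term/category pair can be computed from a two-by-two table of document counts, roughly as sketched below; the variable names follow the usual contingency-table convention and are not taken from the paper itself.

\begin{verbatim}
# Chi-square score of a term t for a category c, from a 2x2 contingency table
# of document counts (standard formulation; a sketch, not the paper's code):
#   a  = docs in c containing t       b = docs outside c containing t
#   c_ = docs in c without t          d = docs outside c without t
def chi_square(a, b, c_, d):
    n = a + b + c_ + d
    num = n * (a * d - c_ * b) ** 2
    den = (a + c_) * (b + d) * (a + b) * (c_ + d)
    return num / den if den else 0.0
\end{verbatim}

Terms are then ranked by, for example, their maximum or average score over all categories, and only the top-scoring terms are retained.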
"In the feature subset selection problem, a learning algorithm is faced with the problem of selecting a relevant subset of features upon which to focus its attention, while ignoring the rest. To achieve the best possible performance with a particular learning algorithm on a particular training set, a feature subset selection method should consider how the algorithm and the training set interact. We explore the relation between optimal feature subset selection and relevance. Our wrapper method searches for an optimal feature subset tailored to a particular algorithm and a domain. We study the strengths and weaknesses of the wrapper approach and show a series of improved designs. We compare the wrapper approach to induction without feature subset selection and to Relief, a filter approach to feature subset selection. Significant improvement in accuracy is achieved for some datasets for the two families of induction algorithms used: decision trees and Naive-Bayes."
@article{KohaviJohn97,
author = {Ron Kohavi and George H. John},
title = {Wrappers for Feature Subset Selection},
journal = {Artificial Intelligence},
year = {1997},
volume = {97},
number = {1--2},
pages = {273--324},
month = {December},
note = {},
key = {},
abstract = {In the feature subset selection problem, a learning algorithm is faced with the problem of selecting a relevant subset of features upon which to focus its attention, while ignoring the rest. To achieve the best possible performance with a particular learning algorithm on a particular training set, a feature subset selection method should consider how the algorithm and the training set interact. We explore the relation between optimal feature subset selection and relevance. Our wrapper method searches for an optimal feature subset tailored to a particular algorithm and a domain. We study the strengths and weaknesses of the wrapper approach and show a series of improved designs. We compare the wrapper approach to induction without feature subset selection and to Relief, a filter approach to feature subset selection. Significant improvement in accuracy is achieved for some datasets for the two families of induction algorithms used: decision trees and Naive-Bayes.}
}
\citeasnoun{KohaviJohn97} introduced wrappers for feature subset selection. Their approach searches for an optimal feature subset tailored to a particular learning algorithm and a particular training set.
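The core of the wrapper idea can be sketched as a subset evaluator that treats the induction algorithm as a black box and scores candidate subsets by cross-validation on the training data; any search strategy (forward, backward, or the best-first search Kohavi and John actually use) can then call it. The snippet below assumes scikit-learn and is only a sketch, not their implementation.

\begin{verbatim}
# Wrapper-style subset evaluation (sketch): score a candidate feature subset
# by cross-validated accuracy of the very learner we intend to deploy.
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

def wrapper_score(X, y, subset, estimator=None, cv=5):
    """Mean cross-validated accuracy of `estimator` restricted to `subset`."""
    estimator = estimator if estimator is not None else DecisionTreeClassifier()
    return cross_val_score(estimator, X[:, list(subset)], y, cv=cv).mean()
\end{verbatim}

Because the score depends on the chosen estimator, the subset found is tailored to that particular algorithm and training set, which is precisely the point of the wrapper approach.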
@article{GuyonElisseeff03,
author = {Isabelle Guyon and Andr{\'e} Elisseeff},
title = {An Introduction to Variable and Feature Selection},
journal = {Journal of Machine Learning Research},
year = {2003},
volume = {3},
number = {},
pages = {1157--1182},
month = {March},
note = {},
key = {},
abstract = {Variable and feature selection have become the focus of much research in areas of application for which datasets with tens or hundreds of thousands of variables are available. These areas include text processing of internet documents, gene expression array analysis, and combinatorial chemistry. The objective of variable selection is three-fold: improving the prediction performance of the predictors, providing faster and more cost-effective predictors, and providing a better understanding of the underlying process that generated the data. The contributions of this special issue cover a wide range of aspects of such problems: providing a better definition of the objective function, feature construction, feature ranking, multivariate feature selection, efficient search methods, and feature validity assessment methods.}
}
\citeasnoun{GuyonElisseeff03} gave an introduction to variable and feature selection. They recommend using a linear predictor of one's choice (e.g. a linear SVM) and selecting variables in either of two ways: (1) with a variable ranking method using a correlation coefficient or mutual information; (2) with a nested subset selection method performing forward or backward selection or with multiplicative updates.
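The first of these routes, variable ranking, is the simplest: score each feature independently and keep the top-ranked ones. A hedged sketch, assuming numeric features and using absolute Pearson correlation as the ranking criterion:

\begin{verbatim}
# Variable ranking by absolute Pearson correlation with the target (sketch).
import numpy as np

def rank_by_correlation(X, y):
    """Return feature indices ordered from most to least correlated with y."""
    scores = [abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(X.shape[1])]
    return np.argsort(scores)[::-1]
\end{verbatim}

Mutual information can be substituted for the correlation coefficient when nonlinear dependencies matter.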
JOHN, George H., R. KOHAVI and Karl PFLEGER, 1994. Irrelevant features and the subset selection problem, Machine Learning: Proceedings of the Eleventh International Conference, edited by William W. Cohen and Haym Hirsh, pages 121-129. [Cited by 669] (54.54/year)
"We address the problem of finding a subset of features that allows a supervised induction algorithm to induce small high-accuracy concepts. We examine notions of relevance and irrelevance, and show that the definitions used in the machine learning literature do not adequately partition the features into useful categories of relevance. We present definitions for irrelevance and for two degrees of relevance. These definitions improve our understanding of the behavior of previous subset selection algorithms, and help define the subset of features that should be sought. The features selected should depend not only on the features and the target concept, but also on the induction algorithm. We describe a method for feature subset selection using cross-validation that is applicable to any induction algorithm, and discuss experiments conducted with ID3 and C4.5 on artificial and real datasets."
@inproceedings{JohnKohaviPfleger94,
author = {George H. John and Ron Kohavi and Karl Pfleger},
title = {Irrelevant Features and the Subset Selection Problem},
booktitle = {Machine Learning: Proceedings of the Eleventh International Conference},
year = {1994},
editor = {William W. Cohen and Haym Hirsh},
pages = {121--129},
organization = {},
publisher = {Morgan Kaufmann Publishers},
address = {San Francisco, CA},
month = {},
note = {},
key = {},
abstract = {We address the problem of finding a subset of features that allows a supervised induction algorithm to induce small high-accuracy concepts. We examine notions of relevance and irrelevance, and show that the definitions used in the machine learning literature do not adequately partition the features into useful categories of relevance. We present definitions for irrelevance and for two degrees of relevance. These definitions improve our understanding of the behavior of previous subset selection algorithms, and help define the subset of features that should be sought. The features selected should depend not only on the features and the target concept, but also on the induction algorithm. We describe a method for feature subset selection using cross-validation that is applicable to any induction algorithm, and discuss experiments conducted with ID3 and C4.5 on artificial and real datasets.}
}
\citeasnoun{JohnKohaviPfleger94} addressed irrelevant features and the subset selection problem. They presented definitions for irrelevance and for two degrees of relevance (weak and strong), and stated that the features selected should depend not only on the features and the target concept, but also on the induction algorithm. Further, they claimed that the filter model approach to subset selection should be replaced with the wrapper model.
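Paraphrasing those relevance definitions in our own notation (so the exact wording may differ from the paper's): let $Y$ be the target, $X_i$ a feature, and $S_i$ the set of all features other than $X_i$. Feature $X_i$ is \emph{strongly relevant} if there exist values $x_i$, $y$ and $s_i$ with $P(X_i = x_i, S_i = s_i) > 0$ such that
\[ P(Y = y \mid X_i = x_i, S_i = s_i) \neq P(Y = y \mid S_i = s_i). \]
Feature $X_i$ is \emph{weakly relevant} if it is not strongly relevant but there exists a subset $S_i' \subseteq S_i$ and values $x_i$, $y$, $s_i'$ with $P(X_i = x_i, S_i' = s_i') > 0$ such that
\[ P(Y = y \mid X_i = x_i, S_i' = s_i') \neq P(Y = y \mid S_i' = s_i'). \]
Features that are neither strongly nor weakly relevant are irrelevant.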
@article{BlumLangley97,
author = {Avrim L. Blum and Pat Langley},
title = {Selection of Relevant Features and Examples in Machine Learning},
journal = {Artificial Intelligence},
year = {1997},
volume = {97},
number = {1--2},
pages = {245--271},
month = {December},
note = {},
key = {},
abstract = {In this survey, we review work in machine learning on methods for handling data sets containing large amounts of irrelevant information. We focus on two key issues: the problem of selecting relevant features, and the problem of selecting relevant examples. We describe the advances that have been made on these topics in both empirical and theoretical work in machine learning, and we present a general framework that we use to compare different methods. We close with some challenges for future work in this area.}
}
\citeasnoun{BlumLangley97} focussed on two key issues: the problem of selecting relevant features and the problem of selecting relevant examples.
@article{JainZongker97,
author = {Anil Jain and Douglas Zongker},
title = {Feature Selection: Evaluation, Application, and Small Sample Performance},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
year = {1997},
volume = {19},
number = {2},
pages = {153--158},
month = {February},
note = {},
key = {},
abstract = {A large number of algorithms have been proposed for feature subset selection. Our experimental results show that the sequential forward floating selection algorithm, proposed by Pudil et al. (1994), dominates the other algorithms tested. We study the problem of choosing an optimal feature set for land use classification based on SAR satellite images using four different texture models. Pooling features derived from different texture models, followed by a feature selection results in a substantial improvement in the classification accuracy. We also illustrate the dangers of using feature selection in small sample size situations.}
}
\citeasnoun{JainZongker97} considered various feature subset selection algorithms and found that the sequential forward floating selection algorithm, proposed by \citeasnoun{PudilNovovicovaKittler94}, dominated the other algorithms tested.
LIU, Huan and Hiroshi MOTODA, 1998. Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers Norwell, MA, USA. [Cited by 329] (39.81/year)
@book{LiuMotoda98,
author = {Huan Liu and Hiroshi Motoda},
title = {Feature Selection for Knowledge Discovery and Data Mining},
chapter = {},
pages = {},
publisher = {Kluwer Academic Publishers},
year = {1998},
volume = {},
series = {},
address = {},
edition = {},
month = {},
note = {},
key = {}
}
\citeasnoun{LiuMotoda98} wrote a book on feature selection that offers an overview of the methods developed since the 1970s and provides a general framework for examining and categorizing these methods.
KOLLER, D. and M. SAHAMI, 1996. Toward optimal feature selection, Proceedings of the Thirteenth International Conference on Machine Learning, Pages 284-292. [Cited by 363] (35.36/year)
@inproceedings{KollerSahami96,
author = {Daphne Koller and Mehran Sahami},
title = {Toward Optimal Feature Selection},
booktitle = {Proceedings of the Thirteenth International Conference on Machine Learning},
year = {1996},
editor = {},
pages = {284--292},
organization = {},
publisher = {Morgan Kaufmann},
address = {},
month = {July},
note = {},
key = {},
abstract = {In this paper, we examine a method for feature subset selection based on Information Theory. Initially, a framework for defining the theoretically optimal, but computationally intractable, method for feature subset selection is presented. We show that our goal should be to eliminate a feature if it gives us little or no additional information beyond that subsumed by the remaining features. In particular, this will be the case for both irrelevant and redundant features. We then give an efficient algorithm for feature selection which computes an approximation to the optimal feature selection criterion. The conditions under which the approximate algorithm is successful are examined. Empirical results are given on a number of data sets, showing that the algorithm effectively handles datasets with a very large number of features.}
}
\citeasnoun{KollerSahami96} examined a method for feature subset selection based on Information Theory: they presented a theoretically justified model for optimal feature selection based on using cross-entropy to minimize the amount of predictive information lost during feature elimination.
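Schematically, and in our own notation (which may differ from the paper's), the criterion prefers a reduced feature set $G \subseteq F$ whose class posterior stays close, in expected Kullback--Leibler divergence, to the posterior given the full feature set $F$:
\[ \delta(G) \;=\; \sum_{x} P(x)\, D_{\mathrm{KL}}\bigl( P(C \mid F = x) \,\big\|\, P(C \mid G = x_G) \bigr), \]
where $x_G$ denotes the projection of $x$ onto $G$. Their algorithm approximates this by greedily removing features that have an (approximate) Markov blanket among the remaining features, since removing such features leaves $\delta$ essentially unchanged.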
@inproceedings{Weston-etal00,
author = {Jason Weston and Sayan Mukherjee and Olivier Chapelle and Massimiliano Pontil and Tomaso Poggio and Vladimir Vapnik},
title = {Feature Selection for {SVMs}},
booktitle = {Advances in Neural Information Processing Systems 13},
year = {2001},
editor = {Todd K. Leen and Thomas G. Dietterich and Volker Tresp},
pages = {668--674},
organization = {},
publisher = {The MIT Press},
address = {Cambridge, MA},
month = {April},
note = {},
key = {},
abstract = {We introduce a method of feature selection for Support Vector Machines. The method is based upon finding those features which minimize bounds on the leave-one-out error. This search can be efficiently performed via gradient descent. The resulting algorithms are shown to be superior to some standard feature selection algorithms on both toy data and real-life problems of face recognition, pedestrian detection and analyzing DNA microarray data.},
conclusion = {In this article we have introduced a method to perform feature selection for SVMs. This method is computationally feasible for high dimensional datasets compared to existing wrapper methods, and experiments on a variety of toy and real datasets show superior performance to the filter methods tried. This method, amongst other applications, speeds up SVMs for time critical applications (e.g pedestrian detection), and makes possible feature discovery (e.g gene discovery). Secondly, in simple experiments we showed that SVMs can indeed suffer in high dimensional spaces where many features are irrelevant. Our method provides one way to circumvent this naturally occuring, complex problem.}
}
\citeasnoun{Weston-etal00} introduced a method of feature selection for SVMs which is based upon finding those features which minimize bounds on the leave-one-out error. The method was shown to be superior to some standard feature selection algorithms on the data sets tested.
@article{DashLiu97,
author = {M. Dash and H. Liu},
title = {Feature Selection for Classification},
journal = {Intelligent Data Analysis},
year = {1997},
volume = {1},
number = {1-4},
pages = {131--156},
month = {},
note = {},
key = {},
abstract = {Feature selection has been the focus of interest for quite some time and much work has been done. With the creation of huge databases and the consequent requirements for good machine learning techniques, new problems arise and novel approaches to feature selection are in demand. This survey is a comprehensive overview of many existing methods from the 1970's to the present. It identifies four steps of a typical feature selection method, and categorizes the different existing methods in terms of generation procedures and evaluation functions, and reveals hitherto unattempted combinations of generation procedures and evaluation functions. Representative methods are chosen from each category for detailed explanation and discussion via example. Benchmark datasets with different characteristics are used for comparative study. The strengths and weaknesses of different methods are explained. Guidelines for applying feature selection methods are given based on data types and domain characteristics. This survey identifies the future research areas in feature selection, introduces newcomers to this field, and paves the way for practitioners who search for suitable methods for solving domain-specific real-world applications.}
}
\citeasnoun{DashLiu97} gave a survey of feature selection methods for classification.
@article{PudilNovovicovaKittler94,
author = {P. Pudil and J. Novovi{\v{c}}ov{\'a} and J. Kittler},
title = {Floating Search Methods in Feature Selection},
journal = {Pattern Recognition Letters},
year = {1994},
volume = {15},
number = {11},
pages = {1119--1125},
month = {November},
note = {},
key = {},
abstract = {Sequential search methods characterized by a dynamically changing number of features included or eliminated at each step, henceforth ``floating'' methods, are presented. They are shown to give very good results and to be computationally more effective than the branch and bound method.}
}
\citeasnoun{PudilNovovicovaKittler94} presented ``floating'' search methods in feature selection. These are sequential search methods characterized by a dynamically changing number of features included or eliminated at each step. They were shown to give very good results and to be computationally more effective than the branch and bound method.
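A hedged sketch of the sequential forward floating selection (SFFS) idea: after each forward inclusion, conditionally exclude features for as long as doing so improves on the best subset of that size found so far. The criterion \texttt{score} (larger is better) and the other names are our own; this is a simplification, not the authors' algorithm statement.

\begin{verbatim}
# Sequential forward floating selection (SFFS), sketched: one forward step,
# then "floating" backward steps while they beat the best subset of that size.
def sffs(all_features, score, k):
    """Return an approximately best subset of size k, where score(subset)
    is any criterion with larger values being better."""
    best = {}                      # size -> (score, subset)
    subset = []
    while len(subset) < k:
        # Inclusion: add the single most significant feature.
        f = max((x for x in all_features if x not in subset),
                key=lambda x: score(subset + [x]))
        subset = subset + [f]
        s = score(subset)
        if s > best.get(len(subset), (float("-inf"),))[0]:
            best[len(subset)] = (s, list(subset))
        # Conditional exclusion: drop the least significant feature while
        # that improves on the best subset of the reduced size.
        while len(subset) > 2:
            g = max(subset, key=lambda x: score([y for y in subset if y != x]))
            reduced = [y for y in subset if y != g]
            s_red = score(reduced)
            if s_red > best.get(len(reduced), (float("-inf"),))[0]:
                best[len(reduced)] = (s_red, reduced)
                subset = reduced
            else:
                break
    return best[k][1]
\end{verbatim}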
@article{YangHonavar98,
author = {Jihoon Yang and Vasant Honavar},
title = {Feature Subset Selection Using a Genetic Algorithm},
journal = {IEEE Intelligent Systems},
year = {1998},
volume = {13},
number = {2},
pages = {44--49},
month = {March/April},
note = {},
key = {},
abstract = {Practical pattern-classification and knowledge-discovery problems require the selection of a subset of attributes or features to represent the patterns to be classified. The authors' approach uses a genetic algorithm to select such subsets, achieving multicriteria optimization in terms of generalization accuracy and costs associated with the features.}
}
\citeasnoun{YangHonavar98} used a genetic algorithm for feature subset selection.
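A toy sketch of the genetic-algorithm formulation: chromosomes are feature bit-masks and the fitness function is any subset score, for instance cross-validated accuracy minus a cost term for the features used. The parameter values and operator choices below are illustrative assumptions, not Yang and Honavar's settings.

\begin{verbatim}
# Toy genetic algorithm over feature bit-masks (sketch, not the authors' code).
import random

def ga_feature_selection(n_features, fitness, pop_size=30, generations=50,
                         crossover_rate=0.8, mutation_rate=0.02, seed=0):
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_features)]
           for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=fitness, reverse=True)
        nxt = [list(c) for c in ranked[:2]]        # elitism: keep the two best
        while len(nxt) < pop_size:
            p1, p2 = rng.sample(ranked[:pop_size // 2], 2)  # truncation selection
            if rng.random() < crossover_rate:               # one-point crossover
                cut = rng.randrange(1, n_features)
                child = p1[:cut] + p2[cut:]
            else:
                child = list(p1)
            child = [1 - b if rng.random() < mutation_rate else b for b in child]
            nxt.append(child)
        pop = nxt
    return max(pop, key=fitness)                   # best bit-mask found
\end{verbatim}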
@article{Forman03,
author = {George Forman},
title = {An Extensive Empirical Study of Feature Selection Metrics for Text Classification},
journal = {Journal of Machine Learning Research},
year = {2003},
volume = {3},
number = {},
pages = {1289--1305},
month = {March},
note = {},
key = {},
abstract = {Machine learning for text classification is the cornerstone of document categorization, news filtering, document routing, and personalization. In text domains, effective feature selection is essential to make the learning task efficient and more accurate. This paper presents an empirical comparison of twelve feature selection methods (e.g. Information Gain) evaluated on a benchmark of 229 text classification problem instances that were gathered from Reuters, TREC, OHSUMED, etc. The results are analyzed from multiple goal perspectives---accuracy, F-measure, precision, and recall---since each is appropriate in different situations.\\
The results reveal that a new feature selection metric we call `Bi-Normal Separation' (BNS), outperformed the others by a substantial margin in most situations. This margin widened in tasks with high class skew, which is rampant in text classification problems and is particularly challenging for induction algorithms.\\
A new evaluation methodology is offered that focuses on the needs of the data mining practitioner faced with a single dataset who seeks to choose one (or a pair of) metrics that are most \textit{likely} to yield the best performance. From this perspective, BNS was the top single choice for all goals except precision, for which Information Gain yielded the best result most often. This analysis also revealed, for example, that Information Gain and Chi-Squared have correlated failures, and so they work poorly together. When choosing optimal pairs of metrics for each of the four performance goals, BNS is consistently a member of the pair---e.g., for greatest recall, the pair BNS + F1-measure yielded the best performance on the greatest number of tasks by a considerable margin.}
}
\citeasnoun{Forman03} presented an empirical comparison of twelve feature selection metrics for text classification. The results revealed that a new metric, `Bi-Normal Separation' (BNS), outperformed the others by a substantial margin in most situations.
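Bi-Normal Separation scores a term by how far apart its true-positive rate and false-positive rate sit under the inverse of the standard Normal CDF. A small sketch assuming SciPy; the clipping constant is an implementation safeguard of this sketch rather than part of the definition.

\begin{verbatim}
# Bi-Normal Separation (BNS) for a term: |F^{-1}(tpr) - F^{-1}(fpr)|, where
# F^{-1} is the inverse standard Normal CDF (sketch; clipping keeps it finite).
from scipy.stats import norm

def bns(tp, fp, pos, neg, eps=0.0005):
    """tp, fp: positive/negative documents containing the term;
    pos, neg: total numbers of positive/negative documents."""
    tpr = min(max(tp / pos, eps), 1 - eps)
    fpr = min(max(fp / neg, eps), 1 - eps)
    return abs(norm.ppf(tpr) - norm.ppf(fpr))
\end{verbatim}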
@inproceedings{XingJordanKarp01,
author = {Eric P. Xing and Michael I. Jordan and Richard M. Karp},
title = {Feature Selection for High-Dimensional Genomic Microarray Data},
booktitle = {ICML '01: Proceedings of the Eighteenth International Conference on Machine Learning},
year = {2001},
editor = {},
pages = {601--608},
organization = {},
publisher = {Morgan Kaufmann},
address = {San Francisco, CA, USA},
month = {},
note = {},
key = {},
abstract = {We report on the successful application of feature selection methods to a classification problem in molecular biology involving only 72 data points in a 7130 dimensional space. Our approach is a hybrid of filter and wrapper approaches to feature selection. We make use of a sequence of simple filters, culminating in Koller and Sahami's (1996) Markov Blanket filter, to decide on particular feature subsets for each subset cardinality. We compare between the resulting subset cardinalities using cross validation. The paper also investigates regularization methods as an alternative to feature selection, showing that feature selection methods are preferable in this problem.}
}
\citeasnoun{XingJordanKarp01} successfully applied feature selection methods (using a hybrid of filter and wrapper approaches) to a classification problem in molecular biology involving only 72 data points in a 7130 dimensional space. They also investigated regularization methods as an alternative to feature selection, and showed that feature selection methods were preferable in the problem they tackled.
@inproceedings{KiraRendell92,
author = {Kenji Kira and Larry A. Rendell},
title = {A Practical Approach to Feature Selection},
booktitle = {ML92: Proceedings of the Ninth International Conference on Machine Learning},
year = {1992},
editor = {Derek H. Sleeman and Peter Edwards},
pages = {249--256},
organization = {},
location = {Aberdeen, Scotland, United Kingdom},
publisher = {Morgan Kaufmann Publishers Inc.},
address = {San Francisco, CA, USA},
month = {},
note = {},
key = {},
abstract = {}
}
\citeasnoun{KiraRendell92} described a statistical feature selection algorithm called RELIEF that uses instance based learning to assign a relevance weight to each feature.
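The weight update at the heart of RELIEF can be sketched as below for numeric features and two classes; the distance metric, scaling and sample size are implementation choices of this sketch, not necessarily those of Kira and Rendell.

\begin{verbatim}
# RELIEF sketch (numeric features, two classes): sample instances, find each
# one's nearest hit (same class) and nearest miss (other class), and reward
# features that separate misses more than hits.
import numpy as np

def relief(X, y, n_samples=100, seed=0):
    rng = np.random.default_rng(seed)
    n, d = X.shape
    span = X.max(axis=0) - X.min(axis=0)   # scale per-feature differences
    span[span == 0] = 1.0
    w = np.zeros(d)
    for i in rng.integers(0, n, size=n_samples):
        dists = np.abs(X - X[i]).sum(axis=1).astype(float)
        dists[i] = np.inf                  # exclude the instance itself
        same, other = (y == y[i]), (y != y[i])
        hit = np.where(same)[0][np.argmin(dists[same])]
        miss = np.where(other)[0][np.argmin(dists[other])]
        w += ((X[i] - X[miss]) / span) ** 2 - ((X[i] - X[hit]) / span) ** 2
    return w / n_samples   # features with large weights are deemed relevant
\end{verbatim}

Features whose averaged weight exceeds a chosen relevance threshold are kept.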
KIRA, K. and L.A. RENDELL, 1992. The feature selection problem: Traditional methods and a new algorithm. AAAI-92: Proceedings of the 10th National Conference on Artificial Intelligence [Cited by 279] (19.54/year)
@inproceedings{KiraRendell,
author = {K. Kira and L. A. Rendell},
title = {The feature selection problem: Traditional methods and a new algorithm},
booktitle = {AAAI-92: Proceedings of the 10th National Conference on Artificial Intelligence},
year = {1992},
editor = {W. Swartout},
pages = {129--134},
organization = {},
publisher = {AAAI Press/The MIT Press},
address = {},
month = {August},
note = {},
key = {},
abstract = {}
}
COUVREUR, C. and Y. BRESLER, 2000. On the optimality of the backward greedy algorithm for the subset selection problem. SIAM Journal on Matrix Analysis and Applications. [Cited by 35] (5.59/year)
DOAK, J., 1992. An Evaluation of Feature Selection Methods and Their Application to Computer Security. University of California. [Cited by 52] (3.65/year)
FUKUNAGA, K. and W. KOONTZ, 1970. Application of the Karhunen-Loeve expansion to feature selection and ordering. IEEE Trans. Computers. [Cited by 96] (2.65/year)
NARENDRA, P.M. and K. FUKUNAGA, 1977. A Branch and Bound Algorithm for Feature Subset Selection. IEEE Transactions on Computers. [Cited by 262] (8.95/year)
YANG, Y. and J.O. PEDERSEN, 1996. Feature Selection in Statistical Learning of Text Categorization. Center for Machine Translation, Carnegie Mellon University. [Cited by 74] (7.21/year)