Yıl: 2016 Cilt: 17 Sayı: 5 Sayfa Aralığı: 845 - 852 Metin Dili: İngilizce İndeks Tarihi: 29-07-2022

A MODIFIED T-SCORE FOR FEATURE SELECTION

Öz:
In this study, an alternative approach to t-score method, one of the feature selection methods, has been suggested and some analyses have been executed in order to compare t-score method and our approach. When comparing them, commonly used data sets in data mining studies, Arcene, Gisette and Madelon have been used. In line with the purpose of this study, the first 50, 100, 150 and 200 features for each data set has been selected, in consequence, 24 data subsets have been created. The classification accuracies of t-score and suggested method has been compared by using these data subsets. When calculating the classification accuracies, two commonly used methods in literature, Artificial Neural Networks and Support Vector Machines have been used. According to this study, the result of the suggested feature selection method is statistically more successful than t-score.
Anahtar Kelime:

Belge Türü: Makale Makale Türü: Araştırma Makalesi Erişim Türü: Erişime Açık
  • [1] Ladha L, Deepa T. Feature selection methods and algorithms, International Journal on Computer Science and Engineering 2011; 3: 1787-1797.
  • [2] Guyon I, Elisseeff A. An introduction to variable and feature selection. Journal of Machine Learning Research 2003; 3: 1157-1182.
  • [3] Forman G. An extensive empirical study of feature selection metrics for text classification, Journal of Machine Learning Research 2003; 3: 1289-1305.
  • [4] SPSS Inc, SPSS Clementine 12.0 Algorithms Guide. SPSS Inc, USA: Chicago, 2007.
  • [5] Saeys Y, Inza I, Larranaga P. A review of feature selection techniques in bioinformatics, Bioinformatics 2007; 23: 2507-2517.
  • [6] Yıldız O, Tez M, Bilge HŞ, Akcayol MA, Güler İ. Meme kanseri sınıflandırması için gen seçimi (Gene selection for breast cancer), IEEE 20nd Signal Processing and Communications Applications Conference; 18-20 April 2012; Muğla, Turkey: IEEE, p.p 988-991.
  • [7] Novakavic J, Strbac P, Bulatovic D. Toward optimal feature selection using ranking methods and classification algorithms, Yugoslav Journal of Operations Research 2011; 21: 119-135.
  • [8] Hall MA. Correlation-based Feature Selection for Machine Learning, PhD, The University of Waikato, New Zealand: Hamilton, 1999.
  • [9] Kira K, Rendell LA. The feature selection problem: traditional methods and a new algorithm, AAAI 1992; 2: 129-134.
  • [10] Ding C, Peng H. Minimum redundancy feature selection from microarray gene expression data, Journal of Bioinformatics and Computational Biology 2005; 3: 185-205.
  • [11] Whitney AW. A direct method of nonparametric measurement selection, IEEE Transactions on Computers 1972; 20: 1100-1103.
  • [12] Pudil P, Novovicova J, Kittler J. Floating search methods in feature selection, Pattern Recognition Letters 1994; 15: 1119-1125.
  • [13] Stearns SD. On selecting features for pattern classifiers, 3rd International Conference on Pattern Recognition; 8-11 November 1976; Coronado, California, USA: pp 71-75.
  • [14] Guyon I, Weston J, Barnhil S, Vapnik V. Gene Selection for cancer classification using support vector machines, Machine Learning 2002; 46: 389-422.
  • [15] Liu H, Yu L. Towards integrating feature selection algorithms for classification and clustering knowledge and data engineering, IEEE Transactions on Computers 2005; 17: 491-502.
  • [16] http://www.nipsfsc.ecs.soton.ac.uk/datasets/ [accessed 11.01.2015]
  • [17] http://www.nipsfsc.ecs.soton.ac.uk/papers/NIPS2003-Datasets.pdf [accessed 11.01.2015]
  • [18] Petricoin EF, Ardekani AM, Hitt BA, Levine PJ, Fusaro VA, Steinberg SM, & Liotta LA, Use of proteomic patterns in serum to identify ovarian cancer, The lancet 2002; 359: 572-577.
  • [19] LeCun Y, Bottou L, Bengio Y, Haffner P, Gradient-based learning applied to document recognition, Proceedings of the IEEE 1998; 86: 2278-2324.
  • [20] Perkins S, Lacker K, Theiler J. Grafting: Fast, Incremental feature selection by gradient descent in function space, The Journal of Machine Learning Research 2003; 3: 1333-1356.
  • [21] Holte R C. Very simple classification rules perform well on most commonly used datasets, Machine Learning 1993; 11: 63-91.
  • [22] Marill T, Green D M. On the effectiveness of receptors in recognition system, IEEE Trans. Inform. Theory 1963, 9: 11-17.
APA Budak H, Erpolat Taşabat S (2016). A MODIFIED T-SCORE FOR FEATURE SELECTION. , 845 - 852.
Chicago Budak Hüseyin,Erpolat Taşabat Semra A MODIFIED T-SCORE FOR FEATURE SELECTION. (2016): 845 - 852.
MLA Budak Hüseyin,Erpolat Taşabat Semra A MODIFIED T-SCORE FOR FEATURE SELECTION. , 2016, ss.845 - 852.
AMA Budak H,Erpolat Taşabat S A MODIFIED T-SCORE FOR FEATURE SELECTION. . 2016; 845 - 852.
Vancouver Budak H,Erpolat Taşabat S A MODIFIED T-SCORE FOR FEATURE SELECTION. . 2016; 845 - 852.
IEEE Budak H,Erpolat Taşabat S "A MODIFIED T-SCORE FOR FEATURE SELECTION." , ss.845 - 852, 2016.
ISNAD Budak, Hüseyin - Erpolat Taşabat, Semra. "A MODIFIED T-SCORE FOR FEATURE SELECTION". (2016), 845-852.
APA Budak H, Erpolat Taşabat S (2016). A MODIFIED T-SCORE FOR FEATURE SELECTION. Anadolu Üniversitesi Bilim ve Teknoloji Dergisi :A-Uygulamalı Bilimler ve Mühendislik, 17(5), 845 - 852.
Chicago Budak Hüseyin,Erpolat Taşabat Semra A MODIFIED T-SCORE FOR FEATURE SELECTION. Anadolu Üniversitesi Bilim ve Teknoloji Dergisi :A-Uygulamalı Bilimler ve Mühendislik 17, no.5 (2016): 845 - 852.
MLA Budak Hüseyin,Erpolat Taşabat Semra A MODIFIED T-SCORE FOR FEATURE SELECTION. Anadolu Üniversitesi Bilim ve Teknoloji Dergisi :A-Uygulamalı Bilimler ve Mühendislik, vol.17, no.5, 2016, ss.845 - 852.
AMA Budak H,Erpolat Taşabat S A MODIFIED T-SCORE FOR FEATURE SELECTION. Anadolu Üniversitesi Bilim ve Teknoloji Dergisi :A-Uygulamalı Bilimler ve Mühendislik. 2016; 17(5): 845 - 852.
Vancouver Budak H,Erpolat Taşabat S A MODIFIED T-SCORE FOR FEATURE SELECTION. Anadolu Üniversitesi Bilim ve Teknoloji Dergisi :A-Uygulamalı Bilimler ve Mühendislik. 2016; 17(5): 845 - 852.
IEEE Budak H,Erpolat Taşabat S "A MODIFIED T-SCORE FOR FEATURE SELECTION." Anadolu Üniversitesi Bilim ve Teknoloji Dergisi :A-Uygulamalı Bilimler ve Mühendislik, 17, ss.845 - 852, 2016.
ISNAD Budak, Hüseyin - Erpolat Taşabat, Semra. "A MODIFIED T-SCORE FOR FEATURE SELECTION". Anadolu Üniversitesi Bilim ve Teknoloji Dergisi :A-Uygulamalı Bilimler ve Mühendislik 17/5 (2016), 845-852.