Yıl: 2017 Cilt: 13 Sayı: 4 Sayfa Aralığı: 873 - 881 Metin Dili: İngilizce İndeks Tarihi: 29-07-2022

Web Proxy Log Data Mining System for Clustering Users and Search Keywords

Öz:
In this study, Internet users were clustered by the search keywords which they type into search bars of searchengines. Our proposed software is called UQCS (User Queries Clustering System) and it is developed todemonstrate the efficiency of our hypothesis. UQCS co-operates with the Strehl’s relationship based clusteringtoolkit and performs segmentation on users based on the keywords they use for searching the web. Internet Proxyserver logs were parsed and query strings were extracted from the search engine URL’s and the resulting IP-Termmatrix was converted into a similarity matrix using Euclidean, Jaccard, Cosine Distance and Pearson CorrelationDistance metrics. K- Means and graph-based OPOSSUM algorithm were used to perform clustering on thesimilarity matrices. Results were illustrated by using CLUSION visualization toolkit.
Anahtar Kelime:

Belge Türü: Makale Makale Türü: Araştırma Makalesi Erişim Türü: Erişime Açık
  • 1. Pal, S. K., Talwar, V., Mitra, P. Web mining in soft computing framework: relevance, state of the art and future directions. IEEE Transactions on Neural Networks, 2002, 13(5), 1163-1177.
  • 2. Etzioni, O. The World-Wide Web: quagmire or gold mine?, Communications of the ACM, 1996, 39(11), 65-68.
  • 3. Kosala, R., Blockeel, H., Web mining research: A survey. ACM Sigkdd Explorations Newsletter, 2000, 2(1), 1-15.
  • 4. Zhang, Q., Segall, R. S.,Web mining: a survey of current research, techniques, and software. International Journal of Information Technology & Decision Making, 2008, 7(04), 683-720.
  • 5. Zhang, C. L., Liu, Z. F., Yin, J. R., The Application Research on Web Log Mining in E-Marketing. In e-Business and Information System Security (EBISS), Harbin, China, 2010, pp. 1-4.
  • 6. Strehl, A. Relationship-based clustering and cluster ensembles for high-dimensional data mining, Doctoral dissertation, University of Texas, 2002.
  • 7. Strehl, A., Ghosh, J., Relationship-based clustering and visualization for high-dimensional data mining. INFORMS Journal on Computing, 2003, 15(2), 208-230.
  • 8. Karypis, G., Kumar, V., A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM Journal on scientific Computing, 1998, 20(1), 359-392.
  • 9. Zemberek 2 open source NLP library for Turkic languages, http://code.google.com/p/zemberek/, 2016, (accessed: 08.10.2016).
  • 10. Cluster Analysis and Cluster Ensemble Software, http://strehl.com/soft.html,2016, (accessed: 10.10. 2016)
APA BİLGİN T, AYTEKİN M (2017). Web Proxy Log Data Mining System for Clustering Users and Search Keywords. , 873 - 881.
Chicago BİLGİN Turgay Tugay,AYTEKİN Mustafa Koray Web Proxy Log Data Mining System for Clustering Users and Search Keywords. (2017): 873 - 881.
MLA BİLGİN Turgay Tugay,AYTEKİN Mustafa Koray Web Proxy Log Data Mining System for Clustering Users and Search Keywords. , 2017, ss.873 - 881.
AMA BİLGİN T,AYTEKİN M Web Proxy Log Data Mining System for Clustering Users and Search Keywords. . 2017; 873 - 881.
Vancouver BİLGİN T,AYTEKİN M Web Proxy Log Data Mining System for Clustering Users and Search Keywords. . 2017; 873 - 881.
IEEE BİLGİN T,AYTEKİN M "Web Proxy Log Data Mining System for Clustering Users and Search Keywords." , ss.873 - 881, 2017.
ISNAD BİLGİN, Turgay Tugay - AYTEKİN, Mustafa Koray. "Web Proxy Log Data Mining System for Clustering Users and Search Keywords". (2017), 873-881.
APA BİLGİN T, AYTEKİN M (2017). Web Proxy Log Data Mining System for Clustering Users and Search Keywords. Celal Bayar Üniversitesi Fen Bilimleri Dergisi, 13(4), 873 - 881.
Chicago BİLGİN Turgay Tugay,AYTEKİN Mustafa Koray Web Proxy Log Data Mining System for Clustering Users and Search Keywords. Celal Bayar Üniversitesi Fen Bilimleri Dergisi 13, no.4 (2017): 873 - 881.
MLA BİLGİN Turgay Tugay,AYTEKİN Mustafa Koray Web Proxy Log Data Mining System for Clustering Users and Search Keywords. Celal Bayar Üniversitesi Fen Bilimleri Dergisi, vol.13, no.4, 2017, ss.873 - 881.
AMA BİLGİN T,AYTEKİN M Web Proxy Log Data Mining System for Clustering Users and Search Keywords. Celal Bayar Üniversitesi Fen Bilimleri Dergisi. 2017; 13(4): 873 - 881.
Vancouver BİLGİN T,AYTEKİN M Web Proxy Log Data Mining System for Clustering Users and Search Keywords. Celal Bayar Üniversitesi Fen Bilimleri Dergisi. 2017; 13(4): 873 - 881.
IEEE BİLGİN T,AYTEKİN M "Web Proxy Log Data Mining System for Clustering Users and Search Keywords." Celal Bayar Üniversitesi Fen Bilimleri Dergisi, 13, ss.873 - 881, 2017.
ISNAD BİLGİN, Turgay Tugay - AYTEKİN, Mustafa Koray. "Web Proxy Log Data Mining System for Clustering Users and Search Keywords". Celal Bayar Üniversitesi Fen Bilimleri Dergisi 13/4 (2017), 873-881.