Research Article Open Access

An Application of Session Based Clustering to Analyze Web Pages of User Interest from Web Log Files

C. P. Sumathi, R. Padmaja Valli and T. Santhanam

Abstract

Problem statement: With the continued growth of e-commerce, Web services and Web-based information systems, the volume of click-stream and user data that Web-based organizations collect in their daily operations has grown enormously. Analyzing such data can help these organizations optimize the functionality of Web-based applications and provide more personalized content to visitors. This type of analysis involves the automatic discovery of users' interest in web pages from usage data, which is often stored in web and application server access logs. Approach: The usage interest in web pages across sessions was partitioned into clusters, such that sessions with “similar” interest were placed in the same cluster, using the expectation maximization clustering technique discussed in this study. Results: The approach generates usage profiles and automatically identifies the user interest in each profile. Conclusion: The results can help organizations improve their web sites based on users' navigational interest and recommend page(s) not yet visited by a user.
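
The session clustering idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the feature representation or the mixture model used, so everything below is an assumption made for illustration: each session is a binary page-visit vector, the clusters are components of a Bernoulli mixture fitted by expectation maximization, and the names `PAGES`, `SESSIONS`, `em_bernoulli_mixture` and `recommend` are hypothetical. It is not the authors' implementation.

```python
import math

# Hypothetical toy data: 1 means the page was visited in that session.
PAGES = ["home", "products", "cart", "blog", "about"]
SESSIONS = [
    [1, 1, 1, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 1, 1],
    [1, 0, 0, 1, 1],
    [0, 0, 0, 1, 0],
]


def em_bernoulli_mixture(sessions, k, iterations=50, eps=1e-6):
    """Fit a k-component Bernoulli mixture to binary session vectors with EM."""
    n, d = len(sessions), len(sessions[0])

    # Deterministic farthest-first seeding (an assumption, chosen for repeatability).
    seeds = [0]
    while len(seeds) < k:
        def dist_to_seeds(i):
            return min(
                sum(a != b for a, b in zip(sessions[i], sessions[s])) for s in seeds
            )
        seeds.append(max(range(n), key=dist_to_seeds))

    weights = [1.0 / k] * k
    # Softened copies of the seed sessions serve as initial page probabilities.
    probs = [[0.75 if x else 0.25 for x in sessions[s]] for s in seeds]

    resp = []
    for _ in range(iterations):
        # E-step: responsibility of each cluster for each session (in log space).
        resp = []
        for x in sessions:
            log_p = []
            for c in range(k):
                lp = math.log(weights[c])
                for j in range(d):
                    p = probs[c][j]
                    lp += math.log(p if x[j] else 1.0 - p)
                log_p.append(lp)
            top = max(log_p)
            unnorm = [math.exp(v - top) for v in log_p]
            total = sum(unnorm)
            resp.append([u / total for u in unnorm])

        # M-step: re-estimate mixing weights and per-page visit probabilities.
        # The eps terms keep every probability strictly inside (0, 1).
        for c in range(k):
            raw = sum(r[c] for r in resp)
            weights[c] = (raw + eps) / (n + k * eps)
            for j in range(d):
                visited = sum(r[c] * x[j] for r, x in zip(resp, sessions))
                probs[c][j] = (visited + eps) / (raw + 2 * eps)

    # Hard assignment: the cluster with the highest responsibility.
    labels = [max(range(k), key=lambda c: r[c]) for r in resp]
    return weights, probs, labels


def recommend(session, cluster_probs, pages, threshold=0.5):
    """Pages that are popular in the session's cluster but not yet visited."""
    return [
        page
        for page, seen, p in zip(pages, session, cluster_probs)
        if not seen and p >= threshold
    ]


weights, probs, labels = em_bernoulli_mixture(SESSIONS, k=2)
for session, label in zip(SESSIONS, labels):
    print(label, session, recommend(session, probs[label], PAGES))
```

Under these assumptions, each row of `probs` plays the role of a usage profile: the pages with high probability in a cluster are the pages of interest for that profile, and `recommend` lists the profile's popular pages that a given session has not yet visited.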

Journal of Computer Science
Volume 6 No. 7, 2010, 785-793

DOI: https://doi.org/10.3844/jcssp.2010.785.793

Submitted On: 4 May 2010 Published On: 31 July 2010

How to Cite: Sumathi, C. P., Valli, R. P. & Santhanam, T. (2010). An Application of Session Based Clustering to Analyze Web Pages of User Interest from Web Log Files. Journal of Computer Science, 6(7), 785-793. https://doi.org/10.3844/jcssp.2010.785.793

Keywords

  • Web usage mining
  • Expectation maximization
  • Usage profile
  • Web page interest