AusWeb07
 

Integrating Markov Model with Clustering for Predicting Web Page Accesses

Faten Khalil, Doctoral Student, Department of Mathematics & Computing, University of Southern Queensland, Toowoomba, Australia, Email: khalil@usq.edu.au

Jiuyong Li, Associate Professor, School of Computer and Information Science, University of South Australia, Mawson Lakes, Australia, Email: Jiuyong.Li@unisa.edu.au

Hua Wang, Senior Lecturer, Department of Mathematics and Computing, University of Southern Queensland, Toowoomba, Australia, Email: wang@usq.edu.au


Keywords

Markov model, clustering, Web page prediction, k-means, distance measures.


Abstract

Predicting the next page to be accessed by Web users has attracted a large amount of research work lately due to the positive impact of such prediction on different areas of Web based applications. Major techniques applied for this intention are Markov model and clustering. Low order Markov models are coupled with low accuracy, whereas high order Markov models are associated with high state space complexity. On the other hand, clustering methods are unsupervised methods, and normally are not used for classification directly. This paper involves incorporating clustering with low order Markov model techniques. The pre-processed data is divided into meaningful clusters then the clusters are used as training data while performing 2nd order Markov model techniques. Different distance measures of k-means clustering algorithm are examined in order to find an optimal one. Experiments reveal that incorporating clustering of Web documents according to Web services with low order Markov model improves the web page prediction accuracy.


[ Full Paper ] [ Presentation ] [ Proceedings ] [ AusWeb Home Page ]

 

 

 

 

All materials Copyright AusWeb07. The Thirteenth Australasian World Wide Web Conference, Pacific Bay Resort , 30th June to 4th July.
Contact: Norsearch Conference Services +61 2 66 20 3932 (outside Australia) (02) 6620 3932 (inside Australia) Fax (02) 6626 9317