AusWeb 05 Banner

Mining User Preferences from Web Access Logs and User Public Information

Laura Thomson, Lecturer, Web Discipline, School of Computer Science and Information Technology, RMIT University, GPO Box 2476V, Melbourne, 3001. Email: laura@cs.rmit.edu.au


Keywords

web mining, data mining, personalized documents


Abstract

Previous studies of web access logs for user profiling and hence document personalization conducted analyses based principally on web site content, usage, and topology. In this work the analysis was conducted on the basis of what could transparently be learned about the end user. The user's IP address was used to obtain publicly available information about the user, specifically their geographical location, top level domain, and the contents of their organizational homepage. This personal information was then combined with standard content and usage analysis. Standard data mining algorithms were applied to web logs from an academic website using a generalization-based clustering feature set with the various types of user data used as the class variable. A number of useful nuggets of information were found, demonstrating the feasibility of this approach.


[ Full Paper ] [ Presentation ] [ Proceedings ] [ AusWeb Home Page ]

 

All materials Copyright AusWeb05. The Eleventh Australasian World Wide Web Conference, Royal Pines Resort, Gold Coast, from 2nd to 6th July 2005 Contact: Norsearch Conference Services +61 2 66 20 3932 (from outside Australia) (02) 6620 3932 (from inside Australia) Fax (02) 6626 9317