Original Articles

An Efficient Algorithm for Extracting High Utility Itemsets from Weblog Data

Brijesh Bakariya & G.S. Thakur

ABSTRACT

A high utility itemset is a set of items that has high utility, such as profit, in a database. High utility itemsets play a crucial role in real-life applications. In recent years, various algorithms have been proposed for finding high utility itemsets, but they are not fully satisfactory in terms of time and space requirements. In the data mining field, high utility itemsets can be found in different categories of data, such as time series and categorical data. Log data is useful for analysing different aspects of user behaviour. In this paper, we propose an algorithm named HUIM (High Utility Itemsets Mining) and construct an HUI-FP (High Utility Itemsets-Frequent Pattern) tree for efficiently mining high utility itemsets from a log database. The behaviour of a user can be predicted from the utility of every visited page. We also propose a pattern generation technique based on cosine similarities among itemsets. These techniques generate strong patterns and customize user profiles according to those patterns. The proposed algorithm performs better than the previous state-of-the-art algorithm for high utility itemset generation.
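To make the two central notions of the abstract concrete, the following is a minimal sketch of how itemset utility and cosine similarity between itemsets might be computed on weblog sessions. It is not the HUIM algorithm or the HUI-FP tree, whose details are not given on this page; it is a brute-force enumeration based on the usual definition of utility (internal utility times external utility). The session data, the `weight` table, and all function names are hypothetical, and treating itemsets as binary page vectors for the cosine measure is an assumption.

```python
from itertools import combinations
from math import sqrt

# Hypothetical weblog sessions: page -> internal utility (e.g. seconds spent on the page).
sessions = [
    {"home": 10, "products": 30, "cart": 5},
    {"home": 5, "products": 20},
    {"products": 40, "cart": 10, "checkout": 15},
]
# Hypothetical external utility (importance weight) of each page.
weight = {"home": 1, "products": 2, "cart": 3, "checkout": 5}


def utility(itemset, session):
    """Utility of an itemset in one session; 0 if the session lacks any of its pages."""
    if not all(page in session for page in itemset):
        return 0
    return sum(session[page] * weight[page] for page in itemset)


def high_utility_itemsets(sessions, min_util):
    """Brute-force search for itemsets whose total utility is at least min_util."""
    pages = sorted({page for session in sessions for page in session})
    result = {}
    for size in range(1, len(pages) + 1):
        for itemset in combinations(pages, size):
            total = sum(utility(itemset, session) for session in sessions)
            if total >= min_util:
                result[itemset] = total
    return result


def cosine_similarity(a, b):
    """Cosine similarity of two itemsets viewed as binary page vectors (an assumption)."""
    a, b = set(a), set(b)
    if not a or not b:
        return 0.0
    return len(a & b) / sqrt(len(a) * len(b))
```

Calling `high_utility_itemsets(sessions, 180)` returns every itemset whose summed utility over all sessions reaches the threshold. The enumeration is exponential in the number of pages, which is the cost that tree-based approaches such as the one proposed in the paper aim to avoid.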

Additional information

Notes on contributors

Brijesh Bakariya

Brijesh Bakariya received his undergraduate degree from Barkatullah University, Bhopal, MP, in 2005, and his postgraduate degree in Computer Applications from Devi Ahilya Vishwavidyalaya, Indore, MP, in 2009. He is currently pursuing a PhD degree in the Department of Computer Applications, Maulana Azad National Institute of Technology, Bhopal, MP. His research interests include web mining and clustering.

G.S. Thakur

Ghanshyam Singh Thakur received his BSc degree from Dr. Harisingh Gour University, Sagar, MP, in 2000. He received his MCA degree from Pt. RaviShankar Shukla University, Raipur, CG, in 2003, and his PhD degree from Barkatullah University, Bhopal, MP, in 2009. He is an assistant professor in the Department of Computer Applications, Maulana Azad National Institute of Technology, Bhopal, MP, India. He has eight years of teaching and research experience and has published 26 articles in national and international journals. His research interests include text mining, document clustering, information retrieval, and data warehousing. He is a member of the CSI, IAENG, and IACSIT.
