An Efficient Algorithm for Extracting High Utility Itemsets from Weblog Data: IETE Technical Review: Vol 32, No 2

287

Views

CrossRef citations to date

Altmetric

ABSTRACT

High utility itemset refers to those set of items which has high utility such as profit in a database. High utility of itemset plays a crucial role in real life. In recent years, various algorithms have been proposed for finding high utility itemset but unfortunately they are not completely relevant at the time and space point of view. In the data mining field, high utility itemset can be found in different categories of data like time series, categorical, etc. Log data is useful for finding behaviour of the user in different aspects. In this paper, we have proposed an algorithm named HUIM (High Utility Itemsets Mining) and construct HUI-FP (High Utility Itemsets-Frequent Pattern) Tree for efficiently mining high utility itemsets from log database. The behaviour of the user can be predicted through the high utility of every visited page. We have also proposed pattern generation technique based on cosine similarities among itemsets. These techniques generate strong patterns, and customized users profile according to that pattern. The proposed algorithm is better than the previous state of the art algorithm for high utility itemset generation.

Keywords:

Additional information

Notes on contributors

Brijesh Bakariya

Brijesh Bakariya received his graduation degree from Barkatullah University, Bhopal, MP in 2005, and his post-graduation degree in Computer Applications from Devi Ahilya Vishwavidyalaya, Indore, MP in the year 2009. He is currently pursuing a PhD degree in the Department of Computer Applications, Maulana Azad National Institute of Technology, Bhopal, MP. His research interests include web mining and clustering.

Email: [email protected]

G.S. Thakur

Ghanshyam Singh Thakur has received BSc degree from Dr. Harisingh Gour University, Sagar, MP in 2000. He has received MCA degree in 2003 from Pt. RaviShankar Shukla University, Raipur, CG and PhD degree from Barkhatullah University, Bhopal, MP in the year 2009. He is assistant professor in the Department of Computer Applications, Maulana Azad National Institute of Technology, Bhopal, MP, India. He has eight years of teaching and research experience. He has 26 published articles in national and international journals. His research interests include text mining, document clustering, information retrieval, and data warehousing. He is a member of the CSI, IAENG, and IACSIT.

Email: [email protected]

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

An Efficient Algorithm for Extracting High Utility Itemsets from Weblog Data

Notes on contributors

Brijesh Bakariya

G.S. Thakur

Information for

Open access

Opportunities

Help and information

An Efficient Algorithm for Extracting High Utility Itemsets from Weblog Data

ABSTRACT

Additional information

Notes on contributors

Brijesh Bakariya

G.S. Thakur

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature