Recommendation system for web article based on association rules and topic modelling

Authors

  • Guntur Budi Herwanto Universitas Gadjah Mada
  • Annisa Maulida Ningtyas Universitas Gadjah Mada

Keywords:

Website, Recommendation, Topic Modelling, Latent Dirichlet Allocation, Association Rule

Abstract

The World Wide Web is now the primary source for information discovery. A user visits websites that provide information and browse on the particular information in ac-cordance with their topic interest. Through the navigational process, visitors often had to jump over the menu to find the right content. Recommendation system can help the visitors to find the right content immediately. In this study, we propose a two-level recommendation system, based on association rule and topic similarity. We generate association rule by applying Apriori algorithm. The dataset for association rule mining is a session of topics that made by combining the result of sessionization and topic modeling. On the other hand, the topic similarity made by comparing the topic proportion of web article. This topic proportion inferred from the Latent Dirichlet Allocation (LDA). The results show that in our dataset there are not many interesting topic relations in one session. This result can be resolved, by utilizing the second level of recommendation by looking into the article that has the similar topic.

References

U. Gretzel, “Intelligent systems in tourism,†Ann. Tour. Res., vol. 38, no. 3, pp. 757–779, Jul. 2011 [Online]. Available: https://linkinghub.elsevier.com/retrieve/pii/S0160738311000776

O. Arbelaitz, I. Gurrutxaga, A. Lojo, J. Muguerza, J. M. Pérez, and I. Perona, “Web usage and content mining to extract knowledge for modelling the users of the Bidasoa Turismo website and to adapt it,†Expert Syst. Appl., vol. 40, no. 18, pp. 7478–7491, Dec. 2013 [Online]. Available: https://linkinghub.elsevier.com/retrieve/pii/S0957417413005198

D. Buhalis and R. Law, “Progress in information technology and tourism management: 20 years on and 10 years after the Internet—The state of eTourism research,†Tour. Manag., vol. 29, no. 4, pp. 609–623, Aug. 2008 [Online]. Available: https://linkinghub.elsevier.com/retrieve/pii/S0261517708000162

B. Pan and D. R. Fesenmaier, “Travel information search on the internet: a preliminary analysis.,†2003, pp. 242–251.

J. Borges and M. Levene, “Data Mining of User Navigation Patterns,†pp. 92–112, 2007.

A. S. Lalani, “Data Mining of Web Access Logs,MS Thesis,†pp. 1–70, 2003.

S. S. Rawat and L. Rajamani, “Discovering Potential User Browsing Behaviors Using Custom-Built Apriori Algorithm,†Int. J. Comput. Sci. Inf. Technol., vol. 2, no. 4, pp. 28–37, 2010.

B. Liu, Web Data Mining. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011 [Online]. Available: http://link.springer.com/10.1007/978-3-642-19460-3

M. Dimitrijevic and Z. Bosnjak, “Pruning statistically insignificant association rules in the presence of high-confidence rules in web usage data,†Procedia Comput. Sci., vol. 35, no. C, pp. 271–280, 2014 [Online]. Available: http://dx.doi.org/10.1016/j.procs.2014.08.107

R. Agrawal and R. Srikant, “Fast Algorithms for Mining Association Rules in Large Databases,†in Proceedings of the 20th International Conference on Very Large Data Bases, 1994, pp. 487–499 [Online]. Available: http://dl.acm.org/citation.cfm?id=645920.672836

E. Lazcorreta, F. Botella, and A. Fernández-Caballero, “Towards Personalized Recommendation by Two-step Modified Apriori Data Mining Algorithm,†Expert Syst. Appl., vol. 35, no. 3, pp. 1422–1429, 2008 [Online]. Available: http://dx.doi.org/10.1016/j.eswa.2007.08.048

D. M. Blei, “www.cs.princeton.edu/~blei/papers/Blei2012.pdf,†Cs.Princeton.Edu, pp. 77–84 [Online]. Available: http://www.cs.princeton.edu/~blei/papers/Blei2012.pdf%5Cnpapers2://publication/uuid/228B8D31-9CA0-447B-A144-3D6B0EA97493

O. Arbelaitz, I. Gurrutxaga, A. Lojo, J. Muguerza, J. M. Pérez, and I. Perona, “Enhancing a Web Usage Mining based Tourism Website Adaptation with Content Information.â€

M. Zanker, M. Fuchs, W. Höpken, M. Tuta, and N. Müller, “Evaluating Recommender Systems in Tourism — A Case Study from Austria,†2008, pp. 24–34.

J. Vellingiri and S. C. Pandian, “A Survey on Web Usage Mining,†Glob. J. Comput. Sci. Technol., vol. 11, no. 4, pp. 67–72, 2011.

Sunil and M. Doja, Recommender System Based on Web Usage Mining for Personalized E-learning Platforms, vol. 5. 2017.

D. M, A. G. M. Erwin, and N. D, “2013, News Recommendation in Indonesian Language Based on User Click Behaviour.â€

N. More and N. P. More, “Recommendation of Books Using Improved Apriori Algorithm,†Int. J. Innov. Res. Sci. Technol, vol. 1, no. 4, pp. 80–82, 2014.

H. . Husin, “News Recommendation Based on Web Usage and Web Content Mining,†in ICDE Workshops, 2013.

N. Dave, K. Potts, V. Dinh, and H. U. Asuncion, “Combining association mining with topic modeling to discover more file relationships,†Int. J. Adv. Softw., vol. 7, no. 3, pp. 3–4, 2014.

Downloads

Published

2018-07-08

Issue

Section

Articles