Mining RDF (Linked Data) using Éclat algorithm
DOI:
https://doi.org/10.25098/3.2.21Keywords:
RDF, Data Mining, Linked Data, Éclat, SPARQLAbstract
Basket market analysis is one of the most widely used groups of data mining and have been extensively utilized for analyzing data to extract interesting information from huge amount of data. Also, Studies over the past two decades have provided important information on semantic web as it is part of (World Wide Web Consortium) W3C. Both data mining and semantic web have several key features to mine semi-structured dataset and having an accurate result. The methodological approach taken in this study is combining both Éclat algorithm and RDF (Resource Description Framework) dataset based on the process of converting RDF into dataset and mining it. Firstly, RDF data is checked for validation, and then it needs to convert into traditional dataset. This process requests SPARQL as a query language. Thus, it needs to imply Éclat (Equivalence class Transformation) algorithm on traditional dataset. This experiment illustrates that semantic web and data mining have significant results in mining semi-structured dataset. This paper hands out how mixing RDF and Éclat algorithm is influent. For this technique different data source can be used, however, for this paper particularly products in a supermarket are going to use as a main dataset.
References
AbedjanEmail, Z., & Naumann, F. (2013). Improving RDF Data Through Association Rule Mining. Datenbank Spektrum , 13 (2).
Aggarwal, C. C., & Han, J. (2014). Frequent Pattern Mining. Springer.
Baratia, M., Baia, Q., & Liub, Q. (2017). Mining semantic association rules from RDF data . 133, 183-196 .
Borgelt, C. (2012). Frequent item set mining . WIREs Data Mining And Knowledge discovery , 2 (6), 437–456.
DECKER, S., MELNIK, S., HARMELEN, F. V., FENSEL, D., KLEIN, A. M., BROEKSTRA, J., et al. (2000). The Semantic Web: The Roles of XML and RDF. IEEE Internet and Computing , 4 (5), 63-74.
DuCharme, B. (2013). Learning SPARQL: Querying and Updating with SPARQL 1.1 (Second edition ed.). O'Reilly Media.
Gayo, o. E., Prud'hommeaux, E., Boneva, I., & Kontokostas, D. (2018). Validating RDF Data (Vol. 7). Morgan & Claypool.
Han, J., Pei, J., & Kamber, M. (2012). Data Mining: Concepts and Techniques . Waltham, MA: Elsevier Inc.
Hidber, C. (1999). Online Association Rule Mining . Proceeding SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data , 28 (2), 145-156 .
kaur, M., & Grag, U. (2014). ECLAT Algorithm for Frequent Item sets Generation. International Journal of Computer Systems , 1 (2).
Koh, Y. S., & Rountree, N. (2010). Rare Association Rule Mining and Knowledge Discovery: Technologies for Infrequent and Critical Event Detection . SCOPUS.
Maimon, O., & Rokach, L. (2005). Data Mining and Knowledge Discovery Handbook . springer.
Mohammed, W. M., & Saraee, M. M. (2016). Mining Semantic Web Data Using K-means Clustering Algorithm. British Journal of Mathematics & Computer Science , 13 (1), 1-14.
Mohammed, W., & Saraee, M. M. (2016). Sematic Web Mining Using Fuzzy C-means Algorithm. 16 (4), 1-16.
ReynaudEmail, J., Toussaint, Y., & Napoli, A. (2019). Using Redescriptions and Formal Concept Analysis for Mining Definitions in Linked Data. International Conference on Formal Concept Analysis , 11511, 241-256.
Shimada, K., Hirasawa, K., & Hu, J. (2006). Association Rule Mining with Chi-Squared Test Using Alternate Genetic Network Programming . Advances in Data Mining. Applications in Medicine, Web Mining, Marketing, Image and Signal Mining , 202-216.
Venkata, N. p., Kappara, P., & Ichise, R. (2011). LiDDM: A Data Mining System for Linked Data . Workshop on Linked Data on the Web , 813.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
SJCUS's open access articles are published under a Creative Commons Attribution CC-BY-NC-ND 4.0 license.
