Hướng phát triển 58 - Khai thác luật tuần tự trên- 123docz.net

Luật tuần tự rất hữu ích trong việc khám phá những tri thức tiềm ẩn trong các nguồn dữ liệu ở dạng tuần tự. Tuy nhiên, với tình trạng bùng nổ thông tin hiện nay, khối lượng dữ liệu ngày càng trở nên đồ sộ, việc khai thác tập luật tuần tự sao cho

hiệu quả và tốn ít thời gian nhất là cần thiết. Do vậy, luận văn hướng tới việc cải thiện thuật toán hơn nữa đểđạt tốc độ tối ưu hơn.

Trong một số lĩnh vực ứng dụng, cần thiết phải khai thác những luật tuần tự

thiết yếu nhất đó là luật tuần tự không dư thừa. Vì vậy, nghiên cứu phương pháp khai thác luật tuần tự không dư thừa dựa trên cây tiền tố.

Hơn nữa, phát triển phương pháp sinh luật dựa trên cây tiền tố vào bài toán khai thác luật thú vị.

Đi sâu vào tính ứng dụng của bài toán khai thác luật tuần tự trên CSDL chuỗi.

TÀI LIỆU THAM KHẢO

Tiếng Anh

[1] Agrawal, R., Imielinski, T., and Swami, A. N. (1993), “Mining association rules between sets of items in large databases”, in Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, 207-216. [2] Agrawal, R., Srikant, R. (1994), “Fast algorithms for mining association

rules”, in: Proceedings of International Conference on Very Large Data Bases, pp. 487–499.

[3] Agrawal, R., Srikant, R. (1995), “Mining Sequential Patterns”, in: Proc. 11th Int’l Conf, Data Engineering, pp. 3–14.

[4] Ayres, J., Gehrke, J.E., Yiu, T., Flannick, J. (2002), “Sequential Pattern Mining using a Bitmap Representaion”, in SIGKDD Conf., pp. 1–7.

[5] Baralis, E., Chiusano, S., Dutto, R. (2008), “Applying Sequential Rules to Protein Localization Prediction”, in: Computer and Mathematics with Applications 55 (5), pp. 867–878.

[6] Berry, M.J., Linoff, G.S. (1997), “Data Mining Techniques for Marketing, Sales and Customer Support”, in: John Wiley & Sons.

[7] Brin, S., Motwani, R., Ullman, J., Tsur, S. (1997), “Dynamic Itemset Counting and Implication Rules for Market Basket Data”, In Proc. of the 1997 ACM- SIGMOD Int’l Conf. on the Management of Data, pp. 255-264.

[8] Dong, G., Pei, J. (2007), Sequence Data Mining, Springer Science + Business Media, LLC.

[9] Gouda, K., Hassaan, M., Zaki, M.J. (2010), “Prism: A Primal-Encoding Approach for Frequent Sequence Mining”, Journal of Computer and System Sciences, Volume 76, Issue 1, pp. 88-102.

[10] Han, J., Pei, J., Mortazavi-Asl, B., Chen, Q., Dayal, U., and Hsu, M.C., (2000), “Freespan: Frequent pattern-projected sequential pattern mining”, in Proc. 2000 Int. Conf. Knowledge Discovery and Data Mining (KDD’00), pp. 355–359.

[11] Han, J., Kamber, M., Data Mining : Concepts and Techniques 2nd Edition. [12] Hegland, M. (2003), “Algorithms for Association Rules”, Lecture Notes in

Computer Science, Volume 2600, Jan 2003, Pages 226 – 234.

[13] Lo, D., Khoo, S.C., Liu, C. (2008), “Efficient Mining of Recurrent Rules from a Sequence Database”, in: Proceedings of International Conference on Database Systems for Advanced Applications, pp. 67–83.

[14] Lo, D., Khoo, S.C., Wong, L. (2009), “Non-Redundant Sequential Rules- Theory and Algorithm”, in: Information Systems, Volume 34, Issue 4-5, pp. 438-453.

[15] Lo, D., Khoo, S.-C. (2006), “SMArTIC: Toward Building an Accurate, Robust and Scalable Specification Miner”, in: Proceedings of SIGSOFT Sympo- sium on the Foundations of Software Engineering, pp. 265–275.

[16] Mannila, H., Toivonen, H., Verkamo, A.I. (1997) Discovery of frequent episodes in event sequences. DMKD 1, 259–289.

[17] Masseglia, F., Cathala, F., & Poncelet, P. (1998), “The PSP Approach for Mining Sequential Patterns”, Proceedings of the 2nd European Symposium on Principles of Data Mining and Knowledge Discovery, Nantes, France, 176- 184.

[18] Pei, J., et al. (2004), “Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach”, in: IEEE Trans. Knowledge and Data Engineering 16(10), pp. 1424–1440.

[19] Spiliopoulou, M. (1999), “Managing Interesting Rules in Sequence Mining”, in: Proceedings of European Conference on Principles of Data Mining and Knowledge Discovery, pp. 554–560.

- 62 -

[20] Srikant, R., Agrawal, R. (1996), “Mining Sequential Patterns: Generalizations and Performance Improvements”, in: Proc. 5th Int’l Conf. Extending Database Technology, pp.3–17.

[21] Yan, X., Han, J., and Afshar, R. (2003), “CloSpan: Mining Closed Sequential Patterns in Large Databases”, in: Proceedings of SIAM International Conference on Data Mining

[22] Yang, J., Evans, D.,Bhardwaj, D., Bhat, T., Das, M. (2006), “Mining temporal API Rules from Imperfect Traces, in: Proceedings of International Conference on Software Engineering, 2006, pp. 282–291.

[23] Zaki, M.J. (2000), “SPADE: An Efficient Algorithm for Mining Frequent Sequences”, Machine Learning Journal 42(1/2), pp. 31–60.

[24] Zhang, M., Hsu, W., Lee, M. -L. (2006), “Mining progressive confident rules”, in: Proceedings of SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 803–808.