1. Trang chủ
  2. » Công Nghệ Thông Tin

Tài liệu Data Streams Models and Algorithms- P4 doc

30 354 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 30
Dung lượng 1,65 MB

Nội dung

Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. Please purchase PDF Split-Merge on www.verypdf.com to remove this watermark. [...]... 102 DATA STREAMS: MODELS AND ALGORITHMS [16] Hulten G., Spencer L., Domingos P (2001) Mining Time Changing Data Streams ACM KDD Conference [17] Jain A., Dubes R (1998) Algorithms for Clustering Data, Prentice Hall, New Jersey [18] Kifer D., David S.-B., Gehrke J (2004) Detecting Change in Data Streams VLDB Conference,2004 [19] Roddick J F et a1 (2000) Evolution and Change in Data Management: Issues and. .. Community Evolution in Data Streams ACM SIAM Conference on Data Mining [3] Aggarwal C (2003) A Framework for Diagnosing Changes in Evolving Data Streams ACM SIGMOD Conference [4] Aggarwal C (2002) An Intuitive Framework for understanding Changes in Evolving Data Streams IEEE ICDE Conference [5] Aggarwal C., Han J., Wang J., Yu P (2003) A Framework for Clustering Evolving Data Streams VLDB Conference... different from and orthogonal to the work in [13, 141 Specifically, the work in [13,14] is focussed on the effects of evolution on data mining models and algorithms While these results show some interesting results in terms of generalizing existing data mining algorithms, our view is that data streams have special mining requirements which cannot be satisfied by using existing data mining models and algorithms... dynamically, and changing rapidly Due to limited resources available and the usual requirements of fast response, most data streams may not be fully stored and may only be examined in a single pass These characteristics of stream data have been emphasized and explored in their investigationsby many researchers, such as ([6, 8, 17, 18, 16]), and efficient stream data querying, counting, clustering and classification... which the data is being reduced, those at which the data is increasing, and those from where the data is shifting to other locations: DEFINITION A data coagulationfor timeslice t and user defined threshold 5.1 min-coag is defined to be a connected region R in the data space, so that for eachpoint X E R, we have yh,,ht) t ) > min-coag > 0 (X, Thus, a data coagulation is a connected region in the data which... stream data Three importanttechniques are proposed for lease purchase PDF Split-Merge on www.verypdf.com to remove this watermark DATA STREAMS: MODELS AND ALGORITHMS the design and implementation of stream cubes First, a tilted timeframe model is proposed to register time-related data in a multi-resolution model: The more recent data are registered at finer resolution, whereas the more distant data. .. reversed and the data stream arrived in reverse order, starting at t ht and ending at t Examples of the forward and reverse density profiles are illustrated in Figures 5.1 and 5.2 respectively For a given spatial location X and time T , let us examine the nature of the functions F(h,,ht)( X ,T ) and R(h,,ht)( X ,T - ht) Note that both functions are almost exactly the same, and use the same data points... conjunction with data mining algorithms In this section, we will discuss the effects of evolution on data mining algorithms The problem of mining incremental data dynamically has often been studied in many data mining scenarios [7, 10, 12, 241 However, many of these methods are often lease purchase PDF Split-Merge on www.verypdf.com to remove this watermark 98 DATA STREAMS: MODELS AND ALGORITHMS not... amount of data in order to provide effective results, and historical clusters do provide good insights about the future clusters in the data Therefore, it makes more sense to use all the data, but with an application specific decaybased approach which provides the new data greater weight than the older data On the other hand, in problems such as classification, the advantages of using more data is much... estimation and its application to different kinds of visual representations of changes in the underlying data We also discussed the problem of online community evolution in fast data streams In many of these methods, clustering is a key component since it allows us to summarize the data effectively We also studied the reverse problem of how data mining models are maintained when the underlying data changes

Ngày đăng: 15/12/2013, 13:15

Nguồn tham khảo

Tài liệu tham khảo Loại Chi tiết
[13] Ganti V., Gehrke J., Ramakrishnan R (2002). Mining Data Streams under Block Evolution. ACM SIGKDD Explorations, 3(2), 2002 Sách, tạp chí
Tiêu đề: ACM SIGKDD Explorations
Tác giả: Ganti V., Gehrke J., Ramakrishnan R
Năm: 2002
[17] Jain A., Dubes R. (1998). Algorithms for Clustering Data, Prentice Hall, New Jersey Sách, tạp chí
Tiêu đề: Prentice Hall
Tác giả: Jain A., Dubes R
Năm: 1998
[18] Kifer D., David S.-B., Gehrke J. (2004). Detecting Change in Data Streams. VLDB Conference, 2004 Sách, tạp chí
Tiêu đề: Conference
Tác giả: Kifer D., David S.-B., Gehrke J
Năm: 2004
[19] Roddick J. F. et a1 (2000). Evolution and Change in Data Management: Issues and Directions. ACM SIGMOD Record, 29(1): pp. 21-25 Sách, tạp chí
Tiêu đề: ACM SIGMOD Record
Tác giả: Roddick J. F. et a1
Năm: 2000
[20] Roddick J. F., Spiliopoulou M (1999). A Bibliography of Temporal, Spa- tial, and Spatio-Temporal Data Mining Research. ACM SIGKDD Explo- rations, l(1) Sách, tạp chí
Tiêu đề: ACM SIGKDD Explo- rations
Tác giả: Roddick J. F., Spiliopoulou M
Năm: 1999
[23] Silverman B. W. (1 986). Density Estimation for Statistics and Data Anal- ysis. Chapman and Hall Sách, tạp chí
Tiêu đề: Density Estimation for Statistics and Data Anal- ysis
[25] Vitter J. S. (1985) Random Sampling with a Reservoir. ACM Transactions on Mathematical Software, Vol. 11(1), pp 37-57 Sách, tạp chí
Tiêu đề: ACM Transactions on Mathematical Software
[2] Aggarwal C., Yu P. S (2005). Online Analysis of Community Evolution in Data Streams. ACM SIAM Conference on Data Mining Khác
[3] Aggarwal C (2003). A Framework for Diagnosing Changes in Evolving Data Streams. ACM SIGMOD Conference Khác
[4] Aggarwal C (2002). An Intuitive Framework for understanding Changes in Evolving Data Streams. IEEE ICDE Conference Khác
[5] Aggarwal C., Han J., Wang J., Yu P (2003). A Framework for Clustering Evolving Data Streams. VLDB Conference Khác
[6] Aggarwal C., Han J., Wang J., Yu P (2004). A Framework for High Dimen- sional Projected Clustering of Data Streams. VLDB Conference Khác
[7] Aggarwal C, Han J., Wang J., Yu P. (2004). On-Demand Classification of Data Streams. ACM KDD Conference Khác
[8] Aggarwal C. (2006). On Biased Reservoir Sampling in the presence of stream evolution. VLDB Conference Khác
[9] Chawathe S., Garcia-Molina H. (1997). Meaningful Change Detection in Structured Data. ACM SIGMOD Conference Proceedings Khác
[12] Donjerkovic D., Ioannidis Y E., Ramakrishnan R. (2000). Dynamic His- tograms: Capturing Evolving Data Sets. IEEE ICDE Conference Proceed- ings Khác
[14] Ganti V., Gehrke J., Ramakrishnan R., Loh W.-Y. (1999). A Framework for Measuring Differences in Data Characteristics. ACMPODS Conference Proceedings Khác
[15] Gollapudi S., Sivakumar D. (2004) Framework and Algorithms for Trend Analysis in Massive Temporal Data ACM CIKM Conference Proceedings Khác
[16] Hulten G., Spencer L., Domingos P. (2001). Mining Time Changing Data Streams. ACM KDD Conference Khác
[21] Schweller R., Gupta A., Parsons E., Chen Y. (2004) Reversible Sketches for Efficient and Accurate Change Detection over Network Data Streams.Internet Measurement Conference Proceedings Khác

TỪ KHÓA LIÊN QUAN