... situation. 4.1 Progressive validation When learning from data streams the standard evaluation methodology where data is split into a separate training and test set is not applicable. An evaluation ... concept drift in data streams and on the evaluation of learning under its constraints. We also show that for evolving issue tracker data, in a large majority of cases SGD Re- gression handily outperforms ... pervasive and serious in real bug report streams. We then address this problem by leveraging state-of-the- art online learning techniques which automati- cally track the evolving data stream and incremen- tally...
Ngày tải lên: 24/03/2014, 03:20
... distributions of estimated fre- quency values for occurring and non-occurring sets. 170 CONTEXTUAL WORD SIMILARITY AND ESTIMATION FROM SPARSE DATA Ido Dagan ATãT Bell Laboratories 600 Mountain ... (Church and Hanks, 1990), machine transla- tion (Brown et al., ; Sadler, 1989), information retrieval (Maarek and Smadja, 1989) and various disambiguation tasks (Dagan et al., 1991; Hindle and ... for theories on generalization and anal- ogy in linguistic data. The literature suggests two major approaches for solving the sparse data problem: smoothing and class based methods. Smoothing...
Ngày tải lên: 08/03/2014, 07:20
Báo cáo khoa học: "A Method for Effective and Scalable Mining of Named Entity Transliterations from Large Comparable Corpora" doc
... effective and scala- ble mining method, called MINT (MIning Named-entity Transliteration equivalents), for mining of NETEs from large comparable corpo- ra. MINT addresses several challenges in mining ... 11 1 11 ,|,|| 1 jajajj A m j nm tstpsaapstP jj Here, j t (and resp. i s ) denotes the j th (and resp. i th ) character in w T (and resp. w S ) and m aA 1 is the hidden alignment between w T and w S where j t is aligned ... families (Hindi from the Indo-Aryan family, Russian from the Slavic fam- ily, Kannada and Tamil from the Dravidian fami- ly). Note that none of the five languages use a common script and hence identification...
Ngày tải lên: 24/03/2014, 03:20
Generating test case from user interface and mining requirements
... Non-Empty Data field, string expected 3 Symbol Empty, Non-Empty Data field, string expected 4 Atomic number Invalid, Valid Data field, data typed > 0 5 Properties Empty, Non-Empty Data field, ... manually by hand. One of the important works in the testing process is the creation of test case. To solve this problem, we will build a tool from the method of generating test case from GUI and requirement ... interface elements and then will automatically generate the parameters and the value of the parameter, and is to appear on the screen test. After receiving the parameters and values of each parameter,...
Ngày tải lên: 12/04/2014, 15:43
Báo cáo khoa học: "Learning Semantic Links from a Corpus of Parallel Temporal and Causal Relations" doc
... selected the best paraphrase of and from the following options: CAUSAL and as a result, and as a consequence, and enabled by that NO-REL and independently, and for similar reasons To build ... (R)ecall and (F1)- measure. The null label is NO-REL. train/test split from Table 1 and the feature sets: Syntactic The syntactic features from Section 4. Semantic The semantic features from Section ... 81.2% on temporal relations and 77.8% on causal re- lations. We trained machine learning mod- els using features derived from WordNet and the Google N-gram corpus, and they out- performed a variety...
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web" pdf
... in- formation from the Web and generates ranked relational terms and surface patterns for each concept pair. ã Dependency Pattern Extractor generates dependency patterns for each concept pair from corresponding ... the number of unique patterns is loose, but many pat- terns are non-discriminative and correlated. A salient challenge and research interest for frequent pattern mining is abstraction away from different surface ... clus- tering approach based on combinations of patterns: dependency patterns from depen- dency analysis of texts in Wikipedia, and surface patterns generated from highly re- dundant information related...
Ngày tải lên: 23/03/2014, 16:21
Researching Design Learning: Issues and Findings from Two Decades of Research and Development pot
... research learning curve, and with our practitioner backgrounds, we explored a rich and scary terrain of methodologies and techniques, led instinctively by our beliefs about capability and learning, ... at any age – and the differences emerge merely in the quality and depth of the outcomes from that activity and the understandings that they demonstrate. This paradigm derives from a philosophy ... allowed 1ẵ hours of activity, including using Researching Design Learning Issues and Findings from Two Decades and of Research and Development RICHARD KIMBELL KAY STABLES Goldsmiths, University...
Ngày tải lên: 24/03/2014, 02:20
Productivity in the Mining Industry: Measurement and Interpretation pptx
... Feasibility analysis ã Mining services companies ã Mining companies ã Mining (Services to Mining) Mine development ã Acquire mining rights ã Construct access roads and infrastructure ... plant and equipment ã Contractors ã Mining companies ã Construction (if contracted) ã Mining (if in-house) Extraction ã Remove deposit from the ground ã Mining companies ã Mining ... involved and do not reflect those of the Productivity Commission. UNDERSTANDING PRODUCTIVITY IN MINING 65 4 Understanding productivity in mining: purchased inputs Key points ã Mining...
Ngày tải lên: 01/04/2014, 00:20
investigative data mining for security and criminal detection 2003
... the data mining analysis. In this chapter we will discuss the closed and open sources of data available both online and offline and how to integrate and prepare the data prior to its analysis. Data ... voicemail, and e-mail. Coupled with data mining techniques, this expanded ability to access multiple and diverse databases will allow the expanded ability to predict crime. Security and risk involving ... potential data sources for enhancing the value of an investigative data mining analysis. Users of data mining tools and techniques from industries in financial services, retailing, marketing, and...
Ngày tải lên: 04/06/2014, 13:16
DATA MINING IN BANKING AND FINANCE: A NOTE FOR BANKERS pdf
... Chun, Se-Hak and Kim, Steven, Data mining or financial prediction and trading: application to single and multiple markets (2003) ã J. M. Zytkow and W. Klửsgen, Handbook of Data Mining and Knowledge ... portfolio risk to market and credit risk Models through data mining 9 Data mining techniques are used to discover hidden knowledge, unknown patterns and new rules from large data sets, which ... macroeconomic and microeconomic variables and this data is available in a variety of disparate formats. Data mining comes in here since it helps discover information and hidden patterns from large data...
Ngày tải lên: 20/06/2014, 14:20
ADVANCES IN DATA MINING KNOWLEDGE DISCOVERY AND APPLICATIONS pot
Ngày tải lên: 28/06/2014, 10:20
Managing and Mining Graph Data part 62 pdf
... Graph Data Mining 601 dustry has generated a wealth of protein-ligand activity data for large com- pound libraries against many biomolecular targets. The data has been system- atically collected and ... biomolecular target’s chemical data analy- sis. In recent years, the trend has been to integrate chemical data with protein and genetic data (bioinformatics data) and analyze the problem over multiple proteins ... 327 Frequent Pattern, 29, 161, 365 Frequent Pattern Mining, 6, 29, 365 Frequent Subgraph Mining, 29, 365, 555 Frequent Subgraph Mining for Bug Localiza- tion, 521 Frequent Subgraphs in Chemical Data, ...
Ngày tải lên: 03/07/2014, 22:21
Managing and Mining Graph Data part 1 pptx
... 1 2. Graph Management and Mining Applications 3 3. Summary 8 References 9 2 Graph Data Management and Mining: A Survey of Algorithms and Applications 13 Charu C. Aggarwal and Haixun Wang 1. Introduction ... 27 3. Graph Mining Algorithms 29 3.1 Pattern Mining in Graphs 29 3.2 Clustering Algorithms for Graph Data 32 3.3 Classification Algorithms for Graph Data 37 3.4 The Dynamics of Time -Evolving Graphs ... AND MINING GRAPH DATA 6. Vector Space Embeddings of Graphs via Graph Matching 235 7. Conclusions 239 References 240 8 A Survey of Algorithms for Keyword Search on Graph Data 249 Haixun Wang and...
Ngày tải lên: 03/07/2014, 22:21