... make full use of the available training data, we propose a semi-supervised learning algorithm that exploits a form of entropy regularization on the unlabeled data. Specifically, for a semi-supervised ... practical for large data sets. In this paper, we propose a new semi-supervised training method for conditional random fields (CRFs) that incorporates both labeled and unlabeled sequence data ... same notation as (Lafferty et al. 2001). Let X be a random variable over data sequences to be labeled, and Y be a random variable over corresponding label sequences. All components, Y_i, of Y are assumed...
Uploaded: 17/03/2014, 04:20
... 2006. © 2006 Association for Computational Linguistics Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition Daisuke Okanohara† Yusuke Miyao† Yoshimasa Tsuruoka ... a feature. All the feature functions are real-valued and can use adjacent label information. Semi-CRFs are actually a restricted version of order-L CRFs in which all the labels in a chunk are the ... implausible phrase candidates are removed beforehand. We construct a binary naive Bayes classifier using the same training data as those for semi-CRFs. In training and inference, we enumerate all possible...
Uploaded: 20/02/2014, 12:20
Document: Scientific report: "Word representations: A simple and general method for semi-supervised learning" doc
... general conditional models.) One of the advantages of the semi-supervised learning approach that we use is that it is simpler and more general than that of Ando and Zhang (2005) and Suzuki and ... generalization accuracy. Semi-supervised models such as Ando and Zhang (2005), Suzuki and Isozaki (2008), and Suzuki et al. (2009) achieve state-of-the-art accuracy. However, these approaches ... last word of each n-gram. • We had a separate learning rate for the embeddings and for the neural network weights. We found that the embeddings should have a learning rate generally 1000–32000...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "Generalized Expectation Criteria for Semi-Supervised Learning of Conditional Random Fields" pdf
... unlabeled data. Traditional approaches to semi-supervised learning are applied to cases in which there is a small amount of fully labeled data and a much larger amount of unlabeled data, ... test the advantages of this annotation paradigm. To simulate a human labeler, we randomly sample (without replacement) tokens with the particular feature in question, and generate a label using the ... Smith and J. Eisner. 2005. Contrastive estimation: Training log-linear models on unlabeled data. In ACL. Martin Szummer and Tommi Jaakkola. 2002. Partially labeled classification with Markov random...
Uploaded: 20/02/2014, 09:20
Which interest rate scenario is the worst one for a bank? Evidence from a tracking bank approach for German savings and cooperative banks potx
... cross-checking (Volker Wieland); 21 2007: Corporate marginal tax rate, tax loss carryforwards and investment functions – empirical analysis using a large German panel data set (Fred Ramb) ... Welfare effects of financial integration (Fecht, Grüner, Hartmann); 12 2007: The marketability of bank assets and managerial rents: implications for financial stability (Falko Fecht, Wolf Wagner) ... [Figure: change in interest rates (percentage points) by maturity (months)] ... 11 2007: Exchange rate dynamics in a target zone - a heterogeneous expectations approach (Christian Bauer, Paul De Grauwe,...
Uploaded: 22/03/2014, 23:20
Scientific report: "A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction" pptx
... Conclusions This paper presented a novel graph-based projection approach for relation extraction. Our approach performed a label propagation algorithm on a proposed graph that represented the instance and ... graph-based projection approach for relation extraction. 3 Graph Construction The most crucial factor in the success of graph-based learning approaches is how to construct a graph that is appropriate for the target ... target language L_t. To accomplish that goal, the method automatically creates a set of annotated text for f_t, utilizing a well-made extractor f_s for a resource-rich source language L_s and...
Uploaded: 23/03/2014, 14:20
Document: Scientific report: A systems biology approach for the analysis of carbohydrate dynamics during acclimation to low temperature in Arabidopsis thaliana doc
... acclimation in A. thaliana FEBS Journal 278 (2011) 506–518 © 2010 The Authors Journal compilation © 2010 FEBS A systems biology approach for the analysis of carbohydrate dynamics during acclimation ... Whereas many tropical and subtropical species have only limited capacities to cope with low temperature, plants from temperate climates, such as Arabidopsis thaliana, grow at low temperature and can ... metabolic pathways other than carbohydrate pathways was calculated as the difference between net carbon uptake and changes in cellular carbohydrate content. The resulting surplus of carbon equivalents...
Uploaded: 14/02/2014, 22:20
Scientific report: "A Bio-inspired Approach for Multi-Word Expression Extraction" doc
... natural language are also observed. For example, we improve MWE mining result by EBP detection. Comparisons with variant n-gram approaches, which are the leading approaches, are performed for ... extraction, LCS approach is applied with great efficiency and performance guarantee. Experimental results show that LCS-based approach achieves better results than n-gram. 1 Introduction Language ... extraction of MWE plays an important role in several areas, such as machine translation (Pascale, 1997), information extraction (Kalliopi, 2000) etc. On the other hand, there is also a need for...
Uploaded: 08/03/2014, 02:21
Scientific report: "A Hybrid Relational Approach for WSD – First Results" ppt
... give-cake big-cake … Figure 1. Attribute-value vector for syntactic relations Given that some types of information are not available for certain instances, many attributes will have null values. ... a few are designed for specific applications, such as MT. Existing multilingual approaches can be classified as (a) knowledge-based approaches, which make use of linguistic knowledge manually ... only classified a few examples (from 1 to 6). In Table 2 we show the accuracy of the theory learned for each verb, as well as accuracy achieved by two propositional machine learning algorithms...
Uploaded: 23/03/2014, 18:20
Scientific report: "A GENERATIVE GRAMMAR APPROACH FOR THE MORPHOLOGIC AND MORPHOSYNTACTIC ANALYSIS OF ITALIAN" ppt
... dv2. andarâ v.intran.simple I The other sets of data are contained in the Prolog workspace and are structured as tables of a relational data base. The set of the classes of endings is a table ... The form is passive, as "chiamare" (to call) is a transitive verb (the auxiliary verb for the active form is to have). In this case morphosyntactic analysis has solved an ambiguity: ... lemma | stem | ending class | synt. categ. | label: matte | matt | da_bello | adj.qualific. | 1; mattino | mattin | dn_oggetto | noun.common | 3; di | di | | prep.simple | 2; andare | vad | dv1_andare | v.intran.simple | 1; andare | and...
Uploaded: 24/03/2014, 05:21
Scientific report: "Active Learning-Based Elicitation for Semi-Supervised Word Alignment" pptx
... Engineering, Testing, and Quality Assurance for Natural Language Processing, pages 49–57, Columbus, Ohio, June. Association for Computational Linguistics. Gholamreza Haffari and Anoop Sarkar. 2009. Active learning ... to 26.57 AER. The translation accuracy as measured by BLEU (Papineni et al., 2002) and METEOR (Lavie and Agarwal, 2007) also shows improvement over baseline and approaches gold standard quality. ... used. We also train a configuration using gold standard manual alignment data for the parallel corpus. This is the maximum translation accuracy that we can achieve by any link selection algorithm....
Uploaded: 30/03/2014, 21:20
Scientific report: "Typed Graph Models for Semi-Supervised Learning of Name Ethnicity" pptx
... network analysis, New York, NY, USA. ACM. Partha Pratim Talukdar, Joseph Reisinger, Marius Paşca, Deepak Ravichandran, Rahul Bhagat, and Fernando Pereira. 2008. Weakly-supervised acquisition of labeled ... task. References Shumeet Baluja, Rohan Seth, D. Sivakumar, Yushi Jing, Jay Yagnik, Shankar Kumar, Deepak Ravichandran, and Mohamed Aly. 2008. Video suggestion and discovery for YouTube: taking random walks through ... Hausa-Fulani names. Further, Hausa-Fulani names are predominantly Arabic or Arabic derivatives and stand out from the rest of the ethnic groups, making their detection easier. ... will be made available...
Uploaded: 30/03/2014, 21:20
Chemistry report: "A utility-based approach for secondary spectrum sharing" pot
... we focus on the case in which there is at least one channel available for each node. We shall also assume that each node can transmit on one channel at a time. For each node i ∈ V and channel c ∈ C we ... Leith-Clifford’s performance deteriorates for decreasing channel availability compared with the gradient ascent-based methods which again display close performance between each other. Gibbs algorithm underperforms ... transmitters and receivers. This results in the operational constraint that two such sessions cannot actively use a narrowband channel at the same instant. Dashouk and Alanyali EURASIP Journal...
Uploaded: 21/06/2014, 02:20
Chemistry report: "Research Article: A Machine Learning Approach for Locating Acoustic Emission" pdf
... performance. We note that the SVMs trained with 256-dimensional raw AE data had quite poor performance, where the AUC was 0.39 and 0.31 for datasets SR1 and SR2. We also examined the performance ... crucial patterns from the total AE data, as well as particular P-wave arrivals, may provide clues for distinguishing between real events and extraneous signals, thus improving the spatial accuracy ... Processing [Figure, panel (a): TP rate versus FP rate, both axes from 0 to 1; legend: Mel subbands AR Spectrum variance Wavelet packets Raw data] ...
Uploaded: 21/06/2014, 08:20
Chemistry report: "Research Article: A Rules-Based Approach for Configuring Chains of Classifiers in Real-Time Stream Mining Systems Brian Foo and Mihaela van der Schaar" pot
... Consequently, every parameter in (5) can be easily estimated based on some locally observable data. By exchanging these locally obtained parameters and configurations across all classifiers, each classifier can then ... than the small rule space. Finally, the large/distributed rule space provided the best performance as well as the lowest average delay and delay variance. As indicated by Table 4, each classifier ... costs. 3. Background on Binary Classifier Chains 3.1. Characterizing Binary Classifiers and Classifier Chains. A binary classifier partitions input data objects into two classes, a “yes” class H and a “no”...
Uploaded: 21/06/2014, 19:20
Chemistry report: "Research Article: A Fault Diagnosis Approach for Gears Based on IMF AR Model and SVM" pot
Uploaded: 21/06/2014, 22:20
Chemistry report: "Research Article: A Cross-Layer Approach for Maximizing Visual Entropy Using Closed-Loop Downlink MIMO" docx
Uploaded: 21/06/2014, 22:20
Chemistry report: "Research Article: Rate-Distortion Optimization for Stereoscopic Video Streaming with Unequal Error Protection" ppt
Uploaded: 21/06/2014, 22:20
Chemistry report: "Research Article: A Statistical Multiresolution Approach for Face Recognition Using Structural Hidden Markov Models" pptx
Uploaded: 22/06/2014, 06:20