Báo cáo khoa học: "Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty" potx
... Singapore, 2-7 August 2009. c 2009 ACL and AFNLP Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty Yoshimasa Tsuruoka †‡ Jun’ichi Tsujii †‡∗ Sophia ... pro- duce compact and accurate models much more quickly than a state-of-the-art quasi- Newton method for L1-regularized log- linear models. 1 Introduction Log-linear...
Ngày tải lên: 17/03/2014, 01:20
... unlabeled data resource. Our goal is to obtain better perfor- mance than a purely supervised approach without unreasonable computational effort. Unfortunately, although significant recent progress has ... self -training, generative models, semi- supervised support vector machines (S3VM), graph- based algorithms and multi-view algorithms (Zhu, 2005). Self -training is a commonly used techniqu...
Ngày tải lên: 08/03/2014, 01:20
... words) annotated with syntactic in- formation. We use the standard divisions: Sec- tions 2 through 21 are used for training, section 24 for held-out development, and section 23 for final testing. 3.3 ... NANC sentences to WSJ training data on parsing performance. f-scores for the parser with and without the WSJ reranker are shown when evaluating on BROWN develop- ment. For thi...
Ngày tải lên: 31/03/2014, 01:20
Báo cáo khoa học: "Creating a Gold Standard for Sentence Clustering in Multi-Document Summarization" potx
... e.g. information ex- traction through clustering or summary generation (using for example language regeneration) is re- sponsible for the lack of quality. However there is no gold standard for sentence clustering ... cluster label for a sentence. In this approach only sen- tences within the same document and even within the same paragraph are clustered together whereas our approach is...
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "A Word-Order Database for Testing Computational Models of Language Acquisition" docx
... simulations on two variants of the TLA – one with the Greediness heuristic but without the SVC (TLA minus SVC, TLA–SVC) and one with the SVC but without Greediness (TLA minus Greediness, TLA–Greed). ... Likewise the patterns, without their derivations, could be used as input to statistical/connectionist models which eschew traditional (generative) structure altogether and search...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "AN ISLAND PARSING INTERPRETER FOR THE FULL AUGMENTED TRANSITION NETWORK FORMALISM" potx
... and some suggestions for future directions for island parsing research. I INTRODUCTION A. Island Parsing In an ordinary ATN parser, the parsing of a sentence is performed unidirectionally ... solid inputs from the acoustic anal yser. The main problems with previous implementations of island parsing for ATNs have been with scope clauses and LIFTR and SENDR actions; essential...
Ngày tải lên: 09/03/2014, 01:20
Báo cáo khoa học: "Phrase-based Statistical Language Generation using Graphical Models and Active Learning" potx
... area inform inform inform inform inform inform inform inform t = 1 t = 2 t = 3 t = 4 t = 5 t = 6 t = 7 t = 8 Table 1: Example semantic stacks aligned with an utterance for the dialogue act inform(name(Charlie ... riverside inform(area) area inform(area) area inform that inform inform EMPTY serves inform(food) food inform French inform(food(French)) French inform(food) food inform(food) fo...
Ngày tải lên: 17/03/2014, 00:20
Báo cáo khoa học: "TotalRecall: A Bilingual Concordance for Computer Assisted Translation and Language Learning" potx
... and the information on phrase and word level alignment. With that ad- ditional information, TotalRecall provides various functions, including 1. viewing of the full text of the source with a ... differ- ent themes recur within an article or a collection of articles. Concordances have been indispensable for lexicographers and increasingly considered useful for language instructor...
Ngày tải lên: 17/03/2014, 06:20
Tài liệu Báo cáo khoa học: "Online Large-Margin Training of Dependency Parsers" docx
... method for training dependency parsers. We use simple linear parsing models trained with margin-sensitive online training algorithms, achieving state-of-the-art performance with relatively modest training ... train models with k = 1, 2, 5, 10, 20 for the English data set. Even though the parsing algorithm is propor- tional to O(k log k), empirically, the training times s...
Ngày tải lên: 20/02/2014, 15:20
Báo cáo khoa học: "Simple semi-supervised training of part-of-speech taggers" pptx
... self -training. Finally, we list results for a technique called co-forests (Li and Zhou, 2007), which is a recent alternative to tri -training presented by the same authors, and for tri -training with ... 1993) with the standard split: Sect. 0–18 is used for training, Sect. 19–21 for development, and Sect. 22–24 for testing. Since we need to train our classifiers on material...
Ngày tải lên: 07/03/2014, 22:20