Scientific report: "Modeling Filled Pauses in Medical Dictations" docx
... Modeling Filled Pauses in Medical Dictations Sergey V. Pakhomov University of Minnesota 190 Klaeber Court 320-16th Ave. S.E. Minneapolis, MN 55455 pakh0002@tc.umn.edu Abstract Filled pauses ... in the CONTROLLED-FP-CORPUS. 4. Models The language modeling process in this study was conducted in two stages. First, a bigram model containing bigram probabilit...
Uploaded: 31/03/2014, 04:20
... tion we are more interested in cross-linguistic structures, similar to the case of using interlingua to represent cross-linguistic information in knowledge-based MT. To obtain structures that ... changing the threshold, we obtain a different number of phrases. The two operators are iteratively applied to the training corpus in alternative steps. This results in hierarchica...
Uploaded: 08/03/2014, 06:20
... constitutes our MCC baseline. Regarding the reranker, we divided the training set in two chunks of data: Train1 and Train2. The binary classifiers are trained on Train1 and tested on Train2 (and vice versa) ... (Child-free) Figure 6: Learning curves of the reranking models using STK in terms of MicroAverage-F1, according to increasing training set (child-free setting). ...
Uploaded: 23/03/2014, 14:20
Scientific report: The hepoxilin connection in the epidermis docx
... ichthyosiform erythroderma (NCIE, in layman’s terms translating as a nonblistering, inherited, scaly red skin). An independent study later extended these findings following the identification of 17 ... isomerase (hepoxilin synthase) The genetic findings concerning 12R-LOX and eLOX3 mutations in ichthyosis are intriguing from the biochemical point of view, partly because the eLOX3 pro-...
Uploaded: 07/03/2014, 09:20
Scientific report: "GENERATING PRECONDITION EXPRESSIONS IN INSTRUCTIONAL TEXT" docx
... proven useful in analyzing various kinds of conditions and circumstances that frequently arise in instructions. The analysis involves addressing two related issues: 1. Determining the range ... here the terminating condition). Finally, they may or may not be combined into a single sentence with the expression of their related action (the issue of clause combining). Tex...
Uploaded: 08/03/2014, 07:20
Scientific report: "HANDLING SYNTACTICAL AMBIGUITY IN MACHINE TRANSLATION" docx
... illustration, here we confine to problems to be met with (i), and, more concretely, to such English strings containing V_inf. These strings are mapped onto Bulgarian strings containing da-construction ... convenient to distinguish two cases: Case A, in which to each syntactically ambiguous string in English corresponds a syntactically ambiguous string in Bulgarian, and Case B,...
Uploaded: 17/03/2014, 19:21
Scientific report: "Measuring Syntactic Difference in British English" docx
... inserted in a path containing a leaf that is a leftmost sibling and a right bracket is inserted in a path containing a leaf that is a rightmost sibling. The bracket is inserted at the highest ... linguistic knowledge of the area being surveyed. These features, while probably lacking in completeness of coverage, certainly allowed a rough comparison of distance in all linguistic domai...
Uploaded: 31/03/2014, 01:20
Scientific report: "Detecting Verbal Participation in Diathesis Alternations" docx
... The minimum description length principle is then used to produce a model and cost for storing the head noun instances from a training corpus at the relevant argument slots. Alternating sub- ... FOOD hyponym of OBJECT. Finding the best set of classes is key to obtaining a good preference model. Abe and Li use MDL to do this. MDL is a principle from information theory (Rissane...
Uploaded: 31/03/2014, 04:20
Document: Scientific report: "Modeling Latent Biographic Attributes in Conversational Genres" pptx
... speakers used for training and 100 speakers used for testing, resulting in a total of 4062 conversation sides for training and 808 conversation sides for testing. 4 Modeling Gender via Ngram ... training data and the ngram-based model was retrained on the remaining subset. Figure 2: Empirical differences in sociolinguistic features for Gender on the Switchboard corpus 6 Incorporating .....
Uploaded: 20/02/2014, 07:20
Scientific report: "Modeling Commonality among Related Classes in Relation Extraction" doc
... using the same feature set. 1 Introduction With the dramatic increase in the amount of textual information available in digital archives and the WWW, there has been growing interest in ... of training examples for each class, since, in this case, a classifier learning approach can always learn a nearly optimal discriminative function for each class against the remaining clas...
Uploaded: 08/03/2014, 02:21