Báo cáo khoa học: "An Online System for Corpus Management and Analysis in Support of Computing in the Humanities" pot
... aspect of pure resource management, processing and analysis of docu- ments have traditionally been the domain of desk- top applications. Sometimes even to the point of command line tools. Therefore ... Frankfurt am Main, 2 Universit ¨ at Bielefeld Abstract This paper introduces eHumanities Desk- top- an online system for corpus manage- ment and analysis in supp...
Ngày tải lên: 31/03/2014, 20:20
... the classifi- cation results of our system and compare them to the performance of a trained ma- chine learner in a series of in- and cross- domain experiments. 1 Introduction The recognition of ... used and the architecture of our system. In Sec- tion 5, we provide an evaluation of the system out- put and compare the results with those of a series o...
Ngày tải lên: 23/03/2014, 19:20
... consists of all of the entries from the SCL and the SEL, as well as all of the features for each entry. At this point an initial "window" on the training set is chosen. Since the inference ... t~ next word in the input. ~. Traininm Mode When UTTER is operating in training mode, the system allows the user to correct errors in transcription i...
Ngày tải lên: 24/03/2014, 05:21
Tài liệu Báo cáo khoa học: "An Unsupervised Model for Joint Phrase Alignment and Extraction" ppt
... Computational Linguistics, pages 25–28. John DeNero and Dan Klein. 2010. Discriminative mod- eling of extraction setsfor machine translation. In Pro- ceedings of the 48th AnnualMeeting of the Association for ... Ph.D. thesis, Massachusetts Institute of Tech- nology. John DeNero and Dan Klein. 2008. The complexity of phrase alignment problems. In Proceedings of the...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "An Ensemble Method for Selection of High Quality Parses" pdf
... both the generative parsing model number 2 of Collins (1999) and the reranking parser of Charniak and Johnson (2005), both when the training and test data belong to the same domain (the in- domain ... the way f-score is ordinarily calculated, by computing the labeled precision and recall of the constituents in the whole set and using these as the argum...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "An Approximate Approach for Training Polynomial Kernel SVMs in Linear Time" doc
... vector in D-dimension space of the i-th example, and y i is the label of xi either positive or negative. The training of SVMs involves in minimize the following object (primal form, soft-margin) ... trade off training error and mar- gin. A small value for C will increase the number of training errors. To determine the class (+1 or -1) of an example x can...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "An Improved Parser for Data-Oriented Lexical-Functional Analysis" doc
... respect to the corpus& quot;, thus increasing the robustness of the model. 2.3 The composition operation In LFG-DOP the operation for combining fragments is carried out in two steps. First the c- structures ... using initially the indexed subtrees only. Thus only the Category-matching condition is enforced during the chart-parsing process. The Uniqueness and Coh...
Ngày tải lên: 20/02/2014, 18:20
Tài liệu Báo cáo khoa học: "An Integrated Environment for Computational Linguistics Experimentation" pot
... for theoretical models, and the accuracy of these models can be evalu- ated either with regard to their ability to account for the reality of a given corpus (pursuing descrip- tive aims), either ... models and methodological concerns. Finally, when other platforms usually enforce the use of a dedicated document format, LinguaStream is able to process any XML document. On...
Ngày tải lên: 22/02/2014, 02:20
Tài liệu Báo cáo khoa học: "An expressive formalism for describing tree-based grammars" docx
... con- sists of tree description and/ or of semantic formu- las. The XMG formalism furthermore supports the sharing of identifiers across dimension hence al- lowing for a straightforward encoding of the ... semantics The XMG formalism further supports the integration in the grammar of semantic information. More generally, the lan- guage manages dimensions of descri...
Ngày tải lên: 22/02/2014, 02:20
Tài liệu Báo cáo khoa học: "An annotation scheme for discourse-level argumentation in research articles" doc
... amount of train- ing, including the reading of coding instructions for the two versions of the scheme (6 pages for the basic scheme and 17 pages for the full scheme), four training papers and ... determining BASIS and CONTRAST. This might have to do with the loca- tion of those types of sentences in the paper: AIM and TEXTUAL are usually found at...
Ngày tải lên: 22/02/2014, 03:20