... was set to. maxlinks: Used in the Competitive Linking algorithm: maximum number of words any word can be aligned with. Set to: 1, 2, 3. minscore: Used in the Competitive Linking algorithm: minimum score ... is 2, then a word can align to 0, 1, or 2 words in the parallel sentence. In other settings, we enforced a minimum score in the bilingual dictionary for a link to be accepted,...
Uploaded: 08/03/2014, 07:20
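The role of the two parameters in the excerpt above can be illustrated with a minimal sketch of greedy competitive linking. This is not the paper's implementation: the function name, the `scores` dictionary of candidate word pairs, and the reading of `minscore` as a simple threshold on the pair score are assumptions made for illustration only.

```python
from collections import Counter


def competitive_linking(scores, maxlinks=1, minscore=0.0):
    """Greedy competitive linking (illustrative sketch, not the paper's code).

    scores   -- hypothetical dict: (source_index, target_index) -> association score
    maxlinks -- maximum number of links any single word may take part in
    minscore -- assumed threshold: pairs scoring below it are never linked
    """
    links = []
    source_links = Counter()  # links already used by each source word
    target_links = Counter()  # links already used by each target word

    # Visit candidate pairs from the highest score to the lowest.
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    for (src, tgt), score in ranked:
        if score < minscore:
            break  # every remaining pair scores lower still
        if source_links[src] < maxlinks and target_links[tgt] < maxlinks:
            links.append((src, tgt))
            source_links[src] += 1
            target_links[tgt] += 1
    return links


example_scores = {(0, 0): 0.9, (0, 1): 0.8, (1, 1): 0.7, (1, 0): 0.2}
one_to_one = competitive_linking(example_scores, maxlinks=1, minscore=0.5)
up_to_two = competitive_linking(example_scores, maxlinks=2, minscore=0.5)
```

With `maxlinks=2`, a word ends up with 0, 1, or 2 links, which matches the behaviour the excerpt describes. Words whose candidate pairs all fall below `minscore` stay unaligned.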
... effective, while being simpler to instrument: since we use information from the search engine only during training, we can train a stand-alone POS tagger that can be run without access to additional resources. ... query training set. The simplest way to match query tokens to snippet tokens is to allow a query token to match any snippet token. This can be problematic when we have...
Uploaded: 16/03/2014, 20:20
Document: Scientific report: "Using Multiple Sources to Construct a Sentiment Sensitive Thesaurus for Cross-Domain Sentiment Classification" doc
... as domain independent pivots and enable us to transfer the information regarding sentiment from one domain to another. Using the extended vectors d to represent reviews, we train a binary ... domain features are related to which target domain features. Second, it requires a learning framework to incorporate the information regarding the relatedness of source and targe...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "Using adaptor grammars to identify synergies in the unsupervised acquisition of linguistic structure" docx
... grammar, in which the subtrees are specified in advance, in an adaptor grammar the subtrees, as well as their probabilities, are learnt from the training data. In order to make parsing and inference tractable ... properties of the adaptor grammar inference procedure is that it gives us a way of learning these interacting linguistic structures simultaneously. Adaptor grammars are al...
Uploaded: 20/02/2014, 09:20
Document: Scientific report: "Using linguistic principles to recover empty categories" ppt
... try to insert *U* in X
7 try to insert a VP ellipsis site in X
8 try to insert S*T* or SBAR in X
9 try to insert trace of topicalized XP in X
10 try to insert trace of extraposition in X
... applying a different set of rules.
1 for each tree, iterate over nodes from top down
2 for each node X
3 try to insert NP* in X
4 try to insert 0 in X
5 try to ins...
Uploaded: 20/02/2014, 16:20
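The control flow in the excerpt above (visit the nodes of each tree from the top down and try an ordered list of insertion rules at every node) can be sketched as follows. Only the traversal and rule ordering come from the excerpt. The `Node` class, the function names, and the single toy rule are hypothetical, and the toy rule's condition is not the linguistic condition used in the paper.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Node:
    label: str
    children: List["Node"] = field(default_factory=list)


def top_down(node):
    """Yield a node before its children (pre-order, i.e. top-down)."""
    yield node
    for child in node.children:
        yield from top_down(child)


def apply_rules(tree: Node, rules: List[Callable[[Node], bool]]) -> int:
    """Try every rule, in order, at every node; return the number of insertions."""
    inserted = 0
    # Snapshot the nodes first so that newly inserted empty nodes are not revisited.
    for node in list(top_down(tree)):
        for rule in rules:
            if rule(node):
                inserted += 1
    return inserted


def toy_insert_np_star(node: Node) -> bool:
    """Toy stand-in rule (assumption): give an S without an NP child an NP* child."""
    if node.label == "S" and not any(c.label.startswith("NP") for c in node.children):
        node.children.insert(0, Node("NP*"))
        return True
    return False


tree = Node("S", [Node("VP", [Node("V")])])
count = apply_rules(tree, [toy_insert_np_star])
labels = [child.label for child in tree.children]
```

Because the rules are passed as an ordered list, changing which empty category is tried first only requires reordering that list.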
Document: Scientific report: "USING BRACKETED PARSES TO EVALUATE A GRAMMAR CHECKING APPLICATION" ppt
... selecting fronted parse trees sometimes leads to false error critiques, it works well for most cases in our domain. BRACKETED INPUT STRINGS In order to coerce our system into accepting only ... we intend to change our current system to improve deficiencies and lack of coverage revealed by this exercise. In effect, we plan to use the current test corpus as a traini...
Uploaded: 20/02/2014, 21:20
Scientific report: "Using Machine-Learning to Assign Function Labels to Parser Output for Spanish" ppt
... significant 3.4% improvement over the baseline. 1 Introduction The research presented in this paper forms part of an ongoing effort to develop methods to induce wide-coverage multilingual Lexical-Functional ... grup.nom.ms (masculine singular), grup.nom.fs (feminine singular), grup.nom.mp (masculine plural) etc. This number and gender information is already encoded in the POS tag...
Uploaded: 08/03/2014, 02:21
Scientific report: "Using Grammatical Relations to Compare Parsers" pptx
... parsers. In addition, we perform an experiment using the extracted GRs as input to the Lappin and Leass (1994) anaphora resolution algorithm. This produces a second ranking of the parsers, and we investigate ... initial evaluation is provided by comparing the extracted GRs to a gold standard GR annotation of 500 Susanne sentences due to Carroll et al. To gain insight into the...
Uploaded: 08/03/2014, 21:20
Document: Scientific report: "Phrase-Based Backoff Models for Machine Translation of Highly Inflected Languages" docx
... Models for Machine Translation of Highly Inflected Languages Mei Yang Department of Electrical Engineering University of Washington Seattle, WA, USA yangmei@ee.washington.edu Katrin Kirchhoff Department ... Engineering University of Washington Seattle, WA, USA katrin@ee.washington.edu Abstract We propose a backoff model for phrase-based machine translation that translates unseen word for...
Uploaded: 22/02/2014, 02:20
Scientific report: Novel repressor of the human FMR1 gene - identification of p56 human (GCC)n-binding protein as a Krüppel-like transcription factor ZF5 ppt
... (see above). Together, these findings indicate that p56 and p68 are DNA-binding proteins that interact specifically with composite cis-acting element of rpL32. Identification of rpL32-binding protein p56 ... substrate to remove contaminant nonspecific DNA-binding proteins. The unbound proteins were considered to be the final fraction enriched with the rpL32 fragment (-24..+11) binding protei...
Uploaded: 07/03/2014, 05:20