... 1998; Tufis and Barbu, 2002). However, all of these methods require a large-scale bilingual corpus for training. When a large-scale bilingual corpus is not available, some researchers use ... to improve word alignments for general words and the corpus in the specific domain for domain-specific words. In other words, we will adapt the word alignment information in the gener...
Uploaded: 17/03/2014, 06:20
Scientific report: "Leveraging Reusability: Cost-effective Lexical Acquisition for Large-scale Ontology Translation"
... Association for Computational Linguistics Leveraging Reusability: Cost-effective Lexical Acquisition for Large-scale Ontology Translation G. Craig Murray, Bonnie J. Dorr, Jimmy Lin Institute for ... of great value for access to segments of video, may be important for organizing other concepts and for browsing the hierarchy. These factors must be balanced in dev...
Uploaded: 08/03/2014, 02:21
... lexicon (foreground lexicon) for IE applications by using both a small corpus and WordNet. 2 Developing IE Lexical Resources Lexical information in IE can be divided into three sources of information ... the entries and the ontology. Generic dictionaries can contribute to identifying entries for the FL, but generally do not provide useful information for the mapping with the...
Uploaded: 08/03/2014, 21:20
Scientific report: "DATR AS A LEXICAL COMPONENT FOR PATR"
... of information in lexical entries. 2 REPRESENTATION OF LEXICAL INFORMATION The formalism of PATR offers two possible means of representing lexical information. First of all, the information ... be defined. For example, the lexical entry for frog would be FROG:<>==NOUN, where the noun-specific information is inherited from NOUN. This third approach forms the bas...
Uploaded: 18/03/2014, 02:20
Scientific report document: "A Syntax-Driven Bracketing Model for Phrase-Based Translation"
... then 8: Update bracketing instances for index j 9: end if 10: end if 11: end for 12: for each j ∈ c do 13: := ∪ {bracketing instances from j} 14: end for 15: Output: bracketing instances ... analysis show that the new model outperforms other previous methods and achieves a substantial improvement over the baseline which is not syntactically informed. 1 Introduction The phrase-ba...
Uploaded: 20/02/2014, 07:20
Scientific report document: "Combination of Arabic Preprocessing Schemes for Statistical Machine Translation"
... 2006. © 2006 Association for Computational Linguistics Combination of Arabic Preprocessing Schemes for Statistical Machine Translation Fatiha Sadat Institute for Information Technology National ... morphological preprocessing for SMT: deeper morph analysis helps for small data sets, but the effect is diminished with more data. One interesting observation is that for our best...
Uploaded: 20/02/2014, 11:21
Scientific report document: "TOWARDS A DICTIONARY SUPPORT ENVIRONMENT FOR REAL TIME PARSING"
... according to the Merriam-Webster codes for subject matter (see Walker & Amsler (1983) for a suggested use for these). The large amount of semi-formalised information concerning the interpretation ... fields except those providing cross-reference and usage information for complete homographs. Figure 2 illustrates a simple lexical entry before and after the application of...
Uploaded: 22/02/2014, 09:20
Scientific report: Wheat germ cell-free platform for eukaryotic protein production
... cell-free platform is used to screen constructs for the expression of soluble protein, to produce [15N]-labeled protein for NMR screening for suitability as a structural candidate, and for the ... most effective routes for improvement involve a combination of bioinformatics and small-scale screening. Bioinformatics relies on prior information and mathematical models for co...
Uploaded: 07/03/2014, 12:20
Scientific report: "Discriminative Feature-Tied Mixture Modeling for Statistical Machine Translation"
... Association for Computational Linguistics: short papers, pages 424–428, Portland, Oregon, June 19-24, 2011. © 2011 Association for Computational Linguistics Discriminative Feature-Tied Mixture Modeling for ... feature types, word alignments, or domains, for various applications. The proposed approach improves the translation performance significantly on a large-scale Arabic-to-Engli...
Uploaded: 17/03/2014, 00:20
Scientific report: "On-line Language Model Biasing for Statistical Machine Translation"
... Discussion and Future Work Existing methods for target LM biasing for SMT rely on information retrieval to select a comparable subset from the training corpus. A foreground LM estimated from this subset ... improves SMT performance, none of the techniques has thus far been shown to be feasible for on-line systems. In this paper, we develop a novel measure of cross-lingual similari...
Uploaded: 17/03/2014, 00:20