... rules defined for Greek, allowing for the complete parse of about 50% of the sentences in a corpus like Europarl (Koehn, 2005), which contains proceedings of the European Parliament. For the remaining ... Extraction Tool MWEs constitute a high proportion of the lexicon of a language, and are crucial for many NLP tasks (Sag et al., 2002). This section introduces the tool w...
Ngày tải lên: 22/02/2014, 02:20
... search for in the error typology, and then search backwards or forwards for error annota- tions of that type. It is possible both to search for specific errors deep in the typology, and to search for ... easy integration of new modules for preprocessing. BLAST has three working modes for handling error annotations: for adding new annotations, for editing existing annota-...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "A Method for Word Sense Disambiguation of Unrestricted Text" potx
... us- ing two sources of information: (1) the Inter- net for gathering statistics for word-word co- occurrences and (2)WordNet for measuring the semantic density for a pair of words. We report ... (W~ NEAR W~(k'))) for all 1 < i < m. Using one of these queries, we get the number of hits for each sense i of W2 and this provides a ranking of the m senses...
Ngày tải lên: 08/03/2014, 06:20
Tài liệu Báo cáo khoa học: "Lexicographic Semirings for Exact Automata Encoding of Sequence Models" pdf
... transition, which re- flects the semantics of the “otherwise” formulation of smoothing (Allauzen et al., 2003). For example, the typical backoff formulation of the probability of a word w given a history ... its use for speech and language processing tasks. We prove that the semiring allows for exact en- coding of backoff models with epsilon tran- sitions. This allows for off...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: A strategy for discovery of cancer glyco-biomarkers in serum using newly developed technologies for glycoproteomics ppt
... microarray system is the best tool for a ‘cell profiler’, and it is expected to be applicable for selection of cancer-specific lectins and for quality control of stem cells before transplantation [12–15]. Recently, ... previously demonstrated the application of this method to the determination of the glycan structure of a form of AFP [10]. However, identification of the det...
Ngày tải lên: 16/02/2014, 08:20
Tài liệu Báo cáo khoa học: "A Framework for Syntactic Translation" docx
... postulated an encoding of the informa- tion in the form of what we called a specifier. The specifier of a sentence represents that sentence as a series of choices within the lim- ited range of choices ... framework of one language. In con- sidering the translation of a certain German verb form into English, it is necessary to un- derstand the German verb form as part o...
Ngày tải lên: 19/02/2014, 19:20
Tài liệu Báo cáo khoa học: "A Method for Measuring Machine Translation Confidence" docx
... follow: P = the number of correctly tagged labels the number of tagged labels R = the number of correctly tagged labels the number of reference labels F = 2*P*R P+R (8) 4.2 Contribution of feature sets We ... showed that most of the participants in the developing countries are ready to introduce qualitative changes in the pattern of their lives for the sake of reducing the...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: A role for the intersubunit disulfides of seminal RNase in the mechanism of its antitumor action docx
... isoforms, isoenzymes, monomeric forms; assay for selective cytotoxicity of the enzyme. Methods Enzymol. 341, 248–263. 13. Kunitz, M. (1946) A spectrophotometric method for the meas- urement of ... antitumor action of BS- RNase, it has been recognized that the dimeric structure of the enzyme is essential for its display of cytotoxic activity [3]. This conclusion was based on th...
Ngày tải lên: 20/02/2014, 11:20
Tài liệu Báo cáo khoa học: "A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurrence" pptx
... Preparation of Error-Patterns: As the threshold value for the frequency of the occurrence, we employed a value of not less than 2, therefore we obtained 629 Error-Pattems using the 4321 results of ... value for the frequency of the occurrence, and 10 as the length of a string, therefore obtaining 16655 strings. 3.2 Two Factors for Evaluation We evaluated the following...
Ngày tải lên: 20/02/2014, 18:20
Tài liệu Báo cáo khoa học: "A SYSTEM FOR TRANSLATING ENGLISH LOCATIVE INTO PREPOSITIONS FROM FRENCH*" pdf
... of an object is composed of a conditional part and a descrip- tive part. The conditional part is a list of properties of the object and of its situation in the sentence. The former kind of ... Herskovits' idea of the ideal meaning of a preposition (Herskovits 1986) and Lakoff's idea of Idealized Cognitive Models (ICM's) (Lakoff 1987). A central part o...
Ngày tải lên: 20/02/2014, 21:20