Báo cáo khoa học: "VARIOUS REPRESENTATIONS OF TEXT PROPOSED FOR EUROTRA" docx

Báo cáo khoa học: "VARIOUS REPRESENTATIONS OF TEXT PROPOSED FOR EUROTRA" docx

Báo cáo khoa học: "VARIOUS REPRESENTATIONS OF TEXT PROPOSED FOR EUROTRA" docx

... next two are of a linguistic nature), each of which is the object of different information data, stored in the table of formats : - recognition of the format : features of formats must be ... using various po~E forms. In the context of M(A)T, th~ advantages of taking into account the structure of the text are twofold : - the text can be decomposed if only part o...

Ngày tải lên: 01/04/2014, 00:20

6 181 0
Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

... LM learns the probabilities of word sequences from text corpora available for training. The perfor- mance of the model depends on the amount and style of the text. The more text there is, the better the ... in-domain text and the the filtered web text. If the amount of web text is very large, only a subset is used, which consists of the parts of the web data that are th...

Ngày tải lên: 22/02/2014, 02:20

9 301 0
Báo cáo khoa học: " The Development of Lexical Resources for Information Extraction from Text Combining Word Net and Dewey Decimal Classification" potx

Báo cáo khoa học: " The Development of Lexical Resources for Information Extraction from Text Combining Word Net and Dewey Decimal Classification" potx

... is one of the main bot- tlenecks in the development of new ap- plications in the field of Information Ex- traction from text. Generic resources (e.g., lexical databases) are promising for reducing ... Proceedings of EACL '99 The Development of Lexical Resources for Information Extraction from Text Combining WordNet and Dewey Decimal Classification* ... of the s...

Ngày tải lên: 08/03/2014, 21:20

4 436 0
Báo cáo khoa học: "BULK PROCESSING OF TEXT ON A MASSIVELY PARALLEL COMPUTER" docx

Báo cáo khoa học: "BULK PROCESSING OF TEXT ON A MASSIVELY PARALLEL COMPUTER" docx

... size of the text approaches many tens of thousands of words, the number of unique words in- creased into the thousands. Therefore, it can be con- cluded that the second implementation of the ... amount of useful unformatted text in computer readable form. Parallel computers and algorithms provide one way of dealing with this explosion. 2 The CM: Machine Description Th...

Ngày tải lên: 24/03/2014, 02:20

8 306 0
Báo cáo khoa học: "Automatic Detection of Text Genre" doc

Báo cáo khoa học: "Automatic Detection of Text Genre" doc

... the performance is for each of these binary machines, and for the sake of comparison this information is also given for some of the neural net models. Ta- ble 2 shows how often each of the ... letters to the editor will be of roughly equal in- terest. For other purposes we will want to stress narrativity, for example in looking for accounts of the storming of...

Ngày tải lên: 31/03/2014, 21:20

7 277 0
Tài liệu Báo cáo khoa học: "The Nature of Affixing in Written English" docx

Tài liệu Báo cáo khoa học: "The Nature of Affixing in Written English" docx

... obvious algorithms needed for most of the words and put this together with a list of irregular forms for a working procedure, except for the presence of a number of verbs where it is necessary ... and final consonant strings of the CVCVC forms turn out to be similar to sets found for the CVC forms. How- ever, the internal consonant strings of the cvcvc forms includ...

Ngày tải lên: 19/02/2014, 19:20

6 602 0
Tài liệu Báo cáo khoa học: "Structural Definition of Affixes from Multisyllable Words" docx

Tài liệu Báo cáo khoa học: "Structural Definition of Affixes from Multisyllable Words" docx

... English," an algorithm for the structural definition of affixes was developed and applied to data consisting of all the words of the form CVCVC in the Shorter Oxford Dictionary. Fourteen ... suffixes, in order to increase the reliability of the definition by reducing the probability of postulating a break before (for prefixes) or after (for suffixes) C 2 or C 3 where...

Ngày tải lên: 19/02/2014, 19:20

4 508 0
Báo cáo khóa học: Collectins Players of the innate immune system docx

Báo cáo khóa học: Collectins Players of the innate immune system docx

... glucuronoxylomannan were identified as ligands for SP-D [190]. Binding of SP-D to C. neoformans leads to a massive aggregation of acapsular but not of encapsulated C. neoformans. Moreover, secreted glucoron- oxylomannan ... K.B.M. (1990) Binding of the pentamer/hexamer forms of mannan- binding protein to zymosan activates the proenzyme C1r2C1s2 complex, of the classical pathway...

Ngày tải lên: 07/03/2014, 15:20

21 353 0
Báo cáo khoa học: "The Use of Statistics in Language Research" docx

Báo cáo khoa học: "The Use of Statistics in Language Research" docx

... indica- tors of the validity of a proposed solution. In my view there is no single solution of a for- eign text. Some 15 years experience as a trans- lation editor, translator (both of scientific ... "Proposals for the mechanical resolution of German syntax patterns," Modern Language Forum, vol. 36, no. 3-4. of the subject-literature." This major effort...

Ngày tải lên: 07/03/2014, 18:20

7 462 0
Báo cáo khoa học: "Discriminative Modeling of Extraction Sets for Machine Translation" pptx

Báo cáo khoa học: "Discriminative Modeling of Extraction Sets for Machine Translation" pptx

... Conference of the Associa- tion for Computational Linguistics. John DeNero and Dan Klein. 2008. The complexity of phrase alignment problems. In Proceedings of the Annual Conference of the Association for ... Cherry and Dekang Lin. 2006. Soft syntactic constraints for word alignment through discrimina- tive training. In Proceedings of the Annual Confer- ence of the Associatio...

Ngày tải lên: 07/03/2014, 22:20

11 420 0
Từ khóa:
w