Báo cáo khoa học: "Four Techniques for Online Handling of Out-of-Vocabulary Words in Arabic-English Statistical Machine Translation" docx

Báo cáo khoa học: "Four Techniques for Online Handling of Out-of-Vocabulary Words in Arabic-English Statistical Machine Translation" docx

Báo cáo khoa học: "Four Techniques for Online Handling of Out-of-Vocabulary Words in Arabic-English Statistical Machine Translation" docx

... University habash@ccls.columbia.edu Abstract We present four techniques for online han- dling of Out -of- Vocabulary words in Phrase- based Statistical Machine Translation. The techniques use spelling expansion, morpho- logical ... four techniques for online handling of Out -of- Vocabulary (OOV) words in phrase-based Statistical Machine Translation (SMT). 1...

Ngày tải lên: 31/03/2014, 00:20

4 504 0
Báo cáo khoa học: "Offline Strategies for Online Question Answering: Answering Questions Before They Are Asked" ppt

Báo cáo khoa học: "Offline Strategies for Online Question Answering: Answering Questions Before They Are Asked" ppt

... Integer Frequency of concept head in CN/PN Integer Frequency of concept head in APOS Instance Features Integer Number of lexical items in instance Binary Instance contains honorific Binary ... Binary Instance contains common name Binary Instance ends in honorific Binary Instance ends in common name Binary Instance ends in determiner Case Features Integer Instan...

Ngày tải lên: 08/03/2014, 04:22

7 431 0
Báo cáo khoa học: "Dialect Classification for online podcasts fusing Acoustic and Language based Structural and Semantic Information" pot

Báo cáo khoa học: "Dialect Classification for online podcasts fusing Acoustic and Language based Structural and Semantic Information" pot

... text of 300 words. Table 1 summarizes the text material for three family-tree branches of Eng- lish, containing 474k words and 1325 documents. No. of Documents Dialect No .of words Train ... Sec 4 explains the baseline acoustic classifier. Language classifiers are described in Sec 5 and the results which are presented in Sec 6 affirm that combining various sources...

Ngày tải lên: 23/03/2014, 17:20

4 344 0
Báo cáo khoa học: "Annealing Techniques for Unsupervised Statistical Language Learning" ppt

Báo cáo khoa học: "Annealing Techniques for Unsupervised Statistical Language Learning" ppt

... 2,000 sentences (48,526 words) for testing. The remain- ing 47,208 sentences (1,125,240 words) were used in training, without any tags. The tagging dictionary was constructed using the entire corpus ... analogies to statistical physics (including phase transitions and the role of β as the inverse of temperature in free-energy minimization) are referred to Rose (1998) for a tho...

Ngày tải lên: 23/03/2014, 19:20

8 242 0
Tài liệu Báo cáo khoa học: Complex transcriptional and translational regulation of iPLA2c resulting in multiple gene products containing dual competing sites for mitochondrial or peroxisomal localization docx

Tài liệu Báo cáo khoa học: Complex transcriptional and translational regulation of iPLA2c resulting in multiple gene products containing dual competing sites for mitochondrial or peroxisomal localization docx

... resulting in loss of the AUG initiating translation of the 88 kDa isoform. We specifically point out that differential utilization of exon 1 vs. exon 2 in splice variants introduces differing upstream ORFs ... profiles and protein chemical techniques including radiolabeling with [ 3 H]BEL [7–12]. Study of human heart PLA 2 underscored the complexity of multiple distinct isoforms...

Ngày tải lên: 19/02/2014, 16:20

16 438 0
Tài liệu Báo cáo khoa học: Different mechanisms for cellular internalization of the HIV-1 Tat-derived cell penetrating peptide and recombinant proteins fused to Tat docx

Tài liệu Báo cáo khoa học: Different mechanisms for cellular internalization of the HIV-1 Tat-derived cell penetrating peptide and recombinant proteins fused to Tat docx

... appears unlikely. The number of arginine residues within the Tat peptide appeared to be the main determinant for main- taining a high translocating activity as pre viously shown by alanine-arginine substitution ... the N-terminal domain of these proteins [6]. Cellular internal- ization of this peptide fused to b-galactosidase was even observed in vivo in various tissues including...

Ngày tải lên: 21/02/2014, 03:20

8 485 0
Báo cáo khoa học: Structural evidence for a constant c11 ring stoichiometry in the sodium F-ATP synthase doc

Báo cáo khoa học: Structural evidence for a constant c11 ring stoichiometry in the sodium F-ATP synthase doc

... 2 lg of pure c 11 with 2 lgofc 1 purified in detergent, the slower migrating band reappeared (lanes 4 and 8). Upon incubation of 2 lg of pure c 11 with 2 and 10 lgofc 1 purified in chloroform ⁄ ... accumulation of the incomplete c 10 complex in the recombinant c ring preparations suggests that the insertion of the last c subunit forms the limiting step in the assembly process...

Ngày tải lên: 07/03/2014, 21:20

10 477 0
Báo cáo khoa học: " A Tool for Error Analysis of Machine Translation Output" doc

Báo cáo khoa học: " A Tool for Error Analysis of Machine Translation Output" doc

... with main informa- tion, and then an item for each menu containing: • The name of the menu • A list of menu items, containing: – Display name – Internal name (used in annotation file, and internally ... of new modules for preprocessing. BLAST has three working modes for handling error annotations: for adding new annotations, for editing existing annota- tions, and for sea...

Ngày tải lên: 07/03/2014, 22:20

6 479 0
Báo cáo khoa học: "A System for Semantic Analysis of Chemical Compound Names" pdf

Báo cáo khoa học: "A System for Semantic Analysis of Chemical Compound Names" pdf

... of BioNLP is to automatically support humans by means of research in the area of infor- mation retrieval, data mining and information ex- traction. Term identification is of great importance in ... task into the subtasks of term recognition (marking the interesting words in a text), term classification (classifying them ac- cording to a taxonomy or an ontology) and term mappin...

Ngày tải lên: 08/03/2014, 01:20

9 479 0
Báo cáo khoa học: "Hybrid Methods for POS Guessing of Chinese Unknown Words" pot

Báo cáo khoa học: "Hybrid Methods for POS Guessing of Chinese Unknown Words" pot

... contextual information and the like- lihood for a character to appear in a par- ticular position of words of a particular length and POS category. By combining models that use different sources of infor- mation, ... POS information about the component words or morphemes of many unknown words is available in the training lexicon. Second, Wu and Jiang (2000) argued that assi...

Ngày tải lên: 08/03/2014, 04:22

6 349 0
w