Báo cáo khoa học: "Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency" ppt

Báo cáo khoa học: "Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency" ppt

Báo cáo khoa học: "Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency" ppt

... Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency Mathias Creutz Neural Networks Research Centre, Helsinki University of Technology P.O.Box 9800, ... the quality of the segmentation we compute the expectation of the proportion of correct mappings from morphs to morpheme labels, E{p(morpheme | morph) }: 1 N N  i=1 p i (mo...

Ngày tải lên: 31/03/2014, 03:20

8 215 0
Tài liệu Báo cáo khoa học: "Unsupervised Segmentation of Chinese Text by Use of Branching Entropy" pdf

Tài liệu Báo cáo khoa học: "Unsupervised Segmentation of Chinese Text by Use of Branching Entropy" pdf

... Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pages 428–435, Sydney, July 2006. c 2006 ... Computational Linguistics 428 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 1 2 3 4 5 6 7 8 entropy offset 429 430 431 432 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.55 0.6 0.65 0.7 0.75

Ngày tải lên: 20/02/2014, 12:20

8 395 0
Báo cáo khoa học: "Text Segmentation by Language Using Minimum Description Length" ppt

Báo cáo khoa học: "Text Segmentation by Language Using Minimum Description Length" ppt

... he used a gold standard of multilingual texts annotated by borders and languages. This segmentation approach is similar to that of word segmentation for non- segmented texts, and he tested it ... problem of detecting words and phrases in languages other than the prin- cipal language of a given text. They used statisti- cal language modeling and heuristics to detect for-...

Ngày tải lên: 07/03/2014, 18:20

10 290 0
Báo cáo khoa học: "A Multi-Neuro Tagger Using Variable Lengths of Contexts" pdf

Báo cáo khoa học: "A Multi-Neuro Tagger Using Variable Lengths of Contexts" pdf

... A Multi-Neuro Tagger Using Variable Lengths of Contexts Qing Ma and Hitoshi Isahara Communications Research Laboratory Ministry of Posts and Telecommunications 588-2, Iwaoka, ... variable lengths of contexts and weighted inputs (with information gains) for part of speech tagging. Computer experiments show that it has a correct rate of over 94% for tag- ging ambiguou...

Ngày tải lên: 23/03/2014, 19:20

5 208 0
Báo cáo khoa học: G protein-coupled receptor-induced Akt activity in cellular proliferation and apoptosis pptx

Báo cáo khoa học: G protein-coupled receptor-induced Akt activity in cellular proliferation and apoptosis pptx

... progression, migration and sur- vival [1–4]. The Akt subfamily of protein kinases con- sists of three isoforms – Akt1, Akt2 and Akt3 (also termed PKBa, PKBb and PKBc) – which are the products of distinct ... apoptosis and survival. Thus, the over- expression of Akt subtypes has been measured in a number of cancer types, and dominant-negative forms of Akt can trigger apop...

Ngày tải lên: 16/03/2014, 05:20

12 392 0
Báo cáo khoa học: "Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining" ppt

Báo cáo khoa học: "Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining" ppt

... s i .nch is the number of children of a subtree. q i .w and q i .t are the word and its POS tag in the queue. dist(s 0 ,s 1 ) is the linear distance between the head -words of s 0 and s 1 . subtrees ... output). Ω is determined by the set of dependency labels r ∈ R and one of three transition types: • Shift: remove the head of Q (w j ) and place it on the top of S as...

Ngày tải lên: 16/03/2014, 19:20

9 319 0
Tài liệu Báo cáo khoa học: "Unsupervised Discourse Segmentation of Documents with Inherently Parallel Structure" pdf

Tài liệu Báo cáo khoa học: "Unsupervised Discourse Segmentation of Documents with Inherently Parallel Structure" pdf

... Jeong and Ivan Titov Saarland University Saarbr ¨ ucken, Germany {m.jeong|titov}@mmci.uni-saarland.de Abstract Documents often have inherently parallel structure: they may consist of a text and commentaries, ... Segmentation of these par- allel parts into coherent fragments and discovery of hidden relations between them would facilitate the development of better user interface...

Ngày tải lên: 20/02/2014, 04:20

5 376 0
Báo cáo khoa học: "Unsupervised Learning of Field Segmentation Models for Information Extraction" pot

Báo cáo khoa học: "Unsupervised Learning of Field Segmentation Models for Information Extraction" pot

... range of linguistic phe- nomena in text, including morphology, parts -of- speech (POS), named entity mentions, and even topic changes in discourse. An HMM consists of a set of states S, a set of ... title, and date. Classified advertise- ments, such as the one in Figure 1(b), also exhibit field structure, if less rigidly: an ad consists of de- scriptions of attributes of an i...

Ngày tải lên: 23/03/2014, 19:20

8 343 0
Báo cáo khoa học: "Unsupervised Lexicon-Based Resolution of Unknown Words for Full Morphological Analysis" doc

Báo cáo khoa học: "Unsupervised Lexicon-Based Resolution of Unknown Words for Full Morphological Analysis" doc

... segmentation of each word and its character classification is observed, and the POS tagging is ambiguous. The segmentation (of all words in a given sentence) and the POS tagging (of the known words) ... instances, against gold-standard morphological analysis. We also exploit the morphological patterns characteris- tic of semitic morphology, but extend the guessing of morph...

Ngày tải lên: 31/03/2014, 00:20

9 273 0
Tài liệu Báo cáo khoa học: "Unsupervised Search for The Optimal Segmentation for Statistical Machine Translation" doc

Tài liệu Báo cáo khoa học: "Unsupervised Search for The Optimal Segmentation for Statistical Machine Translation" doc

... unaddressed problem of unsupervised determination of the optimal morphological segmentation for statistical machine translation (SMT) and propose a segmentation metric that takes into account both sides of the ... linguistic resources in initializing the segmentation with the output of a morphological analyzer and disam- biguator. Talbot and Osborne (2006) tackle a spe- ci...

Ngày tải lên: 20/02/2014, 04:20

6 446 0
w