a syllable based word recognition model

Scientific report: "A Syllable Based Word Recognition Model for Korean Noun Extraction"

... is larger than that of an alphabet in English. In addition, there are particular characteristics in Korean syllables. The fact that words do not start with certain syllables is one of such examples. ... it requires a training data. Because the existing Korean POS tagged corpora are annotated by a morpheme level, we cannot use them as a training data without converting the data suitable for the word recognition model. ... such as “님 (nim)”, “들 (deul)”, or “적 (jeog)” is also regarded as a word because it is an uninflected morpheme. 3 Syllable based word recognition model A Korean syllable consists of an obligatory...

Uploaded: 17/03/2014, 06:20

Scientific report: "A Discriminative Syntactic Word Order Model for Machine Translation"

... translation, we develop a discriminative order model. An advantage of such a model is that we can easily combine different kinds of features (such as syntax-based and surface-based), and that ... Distortion models for statistical machine translation. In ACL. D. Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. In ACL. M. Collins. 2000. Discriminative reranking ... inference and training of context-rich syntactic translation models. In ACL. P. Koehn. 2004. Pharaoh: A beam search decoder for phrase-based statistical machine translation models. In AMTA. R....

Uploaded: 20/02/2014, 12:20

Scientific report: "A TAG-based noisy channel model of speech repairs"

... together with a bigram language model. Then each of these analysis is rescored using the TAG channel model and a syntactic parser based language model. The TAG channel model's analysis do not ... reparandum and interregnum words than the classifier proposed in Charniak and Johnson (2001). Replacing the bigram language model with a trigram model helps slightly, and parser-based language model ... sentence, and we do this by reranking the initial analysis, replacing the bigram language model with a syntactic parser based model. We do not need to intersect this parser based language model...

Uploaded: 17/03/2014, 06:20

Scientific report: "Constituent-Based Morphological Parsing: A New Approach to the Problem of Word-Recognition"

... Ngarrka-ngku.ka marlu marna-kurra luwa.rnu ngarni.nja-kurra (man-ergative-aux kangaroo grass-obj shoot-past eat-infinitive-obj) 'The man is shooting the kangaroo while it is eating grass.' ... Vowel-Harmony The first rule indicates that a word consists of an optional prefix followed by a Vowel-Harmony-Domain; the second claims that a Vowel-Harmony-Domain is a string analyzable as a ... Sciences and Humanities Research Council of Canada. [1] Reduplication is a word formation process involving the repetition of a word or a part of a word. As an example, in Warlpiri there is a process...

Uploaded: 08/03/2014, 18:20

Measuring Word Recognition Using a Picture


... "Look at the picture (Pause). Look at the words around the picture (Pause.) Find the biggest word. What is that word? (Students will say, "Table.") A line has been drawn from that word ... from a systematic pattern of lining. • Have one example word that is familiar to all students and is bigger than the rest of the words. Difficulty of Items The task will contain words of varying ... task type as one of a multiple of task types in a reading test. Teachers may also apply this task type to classroom exercises and homework assignments. Acknowledgments • The author thanks Hyesug...

Uploaded: 06/09/2013, 11:10

Scientific report: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation"

... Translation: Syntactically Informed Phrasal SMT. In Proc. ACL, pages 271-279. A. Ramanathan, Pushpak Bhattacharyya, Jayprasad Hegde, Ritesh M. Shah and Sasikumar M. 2008. Simple syntactic and ... source language based on both lexical and syntactical features. We evaluated our approach on large-scale Japanese-English and English-Japanese machine translation tasks, and show that it can significantly ... Machine Translation. Ph.D. Thesis. Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan and Nandakishore Kambhatla. 2010. Syntax Based Reordering with Automatically Derived...

Uploaded: 19/02/2014, 19:20

Scientific report: "Smoothing a Tera-word Language Model"

... and Linda C. Bauman Peto. 1995. A hierarchical Dirichlet language model. Natural Language Engineering, 1(3):1–19. Y.W. Teh. 2006. A hierarchical Bayesian language model based on Pitman-Yor processes. ... n-grams: C(ab) − C(ab∗). A(ab) = max(1, K(C(ab) − C(ab∗))) A different K constant is chosen for each n-gram order. Using this formulation as an interpolated 5-gram language model gives a cross ... interpolated form is: α(c|ab) = max(0, C(abc) − D) / C(ab∗), γ(ab) = N(ab∗) D / C(ab∗) (4) The ∗ represents a wildcard matching any word and C(ab∗) is the total count of n-grams that start with the n − 1 words...
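The two interpolation terms quoted in this excerpt, the discounted relative frequency α(c|ab) and the back-off weight γ(ab), can be illustrated with a small sketch. This is not the paper's implementation: the function name `interpolation_terms`, the toy counts, and the discount value are hypothetical, and N(ab∗) is assumed here to mean the number of distinct words observed after the context, which the truncated excerpt does not define.

```python
from collections import Counter


def interpolation_terms(trigram_counts, context, word, discount):
    """Return (alpha, gamma) for one trigram context, in the excerpt's notation.

    trigram_counts maps (a, b, c) tuples to counts; context is the pair (a, b).
    """
    # Counts of every trigram that starts with the context, i.e. matches "ab*".
    followers = {
        c: n for (a, b, c), n in trigram_counts.items()
        if (a, b) == context and n > 0
    }
    total = sum(followers.values())   # C(ab*)
    if total == 0:
        raise ValueError("context was never observed")
    distinct = len(followers)         # N(ab*), assumed: distinct continuations
    # alpha(c|ab) = max(0, C(abc) - D) / C(ab*)
    alpha = max(0.0, followers.get(word, 0) - discount) / total
    # gamma(ab) = N(ab*) * D / C(ab*)
    gamma = distinct * discount / total
    return alpha, gamma


# Toy counts (hypothetical): "a b" is followed by "c" three times and "d" once.
counts = Counter({("a", "b", "c"): 3, ("a", "b", "d"): 1})
alpha_c, gamma = interpolation_terms(counts, ("a", "b"), "c", discount=0.5)
alpha_d, _ = interpolation_terms(counts, ("a", "b"), "d", discount=0.5)
```

With a discount no larger than the smallest observed count, the α values for all observed words plus γ sum to one, so γ is exactly the probability mass handed to the lower-order model.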

Uploaded: 20/02/2014, 09:20

Scientific report: "A Phrase-based Statistical Model for SMS Text Normalization"

... SMS normalization. 2.3 SMS Normalization versus Text Paraphrasing Problem Others may regard SMS normalization as a paraphrasing problem. Broadly speaking, paraphrases capture core aspects ... a consensus translation technique to bootstrap parallel data using off-the-shelf translation systems for training a hierarchical statistical translation model for general domain instant ... minimal adaptation. One advantage of this pre-translation normalization is that the diversity in different user groups and domains can be modeled separately without accessing and adapting...

Uploaded: 20/02/2014, 12:20

Scientific report: "An Alignment Algorithm using Belief Propagation and a Structure-Based Distortion Model"

... sentence pairs extracted from pre-aligned data (Utiyama and Isahara, 2003) as a gold standard. We segmented all the Japanese data with the automatic segmenter Juman (Kurohashi and Nagao, 1994). ... Related Work Automatic word alignment of parallel corpora is an important step for data-oriented Machine translation (whether Statistical or Example-Based) as well as for automatic lexicon acquisition. ... Rada Mihalcea and Ted Pedersen, editors, HLT-NAACL 2003 Workshop: Building and Using Parallel Texts: Data Driven Machine Translation and Beyond, pages 1–10, Edmonton, Alberta, Canada, May...

Uploaded: 22/02/2014, 02:20

Scientific report: "A Class-Based Agreement Model for Generating Accurately Inflected Translations"

... exponential translation model for target language morphology. In ACL-HLT. C. Tillmann. 2004. A unigram orientation model for statistical machine translation. In NAACL. K. Toutanova, H. Suzuki, and A. ... similar agreement phenomena as probabilistic sequences. Factored Translation Models Factored translation models (Koehn and Hoang, 2007) facilitate a more data-oriented approach to agreement modeling. Words ... modeling. Words are represented as a vector of features such as lemma and POS. The bitext is annotated with separate models, and the annotations are saved during phrase extraction. Hassan et al. (2007)...

Uploaded: 16/03/2014, 19:20

Scientific report: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging"

... systems (Ng and Low, 2004; Jiang et al., 2008a; Zhang and Clark, 2008). 2.2 Character-Based and Word-Based Methods Two kinds of approaches are popular for joint word segmentation and POS tagging. ... information for each character. Each character can be assigned one of two possible boundary tags: “B” for a character that begins a word and “I” for a character that occurs in the middle of a word. ... applicable. Zhang et al. (2006) described a sub-word based tagging model to resolve word segmentation. To get the pieces which are larger than characters but smaller than words, they combine a character-based segmenter...
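The two-tag boundary scheme described in this excerpt ("B" for a character that begins a word, "I" for any later character in the same word) can be illustrated with a short sketch. The function names `words_to_bi_tags` and `bi_tags_to_words` and the sample sentence are hypothetical; this shows only the tag encoding, not the paper's stacked tagging model.

```python
def words_to_bi_tags(words):
    """Turn a segmented sentence into (character, tag) pairs."""
    pairs = []
    for word in words:
        for index, char in enumerate(word):
            # The first character of each word is tagged "B", the rest "I".
            pairs.append((char, "B" if index == 0 else "I"))
    return pairs


def bi_tags_to_words(pairs):
    """Rebuild the word sequence from (character, tag) pairs."""
    words = []
    for char, tag in pairs:
        if tag == "B" or not words:
            # A "B" tag (or a stray leading "I") opens a new word.
            words.append(char)
        else:
            # An "I" tag extends the word currently being built.
            words[-1] += char
    return words


segmented = ["北京", "欢迎", "你"]
tagged = words_to_bi_tags(segmented)
restored = bi_tags_to_words(tagged)
```

Because the tags alone determine the word boundaries, segmentation reduces to predicting one tag per character, and decoding the predicted tags recovers the words.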

Uploaded: 17/03/2014, 00:20

Scientific report: "Applying a Grammar-based Language Model to a Simplified Broadcast-News Transcription Task"

... linguistically motivated grammar (a hand-crafted Head-driven Phrase Structure Grammar) and a statistical model estimating the probability of a parse tree. The language model is applied by means of an N-best ... create an artificial recognition task with manageable complexity. Our primary aim was to design a task which allows us to investigate the properties of our grammar-based approach and to compare ... grammar accept arbitrary sequences of words and phrases. To keep the grammar restrictive, such sequences are penalized by the statistical model. Accurate hand-crafted grammars have been applied...

Uploaded: 17/03/2014, 02:20

Scientific report: "A Plan Recognition Model for Clarification Subdialogues"

... the plan. The parameters of a plan are the parameters in the header. Associated with each plan is a set of constraints, which are assertions about the plan and its terms and parameters. ... wharf") or a pop. The pop allows a metaplan to the stacked SEEK-ID-PARAMETER of PLAN2 ("What's a gate?") or a pop, which allows a metaplan to the original domain plan ... constructed an entire plan stack based on the original domain-specific expectations to BOARD or MEET a train. Recall that in parallel with the above, communicative analysis is also taking place....

Uploaded: 17/03/2014, 19:21

Scientific report: "The Best of Both Worlds – A Graph-based Completion Model for Transition-based Parsers"

... a suitable amount of training data, the model can thus learn to make the correct decision. The dynamic-programming based graph-based parser is designed in such a way that any score calculation ... in the beam are recalculated based on a scoring model inspired by the graph-based parsing approach, i.e., taking complete factors into account as they become incrementally available. As a consequence ... compare aspects of transition-based and graph-based parsing, and end up using a transition-based parser with a combined transition-based/second-order graph-based scoring model (Zhang and Clark, 2008, 567),...

Uploaded: 17/03/2014, 22:20

Scientific report: "Multiple Interpreters in a Principle-Based Model of Sentence Processing"

... (such as the syntactic and semantic processors), apply maximally to any input, thereby constructing a maximal, partial interpretation for a given partial input signal. This entails that each ... on locally identifiable links of a Chain: (5) In an argument (NP) Chain, i) <C-Node- A- co C-NodeA> case-mark(C-Nodea) or, ii) C-NodeA - head(Chain) * case-mark(C-Nodea) In an argument ... to it as a 'unit' Chain, representing an unmoved element. We noted above that each representation's schema provides a natural locality constraint. That is, we should be able...

Uploaded: 17/03/2014, 06:20
