an unsupervised model for joint phrase alignment and extraction

Scientific report document: "An Unsupervised Model for Joint Phrase Alignment and Extraction" ppt

... 5-gram model. For GIZA++, we use the standard training regimen up to Model 4, and combine alignments with grow-diag-final-and. For the proposed models, we train for 100 iterations, and use the ... Japan 2 National Institute of Information and Communication Technology 3-5 Hikari-dai, Seika-cho, Soraku-gun, Kyoto, Japan Abstract We present an unsupervised model for joint phrase alignment and ... removed for TM training. For both tasks, we perform weight tuning and testing on specified development and test sets. We compare the accuracy of our proposed method of joint phrase alignment and extraction...

Uploaded: 20/02/2014, 04:20

Scientific report: "A Discriminative Model for Joint Morphological Disambiguation and Dependency Parsing" ppt

... definite article; Hungarian has both a definite and an indefinite article. In both languages (Tables 5 and 6), noun and adjective gender, number, and case are more accurately predicted than in Czech and Latin. ... authors’ and do not necessarily reflect those of the sponsors. References David Bamman and Gregory Crane. 2006. The Design and Use of a Latin Dependency Treebank. Proc. Workshop on Treebanks and ... Results We compare the performance of the pipeline model (§4) and the joint model (§3) on morphological disambiguation and unlabeled dependency parsing. Model Tagger Joint Tagger Joint Attr. ↓ all...

Uploaded: 17/03/2014, 00:20

Scientific report: "An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging" docx

... Dong and Qiang Dong. 2006. Hownet and the Computation of Meaning. World Scientific. Kuzman Ganchev, Koby Crammer, Fernando Pereira, Gideon Mann, Kedar Bellare, Andrew McCallum, Steven Carroll, Yang ... Lafferty, Andrew McCallum, and Fernando Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of ICML, pages 282–289. Ryan McDonald, ... discriminative word-character hybrid model for joint Chinese word segmentation and POS tagging. Our word-character hybrid model offers high performance since it can handle both known and unknown words. We...

Uploaded: 17/03/2014, 01:20

Scientific report document: "A Joint Statistical Model for Simultaneous Word Spacing and Spelling Error Correction for Korean" pdf

... correction candidates. Candidates are increased in number by inserting the blank characters on the created candidates, which cover the spacing error correction candidates. We find the best candidate ... Jianfeng Gao, Mu Li and Chang-Ning Huang. 2003. Improved Source-Channel Models for Chinese Word Segmentation. Proceedings of the 41st Annual Meeting of the ACL, pp. 272-279 Seung-Shik Kang ... word spacing error and spelling error simultaneously for Korean. This algorithm is based on noisy-channel model, which uses Jaso 3 transition probabilities and Eojeol transition probabilities...

Uploaded: 20/02/2014, 12:20

An Equilibrium Model of Rare-Event Premia and Its Implication for Option Smirks potx

... include Gilboa and Schmeidler (1989), Epstein and Wang (1994), Anderson, Hansen, and Sargent (2000), Chen and Epstein (2002), Hansen and Sargent (2001), Epstein and Miao (2003), Routledge and Zin (2002), ... derivatives are examined by Liu and Pan (2003), Liu, Longstaff, and Pan (2003) and Das and Uppal (2001). Dufresne and Hugonnier (2001) study the impact of event risk on pricing and hedging of contingent ... 2005 134 An Equilibrium Model of Rare-Event Premia and Its Implication for Option Smirks Jun Liu Anderson School at UCLA Jun Pan MIT Sloan School of Management, CCFR and NBER Tan Wang Sauder School...

Uploaded: 07/03/2014, 10:20

Scientific report: "A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" pdf

... segmentation only and joint segmentation and part-of-speech tagging. On the Penn Chinese Treebank 5.0, we obtain an error reduction of 18.5% on segmentation and 12% on joint segmentation and part-of-speech ... outside-layer linear model. We used SRI Language Modelling Toolkit (Stolcke and Andreas, 2002) to train a 3-gram word LM with modified Kneser-Ney smoothing (Chen and Goodman, 1998), and a 4-gram POS Features ... 2002), Chinese word segmentation (Ng and Low, 2004; Zhang and Clark, 2007) and so on. We trained a character-based perceptron for Chinese Joint S&T, and found that the perceptron itself...

Uploaded: 08/03/2014, 01:20

Scientific report: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" potx

... explored in previous work (Zhang and Clark, 2010; Jiang et al., 2008b). In this paper, we present an effective and efficient solution for joint Chinese word segmentation and POS tagging. Our work ... (Ng and Low, 2004; Jiang et al., 2008a; Zhang and Clark, 2008). 2.2 Character-Based and Word-Based Methods Two kinds of approaches are popular for joint word segmentation and POS tagging. The ... June. Association for Computational Linguistics. Yue Zhang and Stephen Clark. 2010. A fast decoder for joint word segmentation and POS-tagging using a single discriminative model. In Proceedings...

Uploaded: 17/03/2014, 00:20

Scientific report: "A global model for joint lemmatization and part-of-speech prediction" doc

... log-linear model capturing such dependencies, and demonstrated its effectiveness on English and three Slavic languages. Acknowledgements We would like to thank Galen Andrew and Lucy Vanderwende for ... tag prediction and lemmatization are strongly dependent and that by building state-of-the art models for the two subtasks and performing joint inference we can improve performance on both tasks. ... the annotations. 491 lemmatization subtasks, which a joint model could exploit. 6.3 Evaluation of joint models Since our joint model re-ranks candidates produced by the component tagger and...

Uploaded: 17/03/2014, 01:20

Scientific report: "A Joint Rule Selection Model for Hierarchical Phrase-based Translation" pptx

... proposed joint probability model is factored into four sub-models that can be further classified into source-side and target-side rule selection models or context-based and context-free selection models. ... same. As Rule (1) cannot be applied to Figure 1(b) for the translation and Rule (2) cannot be applied to Figure 1(a) for the translation either, υ = 1, C(r a s ), C(r a t ) and υ = 1, C(r b s ), ... context information. 3 Model Training of CBSM and CBTM 3.1 The acquisition of training instances CBSM and CBTM are trained by ME approach for the binary classification, where a training instance consists...

Uploaded: 23/03/2014, 16:20

Scientific report: "An Unsupervised Approach to Prepositional Phrase Attachment using Contextually Similar Words" potx

... 7.1. Table 8 presents the precision and recall of our algorithm and Table 9 presents a performance comparison between our system and previous supervised and unsupervised approaches using the same ... PGSB207797. References Altmann, G. and Steedman, M. 1988. Interaction with Context During Human Sentence Processing. Cognition, 30:191-238. Brill, E. 1995. Transformation-based Error-driven Learning and Natural Language ... classifier outperforms all previous unsupervised techniques and approaches the performance of supervised algorithm. We reconstructed the two earlier unsupervised classifiers cl HR and cl R2 . Table...

Uploaded: 08/03/2014, 05:20

Scientific report: Functional analysis of cell-free-produced human endothelin B receptor reveals transmembrane segment 1 as an essential area for ET-1 binding and homodimer formation pptx

... incubated for 1 h at 25 °C, and purified on Strep-Tactin columns as described above. Acknowledgements We are grateful to Clemens Glaubitz and Andreas Engel for valuable discussions, and we thank Walter Rosenthal ... 3263 Functional analysis of cell-free-produced human endothelin B receptor reveals transmembrane segment 1 as an essential area for ET-1 binding and homodimer formation Christian Klammt 1 , Ankita Srivastava 2 , ... Brij78, 1%; and digitonin, 0.4%. Cloning procedures and protein analysis Coding regions of full-length ETB and its derivatives were amplified from cDNA by standard PCR techniques, and the fragments...

Uploaded: 16/03/2014, 10:20

Scientific report: "Toolkit for Multi-Level Alignment and Information Extraction from Comparable Corpora" pptx

... term-tagging tools for English, Latvian, Lithuanian, and Romanian, but can be easily extended for other languages if a POS-tagger, a phrase pattern list, a stop-word list, and an inverse document ... Levenshtein distance between term candidates. For evaluation, Eurovoc (Steinberger et al., 2002) was used. Tables 4 and 5 show the performance figures of the mapper for English-Romanian and English-Latvian. ... performance for English-Latvian. 3 Conclusions and Related Information This demonstration paper describes the ACCURAT toolkit containing tools for multi-level alignment and information extraction...

Uploaded: 16/03/2014, 20:20

Scientific report: "A Language-Independent Unsupervised Model for Morphological Segmentation" pot

... syntactic and semantic information from the context the word occurs (Schone and Jurafsky, 2000; Bordag, 2006; Yarowski and Wicentowski, 2000; Jacquemin, 1997). Exploiting semantic and syntactic information ... thank Emily Pitler and Samarth Keshava for making available the code of the RePortS algorithm, and Stefan Bordag and Delphine Bernhard for running their algorithms on the German data. Many ... stem candidate aufführ is then stored together with the suffix candidates {ender, ung, en, t, laune}. Step 2: Ranking candidate stems There are two types of affix candidates: type-1 affix candidates...

Uploaded: 17/03/2014, 04:20

Medical report: An alternative model for photosystem II/light harvesting complex II in grana membranes based on cryo-electron microscopy studies pptx

... with software and Dr S. Prince, Dr S. V. Rue and Prof. G. Garab for useful suggestions and debate. T. D. Flint is thanked for plant growth and specimen preparation as well as L. Child and P. McPhie for ... Tris/urea-treated membranes as determined by SDS/PAGE and Coomassie staining. The left lane shows band a and the right lane band c. Molecular mass markers are indicated on the left of the panel. © FEBS ... adjacent membrane. This fits the structural and biochemical data, where PSII core complexes can be observed in one discrete plane and membrane fraction, and LHCII complexes can be observed in another membrane...

Uploaded: 17/03/2014, 17:20

Scientific report: "A Topic Similarity Model for Hierarchical Phrase-based Translation" ppt

... like to thank Yun Huang, Zhengxian Gong, Wenliang Chen, Jun lang, Xiangyu Duan, Jun Sun, Jinsong Su and the anonymous reviewers for their insightful comments. References Nicola Bertoldi and Marcello ... Hanna M. Wallach, Jason Naradowsky, David A. Smith, and Andrew McCallum. 2009. Polylingual topic models. In Proc. of EMNLP 2009. Franz J. Och and Hermann Ney. 2002. Discriminative training and ... 2009. Andreas Stolcke. 2002. Srilm – an extensible language modeling toolkit. In Proc. ICSLP 2002. Yik-Cheung Tam, Ian R. Lane, and Tanja Schultz. 2007. Bilingual lsa-based adaptation for statistical...

Uploaded: 23/03/2014, 14:20

