a part of speech estimation method

Scientific report document: "A Fully Bayesian Approach to Unsupervised Part-of-Speech Tagging" docx

... probability over a range of possible parameters, and permits the use of priors favoring the sparse distributions that are typical of natural language. Our model has the structure of a standard ... no gold standard available. Luckily, the Bayesian approach allows us to automatically select values for the hyperparameters by treating them as additional variables in the model. We augment the ... optimal set of parameter values, we seek to directly maximize the probability of the hidden variables given the observed data, integrating over all possible parameter values. Using part-of-speech...

Uploaded on: 20/02/2014, 12:20

Scientific report: "A Cost Sensitive Part-of-Speech Tagging: Differentiating Serious Errors from Minor Errors" pptx

... 43–46. Sharon Goldwater and Thomas T. Griffiths. 2007. A fully Bayesian Approach to Unsupervised Part-of-Speech Tagging. In Proceedings of the 45th Annual Meeting of the Association of Computational ... 265–292. Dipanjan Das and Slav Petrov. 2011. Unsupervised Part-of-Speech Tagging with Bilingual Graph-Based Projections. In Proceedings of the 49th Annual Meeting of the Association of Computational ... Computational Natural Language Learning. pp. 296–305. Taku Kudo, Kaoru Yamamoto, and Yuji Matsumoto. 2004. Applying Conditional Random Fields to Japanese Morphological Analysis. In Proceedings of the...

Uploaded on: 07/03/2014, 18:20

Scientific report: "A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" pdf

... at the same time, we expand boundary tags to include POS information by attaching a POS to the tail of a boundary tag as a postfix following Ng and Low (2004). As each tag is now composed of a ... segmentation and POS tagging (Joint S&T). Since the typical approach of discriminative models treats segmentation as a labelling problem by assigning each character a boundary tag (Xue and ... i a N-best list of candidate results from all these candidates. When we derive a candidate result from a word-POS pair p and a candidate q at prior position of p, we calculate the scores of...

Uploaded on: 08/03/2014, 01:20

Scientific report: "A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing" potx

... Linguistics. A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing. Jianfeng Gao*, Galen Andrew*, Mark Johnson*&, Kristina Toutanova*. *Microsoft ... Introduction. Parameter estimation is fundamental to many statistical approaches to NLP. Because of the high-dimensional nature of natural language, it is often easy to generate an extremely large ... Lasso (L1) regularization. We first investigate all of our estimators on two re-ranking tasks: a parse selection task and a language model (LM) adaptation task. Then we apply the best of...

Uploaded on: 08/03/2014, 02:21

Scientific report: "A Practical Solution to the Problem of Automatic Part-of-Speech Induction from Text" pdf

... (1992). Class-based n-gram models of natural language. Computational Linguistics 18(4), 467-479. Clark, Alexander (2003). Combining distributional and morphological information for part of speech ... data sparseness can be minimized by reducing the dimensionality of the matrix. An appropriate algebraic method that has the capability to reduce the dimensionality of a rectangular matrix ... are much more salient. Also, widely and rural are well within the adjective cluster. The comparison of the two dendrograms indicates that the SVD was capable of making appropriate generalizations....

Uploaded on: 08/03/2014, 04:22

Scientific report: "A Hierarchical Pitman-Yor Process HMM for Unsupervised Part of Speech Induction" doc

... that result in the same tagging, at all levels in the hierarchy: tag trigrams, bigrams and unigrams; and also words, character bigrams and character unigrams. To avoid this rather onerous marginalisation we ... Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP), pages 504–512. Noah A. Smith and Jason Eisner. 2005. Contrastive estimation: Training log-linear ... bigram distribution. A hierarchy of PYPs can be formed by making the base distribution of a PYP another PYP, following a ... state-of-the-art results across a range of corpora and languages. 2 Background Past...

Uploaded on: 17/03/2014, 00:20

A Study of Channel Estimation for OFDM Systems and System Capacity for MIMO-OFDM Systems

... increasing demands of high data transmission rate and reliable communication quality, channel estimation has become a necessary part of the OFDM system. For example, the digital video broadcasting ... 4.3.4. Summary of the proposed channel estimation and data detection; 4.4. Analysis of MSE of the proposed channel estimation method; 4.4.1. MSE analysis of channel estimation for the ... transmitting data spread over a large bandwidth (usually larger than 500 MHz) that is shared among users. UWB was traditionally applied in non-cooperative radar imaging. Most recent applications include...

Uploaded on: 20/11/2012, 11:28

part of speech

... The speaker announced the ______ of a new college. ESTABLISH 147. We want to ______ students to participate fully in the running of the college. COURAGE 148. Details of the ______ are available at all participating ... mixture of the two. FRUSTRATE 139. Researchers in this field have made some important new ______. DISCOVER 140. ______ is part of the American character. GENEROUS 141. ______, his wife was killed in a car accident. TRAGIC 142. ... musically and it is very effective. LYRICS 133. She promised not to say a word to anyone about it. SOLEMN 134. What unusual ______ of flavours! COMBINE 135. His ______ was a combination of surgery, radiation and...

Uploaded on: 02/06/2013, 01:25

Scientific report document: "Fast and Robust Part-of-Speech Tagging Using Dynamic Model Selection" pptx

... Ogren, Wayne Ward, James H. Martin, Guergana Savova, and Martha Palmer. 2010. An architecture for complex clinical question answering. In Proceedings of the 1st ACM International Health Informatics ... of the Association for Computational Linguistics: Human Language Technologies, ACL’11, pages 48–52. Drahomíra “johanka” Spoustová, Jan Hajič, Jan Raab, and Miroslav Spousta. 2009. Semi-supervised ... in at least 3 documents of the training data are used. For a domain-specific model, we use a threshold of 1. The generalized and domain-specific models are trained separately; their learning parameters...

Uploaded on: 19/02/2014, 19:20
