Ngày tải lên: 17/03/2014, 02:20
... morphological tagger for Arabic. 2 General Approach Arabic words are often ambiguous in their morpho- logical analysis. This is due to Arabic s rich system of affixation and clitics and the omission of disam- biguating ... values of a large number of (or- thogonal) features, such as basic part- of- speech (i.e., noun, verb, and so on), voice, gender, number, infor- mation about the clitics, and so on. 2 For Arabic, ... (including part- of- speech tagging) are the same operation, which consists of three phases. First, we obtain from our morphological analyzer a list of all possible analyses for the words of a given sentence....
Ngày tải lên: 20/02/2014, 15:20
Báo cáo khoa học: "Simultaneous Tokenization and Part-of-Speech Tagging for Arabic without a Morphological Analyzer" doc
Ngày tải lên: 30/03/2014, 21:20
part of speech
... speaker announced the of a new college. ESTABLISH 147. We want to students to participate fully in the running of the college. COURAGE 148. Details of the are available at all participating . COMPETE 149. ... the race because of heavy snow. ORGANIZE 4 Exercises (Parts of speech) Leâ Ngoïc Thaïch 80. Some people are more than others. DEMONSTRATE 81. Your are something to be proud of. ACHIEVE 82. There ... of anger and sensitivity. MIX 3 Exercises (Parts of speech) Leâ Ngoïc Thaïch Give the correct form of the words in brackets. 1. The _______________ of the agriculture in our country is very necessary....
Ngày tải lên: 02/06/2013, 01:25
Tài liệu Báo cáo khoa học: "Fast and Robust Part-of-Speech Tagging Using Dynamic Model Selection" pptx
... and Robust Part- of- Speech Tagging Using Dynamic Model Selection Jinho D. Choi Department of Computer Science University of Colorado Boulder choijd@colorado.edu Martha Palmer Department of Linguistics University ... Yoram Singer. 2003. Feature-Rich Part- of- Speech Tagging with a Cyclic Dependency Network. In Proceedings of the Annual Conference of the North American Chapter of the Association for Computa- tional ... Proceedings of the 45th Annual Meet- ing of the Association of Computational Linguistics, ACL’07, pages 760–767. Anders Søgaard. 2011. Semi-supervised condensed nearest neighbor for part- of- speech...
Ngày tải lên: 19/02/2014, 19:20
Tài liệu Báo cáo khoa học: "Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments" pdf
... 2010). One of the most fundamental parts of the linguis- tic pipeline is part- of- speech (POS) tagging, a basic form of syntactic analysis which has countless appli- cations in NLP. Most POS taggers ... to test the efficacy of this feature set for part- of- speech tagging given lim- ited training data. We randomly divided the set of 1,827 annotated tweets into a training set of 1,000 (14,542 tokens), ... USA {kgimpel,nschneid,brenocon,dipanjan,dpmills, jacobeis,mheilman,dyogatama,jflanigan,nasmith}@cs.cmu.edu Abstract We address the problem of part- of- speech tag- ging for English data from the popular micro- blogging service Twitter. We develop...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "A Fully Bayesian Approach to Unsupervised Part-of-Speech Tagging∗" docx
... Bayesian Approach to Unsupervised Part- of- Speech Tagging ∗ Sharon Goldwater Department of Linguistics Stanford University sgwater@stanford.edu Thomas L. Griffiths Department of Psychology UC Berkeley tom griffiths@berkeley.edu Abstract Unsupervised ... es- timation (MLE) of the model parameters. We show using part- of- speech tagging that a fully Bayesian approach can greatly im- prove performance. Rather than estimating a single set of parameters, ... optimal set of parameter values, we seek to directly maximize the probability of the hidden variables given the ob- served data, integrating over all possible parame- ter values. Using part- of- speech...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Deriving an Ambiguous Word’s Part-of-Speech Distribution from Unannotated Text" doc
... pair consisting of the left and right neighbor of a particular token is characteristic of the part of speech at this position, and by clustering the neighbor pairs on the basis of their middle ... Abstract A distributional method for part- of- speech induction is presented which, in contrast to most previous work, determines the part- of- speech distribution of syntacti- cally ambiguous words ... of speech. The core assumption underlying our approach, which in the context of cognition and child lan- guage has been proposed by Mintz (2003), is that words of a particular part of speech...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Detecting Errors in Part-of-Speech Annotation" docx
... this did not occur. 110 Detecting Errors in Part- of- Speech Annotation Markus Dickinson W. Detmar Meurers Department of Linguistics Department of Linguistics The Ohio State University The ... effectiveness of each method by reporting the results of applying them to the Wall Street Journal (WSJ) corpus as part of the Penn Treebank 3 re- lease, which was tagged using the PARTS tagger and ... Karel Tauger (eds.), Text, Speech and Dialogue (TSD). Springer, pp. 39-46. Adwait Ratnaparkhi, 1996. A maximum entropy model part- of- speech tagger. In Proceedings of EMNLP. Philadelphia, PA,...
Ngày tải lên: 22/02/2014, 02:20
Tài liệu Báo cáo khoa học: "Inferring Selectional Preferences from Part-Of-Speech N-grams" doc
... Feature-Rich Part- of- Speech Tagging with a Cyclic Dependency Network. In Proceedings of the Human Language Technology Conference and Annual Meeting of the North American Chapter of the Association ... paper introduces a method named PONG (for Part- Of- Speech N-Grams) to compute selectional preferences for many different relations by combining part- of- speech information and Google N-grams. ... Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, OR, 2011, 1556–1565. 386 Proceedings of the 13th Conference of the European Chapter of the...
Ngày tải lên: 22/02/2014, 03:20
Báo cáo khoa học: "Part-of-Speech Implications of Affixes" potx
... member of the affix list and met the established criteria. Each of these words had a part- of- speech string given for it, that is, the list of parts of speech possible for that word. The parts of ... independent of prefixes, and vice versa, there was a possibility of a particularly in- fluential and common affix introducing an extra part of speech into the part- of- speech counts of other affixes. ... include one or two extraneous parts of speech. The extra parts of speech will differ accord- ing to the class of words, as adjectives may have an extra part- of- speech "noun" or "adverb,"...
Ngày tải lên: 07/03/2014, 18:20
Báo cáo khoa học: "A Cost Sensitive Part-of-Speech Tagging: Differentiating Serious Errors from Minor Errors" pptx
... Proceedings of the North American Chapter of the Association for Computational Linguistics. pp. 582–590. Thorsten Brants. 2000. TnT-A Statistical Part- of- Speech Tagger. In Proceedings of the Sixth ... 2 Determiner 13 7 47 24 0 Etc 23 11 3 1 0 Table 2: The distribution of tagging errors on WSJ corpus by Stanford Part- Of- Speech Tagger. Tagger (Manning, 2011) (trained with WSJ sections 00–18). In this ... a Maxi- mum Entropy Part- of- Speech Tagger. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. pp. 63–70. Ioannis Tsochantaridis, Thomas Hofmann, Thorsten Joachims,...
Ngày tải lên: 07/03/2014, 18:20
Báo cáo khoa học: "Simple semi-supervised training of part-of-speech taggers" pptx
... of the ACL 2010 Conference Short Papers, pages 205–208, Uppsala, Sweden, 11-16 July 2010. c 2010 Association for Computational Linguistics Simple semi-supervised training of part- of- speech taggers Anders ... Spoustova et al. (2009) use a new pool of unlabeled data tagged by an ensemble of state -of- the-art taggers in every training step of an averaged perceptron POS tagger with 4–5% error reduction. Finally, Søgaard ... Søgaard Center for Language Technology University of Copenhagen soegaard@hum.ku.dk Abstract Most attempts to train part- of- speech tag- gers on a mixture of labeled and unlabeled data have failed. In...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "Efficient Optimization of an MDL-Inspired Objective Function for Unsupervised Part-of-Speech Tagging" docx
... HMM POS-taggers (when given a good start). In Proceedings of the ACL. S. Goldwater and T. L. Griffiths. 2007. A fully Bayesian approach to unsupervised part- of- speech tagging. In Proceedings of the ... Optimization of an MDL-Inspired Objective Function for Unsupervised Part- of- Speech Tagging Ashish Vaswani 1 Adam Pauls 2 David Chiang 1 1 Information Sciences Institute University of Southern ... minimize the size of the model simultane- ously. We define the size of a model as the number of non-zero probabilities in its parameter vector. Let θ 1 , . . . , θ n be the components of θ. We would like...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "Semisupervised condensed nearest neighbor for part-of-speech tagging" pot
... w i of a supervised part- of- speech tagger, in our case SVMTool 1 (Gimenez and Marquez, 2004) trained on Sect. 0–18, and x 2 i is a prediction on w i from an unsupervised part- of- speech tagger ... C ′ from the new data set which is a mixture of labeled and unlabeled data points. See Figure 4 for details. 3 Part- of- speech tagging Our part- of- speech tagging data set is the standard data ... semi- supervised part- of- speech tagging and present the best published result on the Wall Street Journal data set. 1 Introduction Labeled data for natural language processing tasks such as part- of- speech...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" pdf
... segmentation and part- of- speech tagging. On the Penn Chinese Treebank 5.0, we obtain an error reduction of 18.5% on segmentation and 12% on joint seg- mentation and part- of- speech tagging over ... L ¨ u † † Key Lab. of Intelligent Information Processing ‡ Department of Computer & Information Science Institute of Computing Technology University of Pennsylvania Chinese Academy of Sciences Levine ... problem by as- signing each character a boundary tag of the follow- ing four types: • b: the begin of the word • m: the middle of the word • e: the end of the word • s: a single-character word We can...
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "Examining the Content Load of Part of Speech Blocks for Information Retrieval" pptx
... membership of the parts of speech within such blocks reflects the content load of the blocks, on the basis that open class parts of speech are more content-bearing than closed class parts of speech. ... Computational Linguistics Examining the Content Load of Part of Speech Blocks for Information Retrieval Christina Lioma Department of Computing Science University of Glasgow 17 Lilybank Gardens Scotland, ... U.K. xristina@dcs.gla.ac.uk Iadh Ounis Department of Computing Science University of Glasgow 17 Lilybank Gardens Scotland, U.K. ounis@dcs.gla.ac.uk Abstract We investigate the connection between part of speech (POS) distribution...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "A Practical Solution to the Problem of Automatic Part-of-Speech Induction from Text" pdf
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Machine Aided Error-Correction Environment for Korean Morphological Analysis and Part-of-Speech Tagging" pptx
Ngày tải lên: 08/03/2014, 05:21
Báo cáo khoa học: "Categorial Fluidity in Chinese and its Implications for Part-of-speech Tagging" pptx
Ngày tải lên: 08/03/2014, 21:20