parts of speech
... reasonably C. reasoning D. reason 67. There are a lot of ______ jobs in this company. A. attracted B. attraction C. attractive D. attract 68. Fortunately, the plane landed ______ after the violent ... Young Asians are not so as their American counterparts. A. Rome B. Roman C. romantic D. romanticize 75. All of my students appreciate the ______ of English learning. A. importance B. important ... expect B. expecting C. expectation D. expects 80 . With the ______ of weather, the journey was wonderful. A. except B. exception C. excepting D. excepts 81 . The food in this restaurant was rather...
Ngày tải lên: 27/10/2013, 10:11
Tài liệu Báo cáo khoa học: "Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments" pdf
... perform poorly on Twitter (Finin et al., 2010). One of the most fundamental parts of the linguis- tic pipeline is part -of- speech (POS) tagging, a basic form of syntactic analysis which has countless appli- cations ... to test the efficacy of this feature set for part -of- speech tagging given lim- ited training data. We randomly divided the set of 1 ,82 7 annotated tweets into a training set of 1,000 (14,542 tokens), ... standard parts of speech 3 (noun, verb, etc.) as well as categories for token varieties seen mainly in social media: URLs and email ad- dresses; emoticons; Twitter hashtags, of the form #tagname,...
Ngày tải lên: 20/02/2014, 04:20
Báo cáo khoa học: "Examining the Content Load of Part of Speech Blocks for Information Retrieval" pptx
... 0.294 +4 .8 0.292 +4.1 0.297 +5.9 0.293 +4.5 BB2 0.237 0.291 +22 .8 0. 287 +21.0 0.295 +24.2 0. 288 +21.5 PL2 0.2 68 0.2 98 +11.2 0.297 +10.9 0.306 +14.1 0.302 +12 .8 DLH 0.237 0.239 +0.7 0.2 38 +0.4 0.243 ... blocks, on the basis that open class parts of speech are more content-bearing than closed class parts of speech. We test these hypothe- ses in the context of Information Retrieval, by syntactically ... pages 531–5 38, Sydney, July 2006. c 2006 Association for Computational Linguistics Examining the Content Load of Part of Speech Blocks for Information Retrieval Christina Lioma Department of Computing...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "Tagging Urdu Text with Parts of Speech: A Tagger Comparison" doc
... tool VB 95 .88 % 95 .88 % 96. 58% 96 .80 % NN 94.64% 95 .85 % 94.79% 96.64% PN 86 .92% 79.73% 84 .96% 81 .70% ADV 82 . 28% 79.11% 81 .64% 81 .01% ADJ 91.59% 89 .82 % 92.37% 88 .26% “Table 7: Accuracies of open class ... VB 93.20% 91 .86 % 92. 68% 94.23% NN 94.12% 96.21% 93 .89 % 96.45% PN 73.20% 66 .88 % 72.77% 68. 62% ADV 75.94% 72. 78% 74. 68% 72.15% ADJ 85 .67% 80 . 78% 86 .5% 85 .88 % “Table 5: Accuracies of open class ... distribution informa- tion was given with the lexicon. Tag TnT tagger Tree- Tagger RF tagger SVM tool VB 28. 57% 0.00% 42 .86 % 42 .86 % NN 74.47% 95.74% 80 .85 % 80 .85 % PN 68. 18% 54.54% 63.63%...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "Feature-Rich Part-of-speech Tagging for Morphologically Complex Languages: Application to Bulgarian" docx
... four major types of ambiguity: 1. Between the wordforms of the same lexeme, i.e., in the paradigm. For example, , an inflected form of (‘sofa’, mascu- line), can mean (a) ‘the sofa’ (definite, singu- lar, ... improve- ment over the state -of- the-art for Bulgarian. 1 Introduction Part -of- speech (POS) tagging is the task of as- signing each of the words in a given piece of text a contextually suitable ... larger inventory of POS tags, e.g., the Penn Treebank (Marcus et al., 1993) uses 48 tags: 36 for part- of- speech, and 12 for punctuation and currency symbols. This increase in the number of tags is...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "Automatic Determination of Parts of Speech of English Words" docx
... DETERMINATION OF PARTS OF SPEECH 61 DETERMINATION OF PARTS OF SPEECH 63 TASK 2: TABULATION OF SPECIAL-PURPOSE WORDS WHICH ARE NOT COVERED BY RULES A, B, OR C For Task 2, a subset of the dictionary ... believed to be adequate to main- tain the goal of 95 per cent accuracy. DETERMINATION OF PARTS OF SPEECH 55 DETERMINATION OF PARTS OF SPEECH 65 [Mechanical Translation and Computational ... into affix and kernel parts and assigned a part of speech on the basis of the part -of- speech implications of the affixes and the length of the remaining kernel. An accuracy of 95 per cent is achieved...
Ngày tải lên: 16/03/2014, 19:20
Báo cáo khoa học: "Part of Speech Tagger for Assamese Text" docx
... a manually tagged corpus of about 10000 words for training, we obtain a tagging accuracy of nearly 87 % for test inputs. 1 Introduction Part of Speech (POS) tagging is the process of marking up words ... richness of the language, many words of Assamese occur in secondary forms in texts. This increases the number of POS tags that needed for the language. Also, often there are differences of opinion ... Saharia Department of CSE Tezpur University India - 784 0 28 Dhrubajyoti Das Department of CSE Tezpur University India - 784 0 28 {nava tu,dhruba it06,utpal}@tezu.ernet.in Utpal Sharma Department of CSE Tezpur...
Ngày tải lên: 17/03/2014, 02:20
Báo cáo khoa học: "Modeling Human Sentence Processing Data with a Statistical Parts-of-Speech Tagger" ppt
... that a formally adequate account of recur- sive syntactic structure is an essential component of any model of the behaviour. In this study, we tested a bigram POS tagger on different types of structural ... Journal of Experimental Psychology, 16: 555–5 68, 1 986 . L. Frazier. On comprehending sentences: Syntac- tic parsing strategies. Ph.D. dissertation, Uni- versity of Massachusetts, Amherst, MA, 19 78. L. ... reading time penalty for a garden-path region when the size of the prob- ability decrease at the disambiguating region of a garden-path sentence will be greater than that of control sentences....
Ngày tải lên: 17/03/2014, 04:20
Báo cáo khoa học: "Guessing Parts-of-Speech of Unknown Words Using Global Information" ppt
... (604) (J) [0.0000] 788 [0. 087 2] 936 RWC 0.7699 (2572) 0.7 785 (2476) 0.7 787 (2474) (J) [0.0000] 5044 [0.0000] 587 8 GEN 0 .88 36 (905) 0 .88 37 (904) 0 .88 63 (88 4) (E) [1.0000] 4094 [0.0244] ... 0.6 785 (89 30) (C) [0.0000] 16019 [0.0000] 188 61 EDR 0.9639 (87 4) 0.9643 (86 3) 0.9651 (84 4) (J) [0.1775] 4903 [0.0034] 7770 KUC 0.7501 (619) 0.7634 ( 586 ) 0.7562 (604) (J) [0.0000] 788 ... clues for guessing the part -of- speech of the ambiguous one, because unknown words with the same lexical form usually have the same part -of- speech. For another example, there is a part -of- speech...
Ngày tải lên: 23/03/2014, 18:20
Báo cáo khoa học: "Weakly Supervised Part-of-Speech Tagging for Morphologically-Rich, Resource-Scarce Languages" potx
... Institute University of Texas at Dallas Richardson, TX 75 083 -0 688 {saidul,vince}@hlt.utdallas.edu Abstract This paper examines unsupervised ap- proaches to part -of- speech (POS) tagging for morphologically-rich, ... one of some finite number of possible outcomes). For a multinomial with K outcomes, a K-dimensional Dirichlet distribution, which is conjugate to the multinomial, is a natural choice of prior. For ... Similar trends are observed for Lex- icon 2, where BHMM+IS outperforms BHMM and MLHMM by a larger margin of 5–10% and 12–16%, respectively. For Lexicon 3, BHMM+IS outperforms SHMM, the stronger...
Ngày tải lên: 24/03/2014, 03:20
Báo cáo khoa học: "Simultaneous Tokenization and Part-of-Speech Tagging for Arabic without a Morphological Analyzer" doc
Ngày tải lên: 30/03/2014, 21:20