all words in unrestricted text

Tài liệu Báo cáo khoa học: "SenseLearner: Word Sense Disambiguation for All Words in Unrestricted Text" doc

Tài liệu Báo cáo khoa học: "SenseLearner: Word Sense Disambiguation for All Words in Unrestricted Text" doc

... words previously seen in the training corpus, and therefore their overall coverage is not 100%. Starting with an annotated corpus consisting of all annotated files in SemCor, a separate training ... Min- imally supervised word sense disambiguation for all words in open text. In Proceedings of ACL/SIGLEX Senseval-3, Barcelona, Spain, July. R. Mihalcea and D. Moldovan. 2002. Pattern learning and ... advantage of providing larger coverage. In this paper, we present a method for solving the semantic ambiguity of all content words in a text. The algorithm can be thought of as a minimally supervised word...

Ngày tải lên: 20/02/2014, 15:20

4 400 0
Báo cáo khoa học: "Anaphor resolution in unrestricted texts with partial parsing" doc

Báo cáo khoa học: "Anaphor resolution in unrestricted texts with partial parsing" doc

... which includes all the coordinated noun phrases (in this case John and Bill). We will detect the coordination of noun phrases from the SS returned by the SUG fact coordinated. In one- 4 In ... principles of reasoning with uncertainty: e.g. Connoly (1994) and Mitkov (1997). Our system can be included into the first approach. In these integrated approaches the semantic and domain ... worked on different texts (Spanish texts). apply a partial parsing and we deal with other kinds of anaphors. As a future aim we will include semantic information in our algorithm in order to check...

Ngày tải lên: 08/03/2014, 05:21

7 274 0
Báo cáo khoa học: "A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization" potx

Báo cáo khoa học: "A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization" potx

... Vector Space Model (VSM) in text information processing, document indexing (term extraction) acts as a pre-requisite step in most text information proc- essing tasks such as Information Retrieval ... Intelligent Text Processing (CI- CLing 2003), 602-614. Yuan Liu, Nanyuan Liang. 1986. Basic Engineering for Chinese Processing – Contemporary Chinese Words Frequency Count, Journal of Chinese In- formation ... Tsinghua University, Beijing 100084, China lijingyang@gmail.com sms@tsinghua.edu.cn kevinn9@gmail.com Abstract Words and character-bigrams are both used as features in Chinese text...

Ngày tải lên: 08/03/2014, 02:21

8 493 0
1000 Most Common Words in English - Numbers Vocabulary

1000 Most Common Words in English - Numbers Vocabulary

... minute 492 tire 391 appear 442 strong 493 bring 392 road 443 special 494 yes 393 map 444 mind 495 distant 394 rain 445 behind 496 fill 395 rule 446 clear 497 east 396 govern 447 tail 498 paint 397 pull 448 produce 499 language 398 cold 449 fact 500 among 399 notice 450 street 400 voice 451 inch 401 unit 452 multiply 402 power 453 nothing 403 town 454 course 404 fine 455 stay 405 certain 456 wheel 406 fly 457 full 407 ... tire 391 appear 442 strong 493 bring 392 road 443 special 494 yes 393 map 444 mind 495 distant 394 rain 445 behind 496 fill 395 rule 446 clear 497 east 396 govern 447 tail 498 paint 397 pull 448 produce 499 language 398 cold 449 fact 500 among 399 notice 450 street 400 voice 451 inch 401 unit 452 multiply 402 power 453 nothing 403 town 454 course 404 fine 455 stay 405 certain 456 wheel 406 fly 457 full 407 fall 458 force 408 lead 459 blue 409 cry 460 object 410 dark 461 decide 411 machine 462 surface 412 note 463 deep 413 wait 464 moon 414 plan 465 island 415 figure 466 foot 416 star 467 system 417 box 468 busy 418 noun 469 test 419 field 470 record 420 rest 471 boat 421 correct 472 common 422 able 473 gold 423 pound 474 possible 424 done 475 plane 425 beauty 476 stead 426 ... Word 251 open 297 color 343 told 252 seem 298 face 344 knew 253 together 299 wood 345 pass 254 next 300 main 346 since 255 white 301 enough 347 top 256 children 302 plain 348 whole 257 begin 303 girl 349 king 258 got 304 usual 350 space 259 walk 305 young 351 heard 260 example 306 ready 352 best 261 ease 307 above 353 hour 262 paper 308 ever 354 better 263 group 309 red 355 true...

Ngày tải lên: 05/10/2012, 08:20

10 3,8K 25
Equivalence in the translation of vietnamese cultural words in the book “wandering through vietnamese culture” by huu ngoc

Equivalence in the translation of vietnamese cultural words in the book “wandering through vietnamese culture” by huu ngoc

... the nominal sense of the words is not. For instance, “Lễ xem mặt” is translated into “Looking at the face” as in: “The first would be an introduction ceremony called Lễ xem mặt (Looking at ... environment. All in all, languages tend to have a superordinate but lack hyponyms since each language makes only those distinctions in meaning which seem relevant to its particular environment. In English, ... involving introducing Vietnamese culture to the world, which will bring about the indispensable task of translating Vietnamese authentic words of culture into English. His greatest of all work,...

Ngày tải lên: 07/11/2012, 14:31

42 3,1K 17
some strategies for dealing with unfamiliar words in reading comprehension for students of english faculty, hanoi open university

some strategies for dealing with unfamiliar words in reading comprehension for students of english faculty, hanoi open university

... but in a new context. There are many difficulties made by unfamiliar words in reading comprehension such as: slowing reading speed, interrupting reading process and lack of interest in reading. ... participants in a particular communicative situation”. (Roy Harris in Rethinking Writing, 2000) “Reading is asking questions of printed text. And reading with comprehension becomes a matter of getting ... Considering the sentences and surrounding ones, students can know the meaning of the same words presented. In any cases, a word’s context is important in terms of understanding its meaning and...

Ngày tải lên: 18/02/2014, 23:04

62 972 0
Tài liệu Báo cáo khoa học: "Topological Ordering of Function Words in Hierarchical Phrase-based Translation" pdf

Tài liệu Báo cáo khoa học: "Topological Ordering of Function Words in Hierarchical Phrase-based Translation" pdf

... Y  ) 326 N N = 2048 N = 128 N = 64 N = 2048 331 Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP, pages 324–332, Suntec, Singapore, 2-7 August 2009. c 2009 ACL and AFNLP 1...

Ngày tải lên: 20/02/2014, 07:20

9 472 1
Tài liệu Báo cáo khoa học: "An API for Measuring the Relatedness of Words in Wikipedia" docx

Tài liệu Báo cáo khoa học: "An API for Measuring the Relatedness of Words in Wikipedia" docx

... available in a straightfor- ward manner the large amount of structured text in Wikipedia (e.g. for building a language model), as well as its rich internal link structure (e.g. the links between ... Evaluating WordNet-based measures of semantic distance. Computational Linguistics, 32(1). Finkelstein, L., E. Gabrilovich, Y. Matias, E. Rivlin, Z. Solan, G. Wolfman & E. Ruppin (2002). Placing ... (1995). Using information content to evaluate seman- tic similarity in a taxonomy. In Proc. of IJCAI-95, Vol. 1, pp. 448–453. Seco, N., T. Veale & J. Hayes (2004). An intrinsic information content...

Ngày tải lên: 20/02/2014, 12:20

4 547 1
Tài liệu Báo cáo khoa học: "Finding Predominant Word Senses in Untagged Text" pptx

Tài liệu Báo cáo khoa học: "Finding Predominant Word Senses in Untagged Text" pptx

... 2000. Mapping wordnets using structural information. In Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, Hong Kong. Finding Predominant Word Senses in Untagged Text Diana ... BNC text represents imaginative writing, the remaining 80% being classified as informative. sense according to SemCor. This seems intuitive given our expected relative usage of these senses in modern ... baseline for choosing the predominant sense over all these words ( ) is 32%. Both WordNet similarity measures beat this baseline. The random baseline for ( ) is 24%. Again, the automatic ranking...

Ngày tải lên: 20/02/2014, 16:20

8 371 0
Tài liệu Báo cáo khoa học: "Learning to Recognize Tables in Free Text" pptx

Tài liệu Báo cáo khoa học: "Learning to Recognize Tables in Free Text" pptx

... Cambridge, MA. 450 Line Line Line Line Line Line Line Line Line Line Line Line Line Line Line Line Line Line Line Line Line 1234567890123456789012345678901234567890123456789012345678901234567890 ... Computational Lin- guistics, pages 19-24. Shona Douglas, Matthew Hurst, and David Quinn. 1995. Using natural language processing for iden- tifying and interpreting tables in plain text. In Fourth ... blank lines or delimiters that immediately precede or follow a table within an input text. In this paper, we assume that our input texts are plain texts that do not contain any formatting codes,...

Ngày tải lên: 20/02/2014, 19:20

8 495 0
Báo cáo khoa học: "Using WordNet to Automatically Deduce Relations between Words in Noun-Noun Compounds" docx

Báo cáo khoa học: "Using WordNet to Automatically Deduce Relations between Words in Noun-Noun Compounds" docx

... occurring in their corpus of compounds. For each category pair, they manually examined 20% of the compounds falling under that category pair, paraphrasing the relation between the nouns in that ... Computational Linguistics Using WordNet to Automatically Deduce Relations between Words in Noun-Noun Compounds Fintan J. Costello, School of Computer Science, University College Dublin, Dublin 6, Ireland. fintan.costello@ucd.ie Tony ... disam- biguation algorithm, one aim in constructing this corpus was to examine the relations that exist in naturally occurring noun-noun compounds. This follows from existing research on the relations...

Ngày tải lên: 08/03/2014, 02:21

8 319 0

Bạn có muốn tìm thêm với từ khóa:

w