0

automatic compilation of travel information from automatically identified travel blogs

Báo cáo khoa học:

Báo cáo khoa học: "Automatic Compilation of Travel Information from Automatically Identified Travel Blogs" doc

Báo cáo khoa học

... travel information from them. In the identification of travel blogs, we obtained of 38.1% for Recall and 86.7% for Precision. In the extraction of travel information from travel blogs, we obtained ... 2009.c2009 ACL and AFNLP Automatic Compilation of Travel Information from Automatically Identified Travel Blogs Hidetsugu anba Graduate School of Information Sciences, Hiroshima City University ... extraction of travel information from travel blogs, we obtained 74.0% for Precision at the top 100 extracted local products, thereby confirming that travel blogs are a useful source of travel information. ...
  • 4
  • 307
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "AN ASSESSMENT EXTRACTED OF SEMANTIC INFORMATION FROM MACHINE READABLE AUTOMATICALLY DICTIONARIES" pptx

Báo cáo khoa học

... derived from any one of the d~tionaries alone. 5. CONCLUSION The results of our study show that dictionaries can be a reliable source of automatically extracted semantic information. Merging information ... improve automatically ex tracted hierarchies. One of the most promising strategies for refining extracted information is the Use of information from several dictionaries. Hierarchies derived from ... whether information automatically extracted from dictionaries is sufficiently complete and coherent to be actually usable in NLP systems. Although there is concern over the quality of automatically...
  • 6
  • 333
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Báo cáo khoa học

... (20%) out of 210 terms were col-lected by the system. This low recall primarilycomes from the failure of automatic term recogni-tion (case A in the above classification). Improve-ment of this ... term of the original seed term byhand. The result is shown in the left half (EvaluationI) of Table 2. In this evaluation, 519 terms out of 610terms were correct: the precision is 85%. From ... development)情報処理学会 (Information ProcessingSociety of Japan; IPSJ)√√意味処理 (semantic processing)√√音声処理 (speech processing)√音声情報処理 (speech information pro-cessing)√√情報処理 (information processing)自然言語処理分野...
  • 4
  • 437
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc

Báo cáo khoa học

... frequencies. The distributions of such clusters can be modeled automatically and the models used for identifying false positives. The second requirement for automatically generating a full-scale ... architecture of the system, and that of this pa- per, directly reflects the three challenges described above. The system consists of three modules: 1. Verb detection: Finds some occurrences of verbs ... preposition. Then he measures the mutual information between oc- currences of the verb and occurrences of infinitives following within a certain number of words. Unlike our system, Church's...
  • 6
  • 416
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Acquisition of Adjectival Subcategorization from Corpora" docx

Báo cáo khoa học

... benefit from information about predicate-argument struc-ture (e.g. Information Extraction (IE) (Surdeanu etal., 2003)).The first systems capable of automatically learn-ing a small number of verbal ... enhancing the performance of ∗Part of this research was conducted while this author wasat the University of Edinburgh Laboratory for Foundations of Computer Science.state -of- art statistical systems ... Proceedings of the 43rd Annual Meeting of the ACL, pages 614–621,Ann Arbor, June 2005.c2005 Association for Computational Linguistics Automatic Acquisition of Adjectival Subcategorization from CorporaJeremy...
  • 8
  • 390
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Practical Solution to the Problem of Automatic Part-of-Speech Induction from Text" pdf

Báo cáo khoa học

... prob-lem of automatic word sense induction. Proceedings of ACL (Companion Volume), Barcelona, 195-198. Schütze, Hinrich (1993). Part -of- speech induction from scratch. Proceedings of ACL, Columbus, ... assignment of the ambiguous words to clusters is not required at this stage, as this is taken care of in the next step. This step involves computing the differential vector of each word from the ... Class-based n-gram models of natural language. Computa-tional Linguistics 18(4), 467-479. Clark, Alexander (2003). Combining distributional and morphological information for part of speech induc-tion....
  • 4
  • 433
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Identification of Word Translations from Unrelated English and German Corpora" pot

Báo cáo khoa học

... in terms of corpus frequencies: kl~ = frequency of common occurrence of word A and word B kl2 = corpus frequency of word A - kll k21 = corpus frequency of word B - kll k22 = size of corpus ... accuracy of our system we counted the number of times where an acceptable translation of the source word is ranked first. This was true for 72 of the 100 test words, which gives us an accuracy of ... more often than expected by chance in a corpus of English, then the German translations of teacher and school, Lehrer and Schule, should also co-occur more often than expected in a corpus of...
  • 8
  • 438
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Coreference Resolution Using Semantic Relatedness Information from Automatically Discovered Patterns" pptx

Báo cáo khoa học

... satisfy one or severalpattern features. Lastly, from the point of view of machine learning, using only one semantic feature,instead of hundreds of pattern features, can avoidoverfitting and thus ... Semantic Relatedness Information from Automatically Discovered PatternsXiaofeng Yang Jian SuInstitute for Infocomm Research21 Heng Mui Keng Terrace, Singapore, 119613{xiaofengy,sujian}@i2r.a-star.edu.sgAbstractSemantic ... iseliminated from the reference pattern set. The re-maining patterns are sorted as normal, from whichthe top 100 patterns are selected as features.531Proceedings of the 45th Annual Meeting of the...
  • 8
  • 271
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

Báo cáo khoa học

... polarity of wordsThere are some works that discuss learning the po-larity of words instead of sentences.Hatzivassiloglou and McKeown proposed amethod of learning the polarity of adjectives from corpus ... subjective adjectives from a set of seed adjectives. The idea is to automatically identify the synonyms of the seed and to add themto the seed adjectives (Wiebe, 2000). Riloff etal. proposed ... of reviews are notavailable. In addition, the corpus created from re-views is often noisy as we discuss in Section 2.This paper proposes a novel method of buildingpolarity-tagged corpus from...
  • 8
  • 409
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx

Báo cáo khoa học

... PatternSingular“a(x) x is made up of ” NPQTis made up of NP’C“a(x) x is made of NPQTis made of NP’C“a(x) x comprises” NPQTcomprises (of) ? NP’C“a(x) x consists of NPQTconsists of NP’CPlural“p(x) ... NP’CPlural“p(x) are made up of ” NPQTis made up of NP’C“p(x) are made of NPQTare made of NP’C“p(x) comprise” NPQTcomprise (of) ? NP’C“p(x) consist of NPQTconsist of NP’CTable 2: Clues ... a fixednumber of basic components”, ”data mining com-prises a range of data analysis techniques”, ”booksconsist of a series of dots”, or ”a conversation ismade up of a series of observable...
  • 8
  • 378
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web" pot

Báo cáo khoa học

... Automatic Acquisition of Named Entity Tagged Corpus from World WideWebJoohui AnDept. of CSEPOSTECHPohang, Korea 790-784minnie@postech.ac.krSeungwoo LeeDept. of CSEPOSTECHPohang, ... taggedcorpusFigure 1: Automatic generation of NE tagged corpus from the websiderations in this marking process because of theword ambiguity and boundary ambiguity of NE in-stances. To overcome ... different processes: sep-aration of functional words, segmentation of com-pound nouns, and verification of the usefulness of the extracted sentences.An NE is often concatenated with more than...
  • 4
  • 397
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic construction of a hypernym-labeled noun hierarchy from text" docx

Báo cáo khoa học

... up of multiple words, rather than just using the head nouns of the noun phrases. 124 Automatic construction of a hypernym-labeled noun hierarchy from text Sharon A. Caraballo Dept. of Computer ... cluster of cities that because of sparse data was assigned a poor hypernym. Some of the suggestions in the .following sec- tion might correct this problem. Of the 50 noise words, a few of them ... shown that automatic methods can be used in building semantic lexicons. This work goes a step further by automatically creating not just clusters of related words, but a hierarchy of nouns...
  • 7
  • 418
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Generation of Information-seeking Questions Using Concept Clusters" ppt

Báo cáo khoa học

... selecttopics from a set of relevant questions from YahooAnswers.None of the above methods consider the con-texts of the list of answers in the documents re-turned by QA systems. The topic of a good information- seeking ... limits of the above methods, wepropose a concept clusters method and choose thelabels of the clusters as topics.Recent research on automatically extractingconcepts and clusters of words from ... LiDepartment of Computer ScienceUniversity of York, YO10 5DD, UKsgli@cs.york.ac.ukSuresh ManandharDepartment of Computer ScienceUniversity of York, YO10 5DD, UKsuresh@cs.york.ac.ukAbstractOne of...
  • 4
  • 424
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "AUTOMATIC ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA" doc

Báo cáo khoa học

... rates automatically, and this technique or some similar form of automatic optimization could prof- itably be incorporated into my system. RESULTS The program acquired a dictionary of 4900 ... be obtained from text corpora, the only research that I am aware of that has dealt directly with the problem of the automatic acquisition of subcategorization frames is a series of papers by ... many of the uses of verbs in a text are captured by our subcate- gorization dictionary. For two randomly selected pieces of text from other parts of the New York Times newswire, a portion of...
  • 8
  • 342
  • 0

Xem thêm