Báo cáo khoa học: "Linear Text Segmentation using a Dynamic Programming Algorithm" potx
... Linear Text Segmentation using a Dynamic Programming Algorithm Athanasios Kehagias Dept. of Math., Phys. and Comp. Sciences Aristotle Univ of Thessaloniki GREECE kehagias@egnatia.ee.auth.gr Fragkou ... (Heinonen, 1998) and Utiyama and Isahara (Utiyama and Isa- hara, 2001). Finally, other researchers use probabilistic ap- proaches to text segmentation including the use of hidd...
Ngày tải lên: 31/03/2014, 20:20
... Lexical Chain Features 4.1 Chain starts and ends We follow (Chan et al. 2007) to model the lexi- cal chain starts and ends at a story boundary with a statistical distribution. We apply a window ... consideration and statistically modeled. 2 Experimental Setup Experiments are conducted using data from the TDT-2 Voice of America Mandarin broadcast. In particular, we only use the d...
Ngày tải lên: 23/03/2014, 17:20
... Using Random Walks Ahmed Hassan University of Michigan Ann Arbor Ann Arbor, Michigan, USA hassanam@umich.edu Dragomir Radev University of Michigan Ann Arbor Ann Arbor, Michigan, USA radev@umich.edu Abstract Automatically ... USA radev@umich.edu Abstract Automatically identifying the polarity of words is a very important task in Natural Language Processing. It has applications in text cl...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "BASED TEXT SEGMENTATION ON SIMILARITY BETWEEN WORDS" pdf
... 3, has large hills and valleys, and also meaningless noise. The graph is so complicated that one can not easily deternfine which valley should be considered as a segment boundary. The shape ... anaphora and ellipsis. One of the constituents of the text struc- ture is a text segment. A text segment, whether or not it is explicitly marked, as are sentences and paragraphs, i...
Ngày tải lên: 20/02/2014, 21:20
Báo cáo khoa học: "Chinese Term Extraction Using Different Types of Relevance" potx
... a good authority is a term candidate that is contained in many good hubs. In TV_HITS, a node p can either be a sentence or a term candidate. If a term candidate TC is contained in a sentence ... relevance between term candidates are iteratively calculated by graphs using link analysis algorithm to avoid the dependency on prior domain knowledge. The rest of the paper is...
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "Multilingual Text Processing in a Two-Byte Code" pdf
... German and Scandinavian alphabets do alphabetize their digraphs just like a sequence, s__ plus h etc. But these national alphabets are not typical. Spanish, Hungarian, Polish, Croatian and Albanian ... variations of a single letter (as a, _~, A, A~ , all variants will automatically be alphabetized the ~ame way, which is as it should be. The choice of variant forms is specified b...
Ngày tải lên: 08/03/2014, 18:20
Báo cáo khoa học: "Finding Word Substitutions Using a Distributional Similarity Baseline and Immediate Context Overlap" potx
... distributional similarity baseline (a sub- set of Wikipedia) in an attempt to show that a good semantic parse and adequate filtering can provide reasonable performance even on domains where data is sparse. ... the head ‘rescue’, lemma:rescue arg:ARG2 var:bank which indicates that ‘bank’ is object of the head ‘rescue’, and lemma:failing arg:ARG1 var:bank which indicates that the argument of...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "Accurate Collocation Extraction Using a Multilingual Parser" docx
... Spanish and Italian (sec- tion 3). Then we present in sections 4 and 5 a com- parative evaluation experiment proving that a hy- brid approach leads to more accurate results than a classical approach ... speakers. 1.b) play role: What role can Canada’s immigration program play in help- ing developing nations ? 1.c) make mistake: We could look back and probably see a lot of mistakes tha...
Ngày tải lên: 17/03/2014, 04:20
Báo cáo khoa học: "Semantic Discourse Segmentation and Labeling for Route Instructions" potx
... detec- tion and tracking, and one segment of the text. In order to annotate unambiguously, we need to detect and track both landmarks and actions. A landmark is a hallway or a door, and an action is a ... 7 Table 1: Example Parts: linear-chain CRFs involves exactly one landmark, we can label the segment with an action and a specific landmark. For example, GHR1 := ”advance to the fi...
Ngày tải lên: 17/03/2014, 04:20
Báo cáo khoa học: "Fast Semantic Extraction Using a Novel Neural Network Architecture" docx
... is labeled for each particular verb as so-called frames. Addition- ally, semantic roles can also be labeled with one of 13 ARGM adjunct labels, such as ARGM-LOC or ARGM-TMP for additional locational ... solves a multi-class prob- lem using a one-vs-the-rest approach. The final sys- tem, called ASSERT, gives state-of-the-art perfor- mance and is also freely available at: http:// oak.color...
Ngày tải lên: 17/03/2014, 04:20