Báo cáo khoa học: "A Hybrid Approach to Word Segmentation and POS Tagging" doc
... we study a hybrid method for Chi- nese and Japanese word segmentation and POS tag- ging, in which word- based and character-based pro- cessing is combined, and word segmentation and POS tagging ... In the method, POS tagging of unknown words is conducted at the same time as word segmentation and POS tag- 217 Figure 1: Word Segmentation and Known Word...
Ngày tải lên: 31/03/2014, 01:20
... Systems and Technologies Corporation 10 Moulton St. CambHdge, MA 02138 Abstract In BBN's natural language understanding and generation system (Janus), we have used a hybrid approach to ... patterns (1), 198 (2), and (3) and select for, on. and of as prepositions. 7 The information acquired through KNACQ is used both by the understanding components and by BBN&ap...
Ngày tải lên: 21/02/2014, 20:20
... sequence of POS tags. The joint approach to word segmentation and POS tagging has been reported to improve word seg- mentation and POS tagging accuracies by more than 1% in Chinese (Zhang and Clark, ... q −1 and q −2 respectively denote the last-shifted word and the word shifted before q −1 . q.w and q.t respectively denote the (root) word form and POS...
Ngày tải lên: 07/03/2014, 18:20
Tài liệu Báo cáo khoa học: "A Bootstrapping Approach to Named Entity Classification Using Successive Learners" pdf
... the repository to train a decision list for NE classification. 3. The learned rules are applied to the NE candidates stored in the repository. 4. The proper names tagged in Step 3 and their ... 86.7% To benchmark the quality of the automatically constructed corpus (Table 2), the testing corpus is first processed by our parser and then saved into the repository. The reposit...
Ngày tải lên: 20/02/2014, 16:20
Tài liệu Báo cáo khoa học: "A PRAGMATICBASED APPROACH TO UNDERSTANDING" pdf
... text, and attend- Ing class will all reside at the same focus level within the expanded plan for earning credit in a course. The action of going to the cashler's office to pay one's ... proceding to answer the ques- tion or to seek information relevant to formulat- ing an answer. However IS may refuse to accept the question posed by IP because he does not under...
Ngày tải lên: 21/02/2014, 20:20
Tài liệu Báo cáo khoa học: "A PROBABILISTIC APPROACH TO GRAMMATICAL ANALYSIS OF WRITTEN ENGLISH BY COMPUTER" pot
... word token of sample text, (3) the word tag for the word and (~) a field of hypertags and brackets showing the constituency-level status of each word token. Any amendments to the rules and ... the word tagged corpus marked as a sentence is given a root hypertag, 'S'. Between 'S' and the word tag level of analysis, all constituents perceived by...
Ngày tải lên: 22/02/2014, 09:20
Báo cáo khoa học: A kinetic approach to the dependence of dissimilatory metal reduction by Shewanella oneidensis MR-1 on the outer membrane cytochromes c OmcA and OmcB potx
... Mn(IV) in MR-1 is thought to be composed of cytochromes and a qui- none, located in both the cytoplasmic membrane (CymA and menaquinone) and the outer membrane (OmcB, and a partial role for OmcA) ... these cytochromes have been proposed to be ter- minal Fe(III) and Mn(IV) reductases, although their role in the reduction of other metals is less well understood. To obtain more i...
Ngày tải lên: 07/03/2014, 09:20
Báo cáo khoa học: "A new Approach to Improving Multilingual Summarization using a Genetic Algorithm" pptx
... Buckley. 1997. Automatic text structuring and summarization. In- formation Processing and Management, 33(2):193– 207. C. N. Satoshi, S. Satoshi, M. Murata, K. Uchimoto, M. Utiyama, and H. Isahara. ... languages to estimate the size of the Web as of the end of January 2005. 927 word segmentation. We have evaluated our approach on two mono- lingual corpora of English and Hebrew...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "A Deductive Approach to Dependency Parsing∗" potx
... used to decide whether a step linking words a and b (i.e., having a → b as a side condition) is executed or not, and probabilities can be attached to items in order to assign different weights to ... for the algorithm to parse sen- tences correctly, we will need to define D-rules to allow w 0 to be linked to the real sentence head. 3.3 ES99 (Eisner and Satta, 99) Eisner...
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "A Bootstrapping Approach to Unsupervised Detection of Cue Phrase Variants" docx
... concept-A accumulator list which has not been used as an active element be- fore. Repeat steps 1-3 for k iterations Output: top M words of concept-A (verb) accumulator list and top N words of concept-B ... shows that clusters of vector-space- based patterns can be successfully employed to detect specific IE relationships (companies and their headquarters), and Ravichandran and Hovy’s...
Ngày tải lên: 08/03/2014, 02:21