Scientific report: "Social Network Extraction from Texts: A Thesis Proposal" doc
... overall goal of my thesis is to build a system that automatically extracts a social network from raw texts such as literary texts, emails, blog comments and news articles. I take a “social network” ... to extract social networks from a wide range of textual data. At the same time I will be able to empirically analyze the types of linguistic patterns, both lexical and synta...
Uploaded: 30/03/2014, 21:20
... Social Media Analytics. Kalervo Järvelin and Jaana Kekäläinen. 2002. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems, 20(4):422–446. John Lafferty ... keyphrase extraction. Mihalcea and Tarau (2004) proposed to use TextRank, a modified PageRank algorithm to extract keyphrases. Based on the study by Mihalcea and Tarau (2004), Liu et al....
Uploaded: 17/03/2014, 00:20
... representative. 2.5 Data visualisation The results obtained are displayed as a weighted list in HTML format. Such lists, also named “heat maps” or “tag clouds” when they describe tags, usually represent ... corpora, in English and in French. 2 Description of the method 2.1 Extraction of words The system takes as input a corpus of texts. Paragraphs written in another language than the...
Uploaded: 17/03/2014, 22:20
Document: Scientific report: Aldehydes release zinc from proteins. A pathway from oxidative stress/lipid peroxidation to cellular functions of zinc pptx
... Kitagawa K, Matsuno K, Matsumoto A, Yoshida A, Nakayama K, Nakayama K & Kawamoto T (2002) Diminished alcohol preference in transgenic mice lacking aldehyde dehydrogenase activity. Pharmacogenetics ... Roman J, Gimenez A, Lluis JM, Gasso M, Rubio M, Caballeria J, Pares A, Rodes J & Fernandez-Checa JC (2000) Enhanced DNA binding and activation of transcription factors NF-kappa...
Uploaded: 19/02/2014, 06:20
Document: Scientific report: "Acquiring Lexical Generalizations from Corpora: A Case Study for Diathesis Alternations" pdf
... which diathesis alternations are empirically attested in corpus data. Using the dative and benefactive alternations as a test case we attempt to determine: (a) if some alternations are more ... acquire alternating verbs from large balanced corpora by using partial-parsing methods and taxonomic information, and discuss how corpus data can be used to quantify linguistic genera...
Uploaded: 20/02/2014, 19:20
Scientific report: "Neural Network Probability Estimation for Broad Coverage Parsing" doc
... appropriate way. 1 Introduction Many statistical parsers (Ratnaparkhi, 1999; Collins, 1999; Charniak, 2001) are based on a history-based probability model (Black et al., 1993), where the probability ... structurally specified and linguistically appropriate biases on the search for a good history representation. The resulting parser achieves performance far greater than previous approaches...
Uploaded: 24/03/2014, 03:20
Document: Scientific report: "Three BioNLP Tools Powered by a Biological Lexicon" doc
... automatically extracted verb subcategorization frames Yutaka Sasaki (1), Paul Thompson (1), John McNaught (1, 2), Sophia Ananiadou (1, 2); (1) School of Computer Science, University of Manchester ... …” IL/NP IL-2/NN-BIOMED -/- 2/CD mediated/VVD IL-2-mediated/UNKNOWN IL/NP 2/CD IL-2/NN-BIOMED BioLexicon mediated/VVD mediate/VVP mediate/VV of/IN mediated/VVN -/- -/- mediated/VVN d...
Uploaded: 22/02/2014, 02:20
Scientific report: "Anaphor resolution in unrestricted texts with partial parsing" doc
... anaphora in unrestricted texts. These kinds of anaphora are pronominal references, surface-count anaphora and one-anaphora. In order to solve these anaphors we work on the output of a part-of-speech ... accuracy. Our framework will allow us a similar approach to that of Kennedy and Boguraev (1996), but we will automatically get syntactic information from partial parsing. Moreo...
Uploaded: 08/03/2014, 05:21
Scientific report: "Cross Language Dependency Parsing using a Bilingual Lexicon∗" docx
... recent years. Typical domain adaptation tasks often assume annotated data in new domain absent or insufficient and a large scale unlabeled data available. As unlabeled data are concerned, semi-supervised ... essentially different from that. As Chinese is basically a character-based written language. Character plays an important role in many means, most characters can be formed as single-...
Uploaded: 17/03/2014, 01:20
Scientific report: "Free Indexation: Combinatorial Analysis and A Compositional Algorithm*" doc
... Indexation: Combinatorial Analysis and A Compositional Algorithm* Sandiway Fong 545 Technology Square, Rm. NE43-810, MIT Artificial Intelligence Laboratory, Cambridge MA 02139 Internet: sandiway@ai.mit.edu ... course, a compositional algorithm can also be used in the non-interleaved case. Basically, the algorithm works by maintaining a set of indices at each sub-phrase of a...
Uploaded: 17/03/2014, 20:20