Báo cáo khoa học: TICL – a web tool for network-based interpretation of compound lists inferred by high-throughput metabolomics doc
... Currently, several bioinformatics approaches are available for metabolomics. Each approach was developed to solve different practical problems related to the analysis of metabolomics data [5,2 7–3 0]. Most of ... present TICL, a web tool for the automatic interpretation of lists of compounds. The major advance of TICL is that it not only provides a model of...
Ngày tải lên: 16/03/2014, 01:20
... amount of data that can reasonably be annotated by hand. Leacock et al. (1998), Agirre and Lopez de La- calle (2004), and Mihalcea and Moldovan (1999) propose a set of methods for automatic harvesting of ... parts of a compound are not usually separated by blank spaces or hy- phens, German compounding poses a particular challenge for target word identification. Another...
Ngày tải lên: 22/02/2014, 03:20
... Daisuke Kawahara † Yoshikiyo Kato † Tetsuji Nakagawa † Kentaro Inui † Sadao Kurohashi †‡ Yutaka Kidawara † † National Institute of Information and Communications Technology ‡ Graduate School ... Data Infrastructure We usually utilize 100 million Japanese Web pages as the analysis target. The Web pages have been converted into the standard formatted Web data, an XML format...
Ngày tải lên: 17/03/2014, 02:20
Tài liệu Báo cáo khoa học: "Archivus: A multimodal system for multimedia meeting browsing and retrieval" doc
... browsing and retrieval Marita Ailomaa, Miroslav Melichar, Martin Rajman Artificial Intelligence Laboratory ´ Ecole Polytechnique F ´ ed ´ erale de Lausanne CH-1015 Lausanne, Switzerland marita.ailomaa@epfl.ch Agnes ... other – more familiar – modality for a sizeable portion of the experiment. In order to gather a useful amount of natural language data, greater care has to be...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Outilex, a Linguistic Platform for Text Processing" pdf
... con- verters are included in the platform. The grammar formalism allows for the combination of statis- tical approaches with resource-based approaches. Manually constructed lexicons of substantial cov- erage ... general lexicon proposing a large set of analyses for stan- dard language. The user can, for a specific appli- cation, enrich it by means of complementary lexi- con...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "TOWARDS A CORE VOCABULARY FOR SYSTEM A NATURAL LANGUAGE" potx
... seen as a long-range research prt~gram rather than as a short-term goal. Motiva!ion Rcasearch on natural language processing sys- tems today strives for the construction of robust and portable ... a core vocabulary which is needed for handl- ing any subject domain. 'llfis assumpti(m is also shared by many researchers, and it tmdcrlies the production of basic vocab...
Ngày tải lên: 22/02/2014, 10:20
Báo cáo khoa học: "Creating a Gold Standard for Sentence Clustering in Multi-Document Summarization" potx
... same paragraph are clustered together whereas our approach is to find similar information be- tween documents. A gold standard for event identification was built by Naughton (2007). Ten annotators ... Hatzivas- siloglou et al. (2001) created a set of 10.535 man- ually marked pairs of paragraphs. Two human an- notator were asked to judge if the paragraphs con- tained ’common informat...
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "Bootstrapping a Stochastic Transducer for Arabic-English Transliteration Extraction" pdf
... 2002) or (AbdulJaleel and Larkey, 2003) require a large set of sample transliterations to use for training. If such a training set is unavailable for a particular language pair, a detection algorithm ... transliterations of the units are assessed manually from a set of training pairs. For each katakana string in a bitext, all possible translitera- tions are produced ba...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "TotalRecall: A Bilingual Concordance for Computer Assisted Translation and Language Learning" potx
... record additional information, including the source of each sentence pairs, metadata, and the information on phrase and word level alignment. With that ad- ditional information, TotalRecall provides ... places, and events in Taiwan for the past three decade. The concordance database is composed of bi- lingual sentence pairs, which are mutual transla- tion. In addition, there are...
Ngày tải lên: 17/03/2014, 06:20
Báo cáo khoa học: Apo a-lactalbumin and lysozyme are colocalized in their subsequently formed spherical supramolecular assembly doc
... bovine whey as reported by Caussin et al. [31]. Apo a- lactalbumin (apo a- LA) was prepared by dialysis of a solution of holo a- LA against deionized water at pH 3 during 48 h at 4 °C using a 6–8 000 Da nominal ... entity containing a molecule of LYS and a molecule of apo a- LA. The occurrence of a heterodimer form between lysozyme and a- lactalbumin at neutral...
Ngày tải lên: 23/03/2014, 07:20