Báo cáo khoa học: "High-Performance Bilingual Text Alignment Using Statistical and Dictionary Information" pptx

Báo cáo khoa học: "High-Performance Bilingual Text Alignment Using Statistical and Dictionary Information" pptx

Báo cáo khoa học: "High-Performance Bilingual Text Alignment Using Statistical and Dictionary Information" pptx

... High-Performance Bilingual Text Alignment Using Statistical and Dictionary Information Masahiko Haruno Takefumi Yamazaki NTT Communication ... sentences in bilingual texts and on sta- tistically acquired word correspondences. The texts for the experiment varied in length and genres as summarized in Table 2. Texts 1 and 2 are editorials ... Sentence Alignment Tabl...
Ngày tải lên : 31/03/2014, 06:20
  • 8
  • 270
  • 0
Báo cáo khoa học: "a new text alignment architecture" pot

Báo cáo khoa học: "a new text alignment architecture" pot

... an- notated corpus as input 2 . The output of the text alignment system consists of the corpus alignment information and a bilingual dictionary. During the alignment process, hypotheses on translation ... generation paragraph alignment strategies sentence alignment strategies word alignment strategies phrase alignment strategies further alignment strategies alignment...
Ngày tải lên : 23/03/2014, 18:20
  • 8
  • 313
  • 0
Báo cáo khoa học: "Automatically Evaluating Text Coherence Using Discourse Relations" docx

Báo cáo khoa học: "Automatically Evaluating Text Coherence Using Discourse Relations" docx

... Language and Informa- tion, 16:445–464, October. Mirella Lapata and Regina Barzilay. 2005. Automatic evaluation of text coherence: Models and representa- tions. In Leslie Pack Kaelbling and Alessandro ... (Barzilay and Lapata, 2005) and (Elsner et al., 2007). In this task, the system is asked to decide which of two texts is more coherent. The pair of texts consists of a source...
Ngày tải lên : 23/03/2014, 16:20
  • 10
  • 292
  • 0
Tài liệu Báo cáo khoa học: "Chinese Word Segmentation without Using Lexicon and Hand-crafted Training Data" pdf

Tài liệu Báo cáo khoa học: "Chinese Word Segmentation without Using Lexicon and Hand-crafted Training Data" pdf

... segmenting Chinese texts without making use of any lexicon and hand-crafted linguistic resource. The statistical data required by the algorithm, that is, mutual information and the difference ... developed so far, both statistical and rule-based, exploited two kinds of important resources, i.e., lexicon and hand-crafted linguistic resources(manually segmented and tagged c...
Ngày tải lên : 20/02/2014, 18:20
  • 7
  • 396
  • 0
Tài liệu Báo cáo khoa học: "Guiding an HPSG Parser using Semantic and Pragmatic Expectations" pdf

Tài liệu Báo cáo khoa học: "Guiding an HPSG Parser using Semantic and Pragmatic Expectations" pdf

... by The Ohio State Center for Cognitive Science and The Ohio State Departments of Computer and Information Science and Linguistics grammar (using compiled knowledge) which is then used to realize ... language generation has been successfully demonstrated using highly compiled knowledge about speech acts and their related social actions. A design and prototype implementation...
Ngày tải lên : 20/02/2014, 21:20
  • 3
  • 379
  • 0
Báo cáo khoa học: Functional classification of scaffold proteins and related molecules pptx

Báo cáo khoa học: Functional classification of scaffold proteins and related molecules pptx

... external and internal environment. Because of the multiplicity and broad substrate specificity of signaling enzymes, it is of immense importance to understand how the cell achieves efficiency and accuracy ... activating, coordinating and regulating signaling events in regulatory networks. Here we discuss the categories of scaffolds, anchors, docking proteins and adaptors in some detai...
Ngày tải lên : 06/03/2014, 22:21
  • 8
  • 431
  • 0
Báo cáo khoa học: "WORD-SENSE DISAMBIGUATION METHODS USING STATISTICAL" pot

Báo cáo khoa học: "WORD-SENSE DISAMBIGUATION METHODS USING STATISTICAL" pot

... of w by using the flip-flop algo- rithm devised by Nadas, Nahamoo, Picheny, and Poweli [Nadas et aL, 1991]. To under- stand their algorithm, first imagine that w is a French word and that ... because take and decision no longer fall within a single trigram. Errors such as this are common because the statistical models only capture local phe- nomena; if the context neces...
Ngày tải lên : 08/03/2014, 07:20
  • 7
  • 351
  • 0
Báo cáo khoa học: Cytokine properties of prokineticins Justin Monnier and Michel Samson pptx

Báo cáo khoa học: Cytokine properties of prokineticins Justin Monnier and Michel Samson pptx

... com- pared the expression of PROK1 and PROK2 in monocytes and monocyte-derived macrophages, in undifferentiated and differentiated THP1 cells, and in undifferentiated and differentiated U937 cells. ... K d (nm) values for PROK1 and PROK2 binding to PROKR1 are 12.3 ± 4.2 and 1.4 ± 0.5, respectively, and the K d (nm) values for PROK1 and PROK2 binding to PROKR2 are 1.8 ± 0.1...
Ngày tải lên : 23/03/2014, 07:20
  • 8
  • 439
  • 0
Báo cáo khoa học: "Language Independent Authorship Attribution using Character Level Language Models" pptx

Báo cáo khoa học: "Language Independent Authorship Attribution using Character Level Language Models" pptx

... Alexander Hamilton and James Madison (Holmes and Forsyth, 1995). Recently, vast repos- itories of electronic text have become available on the Internet, making the problem of managing large text ... any test corpus, and therefore some mechanism for assigning non-zero probability to novel n-grams is a central and un- avoidable issue in statistical language modeling. One standard...
Ngày tải lên : 24/03/2014, 03:20
  • 8
  • 286
  • 0
Báo cáo khoa học: "An Implementation of Combined Partial Parser and Morphosyntactic Disambiguator" pptx

Báo cáo khoa học: "An Implementation of Combined Partial Parser and Morphosyntactic Disambiguator" pptx

... the left context and the adjective and the noun specified in the right context (cf. unify(case,1,4,5)), as well as case agreement (possibly of a different case) between the adjective and noun in ... the rules is a tokenised and morphosyntactically annotated XML text. The output contains disambiguation an- notation and two new levels of constructions: syn- tactic words and syntacti...
Ngày tải lên : 31/03/2014, 01:20
  • 6
  • 319
  • 0

Xem thêm