Tài liệu Báo cáo khoa học: "Joint Word Segmentation and POS Tagging using a Single Perceptron" docx

Tài liệu Báo cáo khoa học: "Joint Word Segmentation and POS Tagging using a Single Perceptron" docx

Tài liệu Báo cáo khoa học: "Joint Word Segmentation and POS Tagging using a Single Perceptron" docx

... used as training and test data for development. The standard F-scores are used to measure both the word segmentation accuracy and the overall seg- mentation and tagging accuracy, where the overall accuracy ... character c 6 space-separated characters c 1 and c 2 7 character bigram c 1 c 2 in any word 8 the first / last characters c 1 / c 2 of any word 9 word w immediately b...

Ngày tải lên: 20/02/2014, 09:20

9 576 0
Tài liệu Báo cáo khoa học: "Unsupervized Word Segmentation: the case for Mandarin Chinese" doc

Tài liệu Báo cáo khoa học: "Unsupervized Word Segmentation: the case for Mandarin Chinese" doc

... hypoth- esis, (Jin and Tanaka-Ishii, 2006) propose a system that segments when BE is rising or when it reach a certain maximum. The main drawback of Jin and Tanaka-Ishii (2006) model is that segmentation ... expect a de- creasing BE and look for a less decreasing value (or on the contrary, rising at least to some extent). A threshold of 0 can be seen as a default value. Fi...

Ngày tải lên: 19/02/2014, 19:20

5 467 1
Tài liệu Báo cáo khoa học: "Chinese Word Segmentation without Using Lexicon and Hand-crafted Training Data" pdf

Tài liệu Báo cáo khoa học: "Chinese Word Segmentation without Using Lexicon and Hand-crafted Training Data" pdf

... mark (x:y) 'separated' else '?' 1.6 Bb ifdts(x:y) is local maximum then mark (x:y) 'bound' else '9' if (dts(x:y) is local minimum) and (a( ats(x:y)) ... > 81 then mark (x:y) 'bound' else '?' ifdts(x:y) is local minimum then if d(dts(x.'y)) > ~2 then mark (x:y) 'separated' else '?' 1.4...

Ngày tải lên: 20/02/2014, 18:20

7 396 0
Tài liệu Báo cáo khoa học: ATP-dependent modulation and autophosphorylation of rapeseed 2-Cys peroxiredoxin docx

Tài liệu Báo cáo khoa học: ATP-dependent modulation and autophosphorylation of rapeseed 2-Cys peroxiredoxin docx

... vector] as 5¢-primer and 5¢-TCTCCGTAGG GGAGACAAAAGT-3¢,5¢-ATCCCGCGGGGGAAACCT CATC-3¢ and 5¢-CTGTTTGGAC GAACGCAAGATG-3¢ as 3¢-primers for C53S, C175S and W88F variants, respec- tively (mutated codons ... such as reductants and ROS. An imbalance between these dual functions is probably associated with many human pathologies, such as thyroid tumors, breast and lung cancer, Alzheimer’s...

Ngày tải lên: 18/02/2014, 17:20

14 460 0
Tài liệu Báo cáo khoa học: "Sense-based Interpretation of Logical Metonymy Using a Statistical Method" pdf

Tài liệu Báo cáo khoa học: "Sense-based Interpretation of Logical Metonymy Using a Statistical Method" pdf

... kappa (Fleiss, 1971) and f-measure com- puted pairwise and then averaged across the an- notators. The agreement in group 1 was 0.76 (f-measure) and 0.56 (kappa); in group 2 0.68 (f-measure) and ... representation of the interpretation of logical metonymy and a more thorough evaluation method than that of Lapata and Lascarides (2003). By carrying out a human experiment we prove...

Ngày tải lên: 20/02/2014, 09:20

9 429 0
Tài liệu Báo cáo khoa học: "Hand-held Scanner and Translation Software for non-Native Readers" docx

Tài liệu Báo cáo khoa học: "Hand-held Scanner and Translation Software for non-Native Readers" docx

... off-line) material in foreign languages. It consists of a hand-held scanner and sophisticated parsing and translation software to provide readers a limited number of translations selected on the basis of a ... which can be as short as one word or as long as a whole sentence or even a whole paragraph. • The text appears in the user interface of the TwicPen system and is immed...

Ngày tải lên: 20/02/2014, 12:20

4 339 0
Tài liệu Báo cáo khoa học: "Matching Readers’ Preferences and Reading Skills with Appropriate Web Texts" docx

Tài liệu Báo cáo khoa học: "Matching Readers’ Preferences and Reading Skills with Appropriate Web Texts" docx

... (Collins-Thompson and Callan, 2004) is based on web data that have been annotated and indexed off-line. Also, relatedly, (Schwarm and Ostendorf, 2005) use a statistical language model to train SVM classifiers ... system that performs in real time a) keyword search, b)thematic classification and c)analysis of reading difficulty. Search results and analyses are returned within a few...

Ngày tải lên: 22/02/2014, 02:20

4 330 0
Tài liệu Báo cáo khoa học: "Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT" pdf

Tài liệu Báo cáo khoa học: "Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT" pdf

... (NIPS’10), Vancouver, Canada. Andreas Zollmann and Khalil Sima’an. 2005. A consis- tent and efficient estimator for data-oriented parsing. Journal of Automata, Languages and Combinatorics, 10(2/3):367–388. 21 Proceedings ... Gunn, M. Nikravesh, and L. Zadeh, ed- itors, Feature Extraction: Foundations and Applica- tions. Springer. Percy Liang, Alexandre Bouchard-C ˆ ot ´ e, Dan Klei...

Ngày tải lên: 19/02/2014, 19:20

11 549 0
Tài liệu Báo cáo khoa học: "Improving Word Representations via Global Context and Multiple Word Prototypes" pdf

Tài liệu Báo cáo khoa học: "Improving Word Representations via Global Context and Multiple Word Prototypes" pdf

... src="data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAA9gAAAFXCAIAAADWMXECAAAACXBIWXMAABYlAAAWJQFJUiTwAAAgAElEQVR42uzdeVxTV/438EPCkrAEBJe4AIorErSKCLhCI0WqI0IVFRWwLm1RnHZ0purgr1Z5nNrq6EBLW6tF24KKLYp1wYVCXQBBUEkQ17gAiooQgpCw3OT5485cLwEpVSABPu8Xf5h4c3O4uSSfe/I95xhoNBoCAAAAAADti4NDAAAAAACAIA4AAAAAgCAOAAAAAAAI4gAAAAAACOIAAAAAAIAgDgAAAACAIA4AAAAAAAjiAAAAAAAI4gAAAAAAgCAOAAAAAIAgDgAAAACAIA4AAAAAAAjiAAAA...

Ngày tải lên: 19/02/2014, 19:20

10 494 0
Tài liệu Báo cáo khoa học: "Enhanced word decomposition by calibrating the decision threshold of probabilistic models and using a model ensemble" pdf

Tài liệu Báo cáo khoa học: "Enhanced word decomposition by calibrating the decision threshold of probabilistic models and using a model ensemble" pdf

... boundaries as hidden variables and include probabilities for let- ter transitions within segments. The ad- vantage of this model family is that it can learn from small datasets and easily gen- eralises ... 2006). They used a natural language tagger which was trained on the output of ParaMor and Morfes- sor. The goal was to mimic each algorithm since ParaMor is rule-based and there i...

Ngày tải lên: 20/02/2014, 04:20

9 558 0
w