0

automatic evaluation of text coherence models and representations

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

Báo cáo khoa học

... section, we present the evaluations of ROUGE-L, ROUGE-S, and compare their per-formance with other automatic evaluation meas-ures. 5 Evaluations One of the goals of developing automatic evalua-tion ... (Stem)Table 1. Pearson’s ρ and Spearman’s ρ correlations of automatic evaluation measures vs. adequacy and fluency: BLEU1, 4, and 12 are BLEU with maximum of 1, 4, and 12 grams, NIST is the NIST ... cognate candi-dates during construction of N-best translation lexicons from parallel text. Melamed (1995) used the ratio (LCSR) between the length of the LCS of two words and the length of the...
  • 8
  • 442
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation" doc

Báo cáo khoa học

... IntroductionResearch and development on automatic and man-ual evaluation of summarization systems have beenmainly focused on content coverage (Lin and Hovy,2003; Nenkova and Passonneau, 2004; ... researchon automatic evaluation of summary readability, the Text Analysis Conference (TAC) (Owczarzak and Dang, 2011) introduced a new subtask on readabilityto its Automatically Evaluating Summaries of ... 1006–1014,Jeju, Republic of Korea, 8-14 July 2012.c2012 Association for Computational LinguisticsCombining Coherence Models and Machine Translation Evaluation Metricsfor Summarization Evaluation Ziheng...
  • 9
  • 351
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Correlating Human and Automatic Evaluation of a German Surface Realiser" doc

Báo cáo khoa học

... are. Belz and Reiter(2006) and Reiter and Belz (2009) describe com-parison experiments between the automatic eval-uation of system output and human (expert and non-expert) evaluation of the same ... involvesstring comparisons between the output of the sys-tem and some gold standard set of strings. Typi-cally automatic metrics from the fields of MachineTranslation (e.g. BLEU) or Summarisation ... paper99Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pages 97–100,Suntec, Singapore, 4 August 2009.c2009 ACL and AFNLPCorrelating Human and Automatic Evaluation of a German SurfaceRealiserAoife...
  • 4
  • 285
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf

Báo cáo khoa học

... spelling, good grammar, rhythm and flow,appropriateness of tone, and several other specificcharacteristics of good text. In terms of automatic evaluation, we are not aware of any technique that measures ... choosing the parsers and themetrics derived from them; generating some textsfor human and parser evaluation; and, the key part,getting human judgements on these texts and corre-lating them ... faithfulness and to fluency. In addition,the need for reference texts for an evaluation metriccan be problematic, and intuitively seems unneces-sary for characterising an aspect of text quality...
  • 8
  • 507
  • 0
Tài liệu Báo cáo khoa học: The isolation and characterization of cytochrome c nitrite reductase subunits (NrfA and NrfH) from Desulfovibrio desulfuricans ATCC 27774 Re-evaluation of the spectroscopic data and redox properties ppt

Tài liệu Báo cáo khoa học: The isolation and characterization of cytochrome c nitrite reductase subunits (NrfA and NrfH) from Desulfovibrio desulfuricans ATCC 27774 Re-evaluation of the spectroscopic data and redox properties ppt

Báo cáo khoa học

... intenseband of 61 kDa (NrfA) and a band of weak intensity of 19 kDa (NrfH), confirming its hetero-oligomeric nature(Fig. 1, lane 1).However, in the absence of boiling (Fig. 1A, lanes 2 and 4) ... and 4) high molecular mass bands of approximately 110 kDa and > 200 kDa were visible, as well as a faint band at37 kDa, suggesting the presence of dimers. All of the bandsstained positively ... B.H., Scheidt, R. & Osvath, S.R. (1986) Models of the cytochromes b6. The effect of axial ligand planeorientation on the EPR and Mo¨ssbauer spectra of low-spin fer-rihemes. J. Am. Chem. Soc....
  • 12
  • 593
  • 0
PET in the Evaluation of Alzheimer’s Disease and Related Disorders docx

PET in the Evaluation of Alzheimer’s Disease and Related Disorders docx

Sức khỏe giới tính

... and my family—my wife Wei, our kids, our parents Donna and Robert and Pei and Robert, and my sibs Anne, Beth, and Mikhael, whose contributions of friendship, love, understanding of my professional ... Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CASung-Cheng Huang, D.Sc.Professor, Department of Molecular and Medical ... Section, Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, CAxiii1 Clinical Evaluation of Dementia and When to Perform...
  • 231
  • 535
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Evaluation of Linguistic Quality in Multi-Document Summarization" pptx

Báo cáo khoa học

... compression. Artificial Intelligence,139(1):91–107.M. Lapata and R. Barzilay. 2005. Automatic evalua-tion of text coherence: Models and representations. In International Joint Conference On Artificial ... entity coherence, sentence fluency and lan-guage models are the most powerful classes of fea-tures that should be used in automation of evalu-ation and against which novel predictors of text quality ... Predicting the fluency of text with shallow structural features: case studies of machine translation and human-written text. InProceedings of EACL, pages 139–147.E. Charniak and M. Elsner. 2009....
  • 11
  • 407
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Evaluation of Chinese Translation Output: Word-Level or Character-Level" doc

Báo cáo khoa học

... Chinese Translation Evaluation Automatic MT evaluation aims at formulating au-tomatic metrics to measure the quality of MT out-put. Compared with human assessment, automatic evaluation metrics ... Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/ or ... I. Dan Melamed, Ryan Green and Joseph P. Turian, 2003. Precision and Recall of Machine Translation. Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational...
  • 6
  • 344
  • 1
Báo cáo khoa học:

Báo cáo khoa học: "QARLA:A Framework for the Evaluation of Text Summarization Systems" pdf

Báo cáo khoa học

... QARLA, for the evaluation of text summarisation systems. The in-put of the framework is a set of man-ual (reference) summaries, a set of base-line (automatic) summaries and a set of similarity ... Metrics.In Proceedings of MT Summit IX, New Orleans,LA.Luke Shen Joseph P. Turian and I. Dan Melamed.2003. Evaluation of Machine Translation and its Evaluation. In In Proceedings of MT Summit IX,New ... discussion6.1 Application of similarity metrics toevaluate summariesBoth in Text Summarisation and Machine Trans-lation, the automatic evaluation of systems con-sists of computing some similarity...
  • 10
  • 517
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "a Method for Automatic Evaluation of Machine Translation" pot

Báo cáo khoa học

... the BLEU score and the monolingual group. Of particular interest is the accuracy of BLEU’s esti-mate of the small difference between S2 and S3 and the larger difference between S3 and H1. The figurealso ... and monitored by SPAWAR under contractNo. N66001-99-2-8916. The views and findingscontained in this material are those of the authors and do not necessarily reflect the position of pol-icy of ... Proceedings of theEagles Workshop on Standards and Evaluation, Pisa,Italy.Kishore Papineni, Salim Roukos, Todd Ward, John Hen-derson, and Florence Reeder. 2002. Corpus-basedcomprehensive and diagnostic...
  • 8
  • 336
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Collecting a Why-question corpus for development and evaluation of an automatic QA-system" pdf

Báo cáo khoa học

... the evaluation: manual evaluation is adifficult, time-consuming process and not ap-plicable within efficient development of sys-tems. Automatic evaluation requires a cor-pus of questions and ... problem of several possible answers and, inconsequence, automatic evaluation has been tackledfor years within another field of study: automatic summarisation (Hori et al., 2003; Lin and Hovy,2003). ... selection of natural questions. Thearticles varied in topic, degree of formality and theamount of details; from ”Horror film” and ”Christ-mas worldwide” to ”G-Man (Half-Life)” and ”His-tory of London”....
  • 9
  • 610
  • 1
Báo cáo khoa học:

Báo cáo khoa học: "Task-oriented Evaluation of Syntactic Parsers and Their Representations" potx

Báo cáo khoa học

... will illustrate the ad-vantages and disadvantages of these parsers and rep-resentations, leading us to better parsing models and a better design for parse representations. 4.4 Comparison with ... dependency parsers (Mc-Donald and Pereira, 2006; Nivre and Nilsson, 2005;Sagae and Tsujii, 2007) and deep parsers (Kaplanet al., 2004; Clark and Curran, 2004; Miyao and Tsujii, 2008). However, ... Dependency-based evaluation of MINI-PAR. In LREC Workshop on the Evaluation of ParsingSystems.M. Marcus, B. Santorini, and M. A. Marcinkiewicz.1994. Building a large annotated corpus of En-glish:...
  • 9
  • 483
  • 0

Xem thêm