quantitative and qualitative evaluation

Báo cáo khoa học: "Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems" pdf

Báo cáo khoa học: "Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems" pdf

... questions about the user’s travel plans both at the beginning of the dialogue and also after Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems Marilyn A. Walker AT&T ... and that there were four groups of performers with sites 3,2,1,4 in the top group (listed by average user satisfac- tion), sites 4,5,9,6 in a second group, and sites 8 and 7 defining a third and ... standard and tools were used by all sites to collect a set of core metrics for making cross system comparisons. The core metrics were developed during a workshop of the Evaluation Committee and...

Ngày tải lên: 31/03/2014, 04:20

8 319 0
Báo cáo " Language program evaluation: Quantitative or qualitative approach? " pptx

Báo cáo " Language program evaluation: Quantitative or qualitative approach? " pptx

... reality is objective and detached from the observers, and that this reality can be Tạp chí Khoa học ĐHQGHN, Ngoại ngữ 24 (2008) 1-6 1 Language program evaluation: Quantitative or qualitative approach? ... approaches: positivistic /quantitative and naturalistic /qualitative. This article will attempt to review these two major paradigms by (i) giving the definition of each paradigm and presenting its logic ... evaluators want to achieve in the evaluation process. However, evaluators have to rely on either quantitative or qualitative approach which has its own strengths and weaknesses. The researchers...

Ngày tải lên: 28/03/2014, 11:20

6 169 0
Báo cáo khoa học: "You Can’t Beat Frequency (Unless You Use Linguistic Knowledge) – A Qualitative Evaluation of Association Measures for Collocation and Term Extraction" pot

Báo cáo khoa học: "You Can’t Beat Frequency (Unless You Use Linguistic Knowledge) – A Qualitative Evaluation of Association Measures for Collocation and Term Extraction" pot

... for CE (Wermter and Hahn, 2004) and for ATR (Wermter and Hahn, 2005), which have been shown to outperform several of the statistics- only metrics. 3 Methods and Experiments 3.1 Qualitative Criteria Because ... best-performing statistics- only measure for CE (cf. Evert and Krenn (2001) and Krenn and Evert (2001)) and also for ATR (see Wermter and Hahn (2005)). Concerning more recent linguistically grounded AMs, ... conditions. Several studies (e.g., Evert and Krenn (2001), Krenn and Evert (2001), Frantzi et al. (2000), Wermter and Hahn (2004)), however, have al- ready observed that ranking the candidates merely by their...

Ngày tải lên: 31/03/2014, 01:20

8 435 0
Tài liệu Báo cáo khoa học: "Methods for the Qualitative Evaluation of Lexical Association Measures" doc

Tài liệu Báo cáo khoa học: "Methods for the Qualitative Evaluation of Lexical Association Measures" doc

... part- of-speech tags and minimal PPs were identified. 5 The PNV triples were selected automatically such that the preposition and the noun are constituents of the same PP, and the PP and the verb co-occur within ... more susceptible to random variation, which illustrates that evaluation based on a small number of -best candidate pairs cannot be reliable. With respect to the recall curves (Figures 3 and 4), we find: ... log-likelihood, and even precision gained by frequency is better than or at least comparable to log-likelihood. These pairings – log-likelihood and t-test for AdjN, and t-test and frequency for PNV...

Ngày tải lên: 20/02/2014, 18:20

8 516 0
"The Potential of Cellulosic Ethanol Production from Municipal Solid Waste: A Technical and Economic Evaluation" doc

"The Potential of Cellulosic Ethanol Production from Municipal Solid Waste: A Technical and Economic Evaluation" doc

... of xylan and glucan for ADC final reached 79.1% and 88.2%, respectively. The overall yield of xylan and glucan for ADC green was 83.3% and 89.1%, respectively, through pretreatment and enzymatic ... Municipal solid waste: A Technical and Economic Evaluation Jian Shi, Mirvat Ebrik, Bin Yang*, and Charles E. Wyman Center for Environmental Research and Technology Bourns College of Engineering ... transportation fuels and chemicals because of its abundance, the need to find uses for this problematic waste, and its low and perhaps negative cost. However, significant heterogeneity and possible...

Ngày tải lên: 09/03/2014, 00:20

41 554 0
Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Introduction to Data Mining pptx

Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Introduction to Data Mining pptx

... it – Class counts in each of the partitions, A < v and A ≥ v Simple method to choose best v – For each v, scan the database to gather count matrix and compute its Gini index – Computationally Inefficient! ... or random coil Categorizing news stories as finance, weather, entertainment, sports, etc © Tan,Steinbach, Kumar Introduction to Data Mining 47 Stopping Criteria for Tree Induction Stop expanding ... attribute, – Sort the attribute on values – Linearly scan these values, each time updating the count matrix and computing gini index – Choose the split position that has the least gini index Cheat No No...

Ngày tải lên: 15/03/2014, 09:20

101 4,3K 1
Báo cáo khoa học: "Correlation between ROUGE and Human Evaluation of Extractive Meeting Summaries" pptx

Báo cáo khoa học: "Correlation between ROUGE and Human Evaluation of Extractive Meeting Summaries" pptx

... ICASSP. X. Zhu and G. Penn. 2005. Evaluation of sentence selection for speech summarization. In ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/ or Summariza- tion. X. Zhu and G. ... Infor- mative Coverage (IC): S2 and S9; Informative Relevance (IRV): S3 and S8; and Informative Redundancy (IRD): S4 and S7. 4 Results 4.1 Correlation between Human Evaluation and Original ROUGE Score Similar ... R-SU4 and human evaluation. 5 Conclusion and Future Work In this paper, we have made a first attempt to system- atically investigate the correlation of automatic ROUGE scores with human evaluation...

Ngày tải lên: 17/03/2014, 02:20

4 293 0
Báo cáo khoa học: "From Single to Multi-document Summarization: A Prototype System and its Evaluation" pptx

Báo cáo khoa học: "From Single to Multi-document Summarization: A Prototype System and its Evaluation" pptx

... Japan. DUC and TSC both aim to compile standard training and test collections that can be shared among researchers and to provide common and large scale evaluations in single and multiple ... 2000 and 2001. However, the area is still being fleshed out: most past efforts have focused only on single-document summarization (Mani 2000), and no standard test sets and large scale evaluations ... between most and all, cohesion, some and most, and coherence, some and most. This indicates the strategies employed by NeATS (stigma word filtering, adding lead sentence, and time annotation)...

Ngày tải lên: 17/03/2014, 08:20

8 288 0
Báo cáo khoa học: "Unsupervised Discovery of Generic Relationships Using Pattern Clusters and its Evaluation by Automatically Generated SAT Analogy Questions" pot

Báo cáo khoa học: "Unsupervised Discovery of Generic Relationships Using Pattern Clusters and its Evaluation by Automatically Generated SAT Analogy Questions" pot

... we demonstrate in our bilingual evaluation. 2.3 Evaluation Method Evaluation for hypernymy and synonymy usually uses WordNet (Lin and Pantel, 2002; Widdows and Dorow, 2002; Davidov and Rappoport, 2006). ... meronymy (Berland and Charniak, 1999; Girju et al., 2006), synonymy (Widdows and Dorow, 2002; Davidov and Rap- poport, 2006), and verb strength + verb happens- before (Chklovski and Pantel, 2004). ... (Davidov and Rappoport, 2006; Widdows and Dorow, 2002) and meronymy (Berland and Charniak, 1999; Girju et al., 2006). Since named entities are very important in NLP, many studies define and discover...

Ngày tải lên: 23/03/2014, 17:20

9 390 0
Báo cáo khoa học: "Correlating Human and Automatic Evaluation of a German Surface Realiser" doc

Báo cáo khoa học: "Correlating Human and Automatic Evaluation of a German Surface Realiser" doc

... these are. Belz and Reiter (2006) and Reiter and Belz (2009) describe com- parison experiments between the automatic eval- uation of system output and human (expert and non-expert) evaluation of ... 0.03686 Table 4: Correlation between dependency-based evaluation and human judgements the parses of the original strings. We calculate both a weighted and unweighted dependency f- score, as given in ... Short Papers, pages 97–100, Suntec, Singapore, 4 August 2009. c 2009 ACL and AFNLP Correlating Human and Automatic Evaluation of a German Surface Realiser Aoife Cahill Institut f ¨ ur Maschinelle...

Ngày tải lên: 23/03/2014, 17:20

4 285 0
STOCK ASSESSMENT AND FISHERY EVALUATION REPORT FOR THE GROUNDFISH FISHERIES OF THE GULF OF ALASKA AND BERING SEA/ALEUTIAN ISLANDS AREA: ECONOMIC STATUS OF THE GROUNDFISH FISHERIES OFF ALASKA, 2008 potx

STOCK ASSESSMENT AND FISHERY EVALUATION REPORT FOR THE GROUNDFISH FISHERIES OF THE GULF OF ALASKA AND BERING SEA/ALEUTIAN ISLANDS AREA: ECONOMIC STATUS OF THE GROUNDFISH FISHERIES OFF ALASKA, 2008 potx

... mortality are not available. Tables 7 and 8, and 9 and 10, respectively, provide estimates of discarded catch and discard rates by species, area, gear, and target fishery. Within each area or ... 1.7 and 2.2 million t (Fig. 1 and Table 1). The rapid displacement of the foreign and joint-venture fisheries by the domestic fishery between 1984 and 1991 can be seen by comparing Figures 1 and ... Status NPFMC Economic SAFE STOCK ASSESSMENT AND FISHERY EVALUATION REPORT FOR THE GROUNDFISH FISHERIES OF THE GULF OF ALASKA AND BERING SEA/ALEUTIAN ISLANDS AREA: ECONOMIC STATUS OF THE GROUNDFISH...

Ngày tải lên: 23/03/2014, 21:20

276 668 0
Báo cáo khoa học: "Comparing Automatic and Human Evaluation of NLG Systems" potx

Báo cáo khoa học: "Comparing Automatic and Human Evaluation of NLG Systems" potx

... Riezler and J. T. Maxwell III. 2005. On som e pit- falls in automatic evaluation and significance testing for MT. In Proc. ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/ or ... system, and then compared the generated texts to the original corpus texts. Similar evaluations have been used e.g. by Banga- lore et al. (2000) and Marciniak and Strube (2004). Such corpus-based evaluations ... (non-repeating column and row entries) experimental design where each combination of date and system is assigned one evaluation. 4 Results Table 2 shows evaluation scores for the five NLG systems and the corpus...

Ngày tải lên: 24/03/2014, 03:20

8 376 0
DELIVERING HEALTH EDUCATION VIA THE WEB: DESIGN AND FORMATIVE EVALUATION OF A DISCOURSE-BASED LEARNING ENVIRONMENT pot

DELIVERING HEALTH EDUCATION VIA THE WEB: DESIGN AND FORMATIVE EVALUATION OF A DISCOURSE-BASED LEARNING ENVIRONMENT pot

... protocols and tools were developed to collect quantitative and qualitative data. Pre and post tests related to HIV/AIDS and nutrition will allow for quantitative comparison of knowledge, attitude and ... scheduled lecture and tutorial hours; (3) opportunities for a variety of learning activities including small group discussion and collaborative projects; and (4) exposure to and a forum for expressing and ... Web-based and classroom learning environments; the design and development of a prototype Web environment to facilitate these learning activities; and, the formative evaluation of learning activities and...

Ngày tải lên: 28/03/2014, 21:20

12 411 0
self-similar network traffic and performance evaluation.

self-similar network traffic and performance evaluation.

... network xi 20 Toward an Improved Understanding of Network Traf®c Dynamics 507 R. H. Riedi and Walter Willinger 21 Future Directions and Open Problems in Performance Evaluation and Control of Self-Similar ... Heavy Tails and Heavy Traf®c 143 O. J. Boxma and J. W. Cohen 7 Fluid Queues, OnaOff Processes, and Teletraf®c Modeling with Highly Variable and Correlated Inputs 171 Sidney Resnick and Gennady ... make a short detour and discuss self-similar processes in slightly more generality. Further extensions and detailed treatments can be found in Beran [9] and Samorodnitsky and Taggu [60]. Consider...

Ngày tải lên: 01/06/2014, 10:53

574 1,5K 0
báo cáo hóa học: " Development and preliminary evaluation of a quality of life measure targeted at dementia caregivers" doc

báo cáo hóa học: " Development and preliminary evaluation of a quality of life measure targeted at dementia caregivers" doc

... emotional and social concerns, and spirituality and benefits were identified. Conclusion: These preliminary results support subsequent evaluation of test-retest reliability, construct validity, and ... family involvement, caregiving demands, worry, spirituality and faith, benefits of caregiving, caregiver feelings, and role limitations due to caregiving [19]. Spirituality and faith, and benefits of caregiving ... family involvement, demands of caregiving, caregiver worry, and caregiver feelings (See Table 5). Less patient education and non-white caregiver ethnicity were associated with higher spirituality and faith and...

Ngày tải lên: 18/06/2014, 18:20

12 741 0

Bạn có muốn tìm thêm với từ khóa:

w