Báo cáo khoa học: "Learning Better Data Representation using Inference-Driven Metric Learning" potx

Tài liệu Báo cáo khoa học: "Predicate Argument Structure Analysis using Transformation-based Learning" pdf

Tài liệu Báo cáo khoa học: "Predicate Argument Structure Analysis using Transformation-based Learning" pdf

... ‘location’ types by ourselves. We show the dataset distribution in Table 1. We extracted the BP units and dependencies among these BPs from the dataset using Cabocha, a Japanese dependency parser, ... the dataset. The rest of this paper is organized as follows. Section 2 describes Japanese predicate structure, our graph expression of it, and our improved method. The results of experiments...

Ngày tải lên: 20/02/2014, 04:20

6 496 0
Báo cáo khoa học: "Historical Change in Language Using Monte Carlo Techniques" potx

Báo cáo khoa học: "Historical Change in Language Using Monte Carlo Techniques" potx

... areas. C. STRUCTURE OF THE PROGRAM The components in the system are data tables and dy- namic programs. 13 One of the major data tables contains HISTORICAL CHANGE IN LANGUAGE 69 the sets ... simu- lations of group, language change, has been successfully tested in several computer runs using an extremely simple model of linguistic interaction. (The system, and any model teste...

Ngày tải lên: 07/03/2014, 18:20

16 336 0
Báo cáo khoa học: "Learning Better Data Representation using Inference-Driven Metric Learning" potx

Báo cáo khoa học: "Learning Better Data Representation using Inference-Driven Metric Learning" potx

... July 2010. c 2010 Association for Computational Linguistics Learning Better Data Representation using Inference-Driven Metric Learning Paramveer S. Dhillon CIS Deptt., Univ. of Penn. Philadelphia, ... exploits labeled as well as unlabeled data during metric learning. These methods learn a Mahalanobis dis- tance metric to compute distance between a pair of data instances,...

Ngày tải lên: 30/03/2014, 21:20

5 314 0
Tài liệu Báo cáo khoa học " Towards Better Evaluation of Design Wind Speed of Vietnam " doc

Tài liệu Báo cáo khoa học " Towards Better Evaluation of Design Wind Speed of Vietnam " doc

... VN06 - R100 Data to end - year of 1994 Data to end - year of 2000 20 40 60 80 100 A23 A24 A25 A30 A31 A32 A34 A35 A36 A37 A38 A39 A40 A41 A42 A43 A44 A45 A46 A47 Wind speed (m/s) Station Using ... provided. 2. Choosing and processing wind data of Vietnam Although 75 years data were available for all meteorological stations in Japan but only 40 years data were employed....

Ngày tải lên: 18/02/2014, 13:20

12 574 0
Tài liệu Báo cáo khoa học: "Updating a Name Tagger Using Contemporary Unlabeled Data" ppt

Tài liệu Báo cáo khoa học: "Updating a Name Tagger Using Contemporary Unlabeled Data" ppt

... with older data. First, we studied whether it was better to update the seeds or the unlabeled data; then, we analyzed whether using a smaller amount of current unlabeled data could be better than ... and outperforms updating the labeled data. Furthermore, we will also show that augmenting the unlabeled data with older data in most cases does not re- sult in better performance...

Ngày tải lên: 20/02/2014, 09:20

4 329 0
Tài liệu Báo cáo khoa học: "Chinese Word Segmentation without Using Lexicon and Hand-crafted Training Data" pdf

Tài liệu Báo cáo khoa học: "Chinese Word Segmentation without Using Lexicon and Hand-crafted Training Data" pdf

... Chinese Word Segmentation without Using Lexicon and Hand-crafted Training Data Sun Maosong, Shen Dayang*, Benjamin K Tsou** State Key Laboratory of ... texts without making use of any lexicon and hand-crafted linguistic resource. The statistical data required by the algorithm, that is, mutual information and the difference of t-score between ... corpus annotated is balanced and...

Ngày tải lên: 20/02/2014, 18:20

7 396 0
Tài liệu Báo cáo khoa học: "Untangling Text Data Mining" ppt

Tài liệu Báo cáo khoa học: "Untangling Text Data Mining" ppt

... Non-textual data standard data mining Textual data computational linguistics Finding Nuggets Novel I Non-Novel ? database queries real TDM information retrieval Table 1: A classification of data ... more widely usable information. 5 Text Data Mining as Exploratory Data Analysis Another way to view text data mining is as a process of exploratory data analysis (Tuke...

Ngày tải lên: 20/02/2014, 18:20

8 336 0
Báo cáo khoa học: "An Interactive Machine Translation System with Online Learning" pdf

Báo cáo khoa học: "An Interactive Machine Translation System with Online Learning" pdf

... translation models. The language model is im- plemented using statistical n-gram language mod- els and the translation model is implemented using phrase-based models. The IMT system proposed here ... project (Foster et al., 1997; Langlais et al., 2002). The idea proposed in that work was to embed data driven MT techniques within the interactive translation environment. Fol- lowing the T...

Ngày tải lên: 07/03/2014, 22:20

6 431 0
Báo cáo khoa học: "Boosting Statistical Word Alignment Using Labeled and Unlabeled Data" ppt

Báo cáo khoa học: "Boosting Statistical Word Alignment Using Labeled and Unlabeled Data" ppt

... word aligner by using both the labeled data and the unlabeled data. Then we build a pseudo reference set for the unlabeled data, and calculate the error rate of each word aligner using only ... alignment as a case study. 5.1 Data We have two kinds of training data from general domain: Labeled Data (LD) and Unlabeled Data (UD). The Chinese sentences in the data are...

Ngày tải lên: 08/03/2014, 02:21

8 451 1
w