learning from positive and unlabeled data

Scientific report: "A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query" pdf

... method of active learning for WSD with pseudo negative examples, which are selected from unlabeled data by a classifier trained with positive and unlabeled examples. McCallum and Nigam (1998) ... 61–64, Suntec, Singapore, 4 August 2009. © 2009 ACL and AFNLP A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: ... semi-supervised learning using unlabeled examples is effective. The accuracies of with-EM, random and without-EM are gradually increasing according to the percentage of added hand labeled examples and...

Uploaded: 23/03/2014, 17:20

4 441 1
Scientific report: "Boosting Statistical Word Alignment Using Labeled and Unlabeled Data" ppt

... the labeled data and the unlabeled data. Then we build a pseudo reference set for the unlabeled data, and calculate the error rate of each word aligner using only the labeled data. Based ... alignment as a case study. 5.1 Data We have two kinds of training data from general domain: Labeled Data (LD) and Unlabeled Data (UD). The Chinese sentences in the data are automatically segmented ... labeled data and large amounts of unlabeled data. The proposed approach modifies the supervised boosting algorithm to a semi-supervised learning algorithm by incorporating the unlabeled data. ...

Uploaded: 08/03/2014, 02:21

8 451 1
How Do Earnings Change When Reservists Are Activated - A Reconciliation of Estimates Derived from Survey and Administrative Data docx


... at RAND oversaw data management for the project including obtaining and processing military personnel records, writing analysis programs, and facilitating data transfer to and from DMDC and ... discrepancies between survey and administrative data include Goldman and Smith (2001), Denmead and Turek (2005), Hurd and Rohwedder (2006), Kapteyn and Ypma (2007), and Haider and Loughran (2008). ... SECURITY POPULATION AND AGING PUBLIC SAFETY SCIENCE AND TECHNOLOGY SUBSTANCE ABUSE TERRORISM AND HOMELAND SECURITY TRANSPORTATION AND INFRASTRUCTURE WORKFORCE AND WORKPLACE The RAND Corporation...

Uploaded: 23/03/2014, 02:20

74 228 0
Document: Scientific report: "Learning with Unlabeled Data for Text Categorization Using Bootstrapping and Feature Projection Techniques" doc

... classifier with robustness from noisy data (Ko and Seo, 2004). How can labeled training data be automatically created from unlabeled data and title words? Maybe unlabeled data don’t have any information ... A. McCallum, S. Thrun, and T. Mitchell, 1998, Learning to Classify Text from Labeled and Unlabeled Documents, In Proc. of AAAI-98. K. P. Nigam, 2001, Using Unlabeled Data to Improve Text Classification, ... related works, we presented two approaches using unlabeled data in text categorization; one approach combines unlabeled data and labeled data, and the other approach uses the clustering technique...

Uploaded: 20/02/2014, 16:20

8 444 0
Scientific report: "Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz data: Bootstrapping and Evaluation" pot

... (for training RL) from such limited data. The use of WOZ data has earlier been proposed in the context of RL. (Williams and Young, 2004) utilise WOZ data to discover the state and action space ... using data-driven methods. The employed database contains 438 items and is similar in retrieval ambiguity and structure to the one used in the WOZ experiment. The dialogue system used for learning ... FP6 project “TALK: Talk and Look, Tools for Ambient Linguistic Knowledge” (IST 507802, www.talk-project.org), from the EPSRC, project no. EP/E019501/1, and from the IRTG Saarland University. 645 ...

Uploaded: 23/03/2014, 17:20

9 402 0
Statistics The Art and Science of Learning from Data pot


... available for download from Pearson’s online catalog at www.pearsonhighered.com/irc and through MyStatLab. Statistics: The Art and Science of Learning from Data 1 1.1 Using Data to Answer Statistical ... www.pearsonhighered.com 16 Chapter 1 Statistics: The Art and Science of Learning from Data Data Files To make statistical analysis easier, large sets of data are organized in a data file. This file usually has ... Check Sources Not all databases or reported data summaries give reliable information. Before you give credence to such data, verify that the data are from a trustworthy source and that the source...

Uploaded: 28/06/2014, 20:20

834 2,5K 0
Document: Adding, Modifying, and Removing DataRowView Objects from a DataView docx

... Country " + "FROM Customers"; SqlDataAdapter mySqlDataAdapter = new SqlDataAdapter(); mySqlDataAdapter.SelectCommand = mySqlCommand; DataSet myDataSet = new DataSet(); mySqlConnection.Open(); ... System.Data.SqlClient; class AddModifyAndRemoveDataRowViews { public static void DisplayDataRow( DataRow myDataRow, DataTable myDataTable ) { Console.WriteLine("\nIn DisplayDataRow()"); ... ADDMODIFYANDREMOVEDATAROWVIEWS.CS /* AddModifyAndRemoveDataRowViews.cs illustrates how to add, modify, and remove DataRowView objects from a DataView */ using System; using System.Data; ...

Uploaded: 24/12/2013, 01:17

7 368 0
THE DEGREE OF JUDICIAL ENFORCEMENT AND CREDIT MARKETS: EVIDENCE FROM JAPANESE HOUSEHOLD PANEL DATA ppt


... graduated from college (COLLEGE); (8) the logarithm of the sum of the value of financial assets (bank and postal deposits and investment securities) and the value of real assets (land and housing) ... and X are economic and demographic household characteristics that affect loan supply and demand, and E are three dummy variables (1st Enforcement Quartile, 2nd Enforcement Quartile, and ... 125, 150, and 112 in 2003, 2004, 2005, 2006, and 2007, respectively. There are three advantages to using data from the JPSC. The biggest advantage of using the JPSC data is that this data set...

Uploaded: 22/03/2014, 20:20

31 429 0
Visual Event Recognition in Videos by Learning from Web Data ppt


... RECOGNITION IN VIDEOS BY LEARNING FROM WEB DATA 1679 Visual Event Recognition in Videos by Learning from Web Data Lixin Duan, Dong Xu, Member, IEEE, Ivor Wai-Hung Tsang, and Jiebo Luo, Fellow, ... samples) from the target domain and auxiliary domain are used to calculate h in (6). Note that all test samples are used as unlabeled data during the learning process. Table 3 reports the means and ... observe that the absolute values of  1 and  2 are always DUAN ET AL.: VISUAL EVENT RECOGNITION IN VIDEOS BY LEARNING FROM WEB DATA 1675 TABLE 3 Means and Standard Deviations (Percent) of MAPs over...

Uploaded: 23/03/2014, 13:20

14 510 0
Scientific report: "Attacking Parsing Bottlenecks with Unlabeled Data and Relevant Factorizations" pdf

... 84.3 dpo3 (Grand+Sib) 93.21 44.8 89.6 86.9 dpo3 +Unlabeled (Edges) 93.12 43.6 85.3 87.0 dpo3 +Unlabeled (Sib) 93.15 43.7 85.5 86.8 dpo3 +Unlabeled (Grand) 93.55 46.1 90.6 87.5 dpo3 +Unlabeled (Grand+Sib) ... may not add some additional gains. 4 Using Unlabeled Data Effectively Associations from unlabeled data have the potential to improve both conjunctions and prepositions. We predict that web counts ... the data representation and the learning representation both capture relevant properties of prepositions and conjunctions. We predict that Conversion 2 and a factorization which includes grand-parent...

Uploaded: 23/03/2014, 14:20

9 277 0
Civilization and Beyond Learning From Hist doc


... and use of land were shared with the temples and with those members of the nobility closest to the ruling monarch. Hence there were state lands and state income and temple lands and temple income. ... animals and working the land, could release a comparatively large part of the population to devote its time and energy to trade and commerce, to industry and transport, to the arts and sciences and ... by merchants and bankers who owned it and used it for their purposes. Accumulating wealth and money enabled the traders, merchants, bankers and manufacturers to out-buy and out-point landlords and churchmen. Politically,...

Uploaded: 23/03/2014, 17:20

146 343 0
Scientific report: "Semi-Supervised Sequential Labeling and Segmentation using Giga-word Scale Unlabeled Data" pdf

... 1G-word unlabeled data 93.66 89.36 37M-word unlabeled data (Ando and Zhang, 2005) 93.15 89.31 27M-word unlabeled data (Florian et al., 2003) 93.87 88.76 own large gazetteers, 2M-word labeled data (Suzuki ... PTB III data evaluated by label accuracy system test additional resources JESS-CM (CRF/HMM) 95.15 1G-word unlabeled data 94.67 15M-word unlabeled data (Ando and Zhang, 2005) 94.39 15M-word unlabeled ... Results for POS tagging (PTB III data), syntactic chunking (CoNLL’00 data), and NER (CoNLL’03 data) incorporated with 1G-words of unlabeled data, and the performance gain from supervised CRF ...

Uploaded: 23/03/2014, 17:20

9 274 0
Scientific report: "Learning from evolving data streams: online triage of bug reports" potx

... collect data from several open issue trackers, use the minimal amount of simple preprocessing and filter heuristics to get useful input data, and publicly share both the raw and preprocessed data. We ... situation. 4.1 Progressive validation When learning from data streams the standard evaluation methodology where data is split into a separate training and test set is not applicable. An evaluation ... stream for exploratory data analysis and feature and parameter tuning, and then use progressive validation to evaluate on entirely unseen test data. Below we specify the size and number of unique...

Uploaded: 24/03/2014, 03:20

10 432 0
Learning from My Mother’s Voice Family Legend and the Chinese American Experience pptx


... that unfold in the oral history. The themes of promise and obligation, loss and abandonment guilt, poverty and survival, ritual and sacrifice, and pride and respect, spoken through the voice of a Chinese ... culture and the connections with family and community promote resiliency and happiness. At the same time, the obligations of family prescribed by culture and guilt over abandoning family and culture ... are abandoned and shamed. These stories mimic Asian values and are written from Western perspectives. Westerners often fail to understand that the latter portrayals victimize Asian women and...

Uploaded: 29/03/2014, 04:20

178 223 0
Medical report: "HIVBrainSeqDB: a database of annotated HIV envelope sequences from brain and other anatomical sites" ppt

... sequence database, assembled and curated sequences, performed all bioinformatic analysis, and drafted the manuscript. MM assembled and curated sequences and clinical data. NO designed and implemented ... Network U01MH083506, R24MH59745, Statistics and Data Coordinating Center U01MH083545, N01MH32002. The funders and NNTC had no role in study design, data analysis, or preparation and submission of the publication. Author ... HIVBrainSeqDB: a database of annotated HIV envelope sequences from brain and other anatomical sites. AIDS Research and Therapy 2010 7:43. Submit your next manuscript to BioMed Central and take full...

Uploaded: 10/08/2014, 05:21

12 394 0