word selection based feature reduction

Báo cáo khoa học: "Word Clustering and Word Selection based Feature Reduction for MaxEnt based Hindi NER" ppt

Báo cáo khoa học: "Word Clustering and Word Selection based Feature Reduction for MaxEnt based Hindi NER" ppt

... Word Selection It is noted that not all words are equally important in determining the NE category. Some of the words 492 Feature Using Word Features Using Words (I1) Using Words (I2) Using Words (I3) w i , ... 6: F-values for different features in a MaxEnt based Hindi NER with important word based feature reduction [window(−m, +n) refers to the important word or baseline word features corresponding to ... used the clusters for feature reduction. In this paper we propose two feature reduction techniques for Hindi NER based on word cluster- ing and word selection. A number of word similar- ity measures...

Ngày tải lên: 08/03/2014, 01:20

8 444 0
Tài liệu Báo cáo khoa học: "Learning Word Senses With Feature Selection and Order Identification Capabilities" pdf

Tài liệu Báo cáo khoa học: "Learning Word Senses With Feature Selection and Order Identification Capabilities" pdf

... efforts on word sense dis- crimination. In section 5 we will conclude our work and suggest some possible improvements. 2 Learning Procedure 2.1 Feature selection Feature selection for word sense ... better feature selection. Since the sense associated with a word s occur- rence is always determined by some feature words in its contexts, it is reasonable to suppose that the selected features ... validation based feature selection in feature set used by CGD. Then Cluster algorithm was used to group target word s instances using Euclidean distance measure. τ was set as 0.90 in feature subset...

Ngày tải lên: 20/02/2014, 16:20

8 463 0
Tài liệu Báo cáo khoa học: "An Information-Theory-Based Feature Type Analysis for the Modelling of Statistical Parsing" docx

Tài liệu Báo cáo khoa học: "An Information-Theory-Based Feature Type Analysis for the Modelling of Statistical Parsing" docx

... objective feature types Class Feature type PIQ(Y;R) History feature type Y= headword of the parent 2.3253 Y= the first word in the objective word sequence 3.2398Objective feature type Y= the second word ... modify the headword 2.8757 (Y= the first word in the objective word sequence which has the possibility to modify the headword) the exact headword information 3.7333 (Y= the headword of the current ... the headword of the current node (type1), the headword of the parent node (type2), the headword of the grandpa node (type3), the first word in the objective word sequence(type4), the first word...

Ngày tải lên: 20/02/2014, 18:20

8 504 0
 Báo cáo y học: "Comparative study of control selection in a national population -based case-control study: Estimating risk of smoking on cancer deaths in Chinese men"

Báo cáo y học: "Comparative study of control selection in a national population -based case-control study: Estimating risk of smoking on cancer deaths in Chinese men"

... a population -based case-control study to assess the validation of the novel control selection design by comparing the consistency between the new design and a routine control selection design ... data. The sex-matched living spouse control design as an alternative control selection for a nationwide popula- tion -based case-control study is valid and feasible, and can produce highly acceptable ... 33.1%), and other medical disorders (360–389, 680–709, 780–796, 7.9%). The selection of controls in this study was based on three assumptions: (1) the individuals in both control groups had,...

Ngày tải lên: 26/10/2012, 09:48

9 533 1
Tài liệu Báo cáo khoa học: "Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT" pdf

Tài liệu Báo cáo khoa học: "Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT" pdf

... features on the development set. The number of features rises to 4.7 million without feature selection, which iter- atively selects 100,000 features with best  2 norm values across shards. Feature ... of the corresponding feature across tasks/shards. The  1 sum of the  2 norms en- forces a selection among features based on these norms. Consider for example the two 5 -feature, 3- task weight ... shards. We compute the  2 norm of the weights in each feature column, sort features by this value, and keep K features in the model. This feature selection procedure is done after each epoch. Reduced...

Ngày tải lên: 19/02/2014, 19:20

11 549 0
Tài liệu Báo cáo khoa học: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation" doc

Tài liệu Báo cáo khoa học: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation" doc

... the word order in target language. To this end, we propose a simple but effective ranking -based ap- proach to word reordering. The ranking model is automatically derived from the word aligned ... pre-reordering – an approach that re-positions source words to approximate target lan- guage word order as much as possible based on the features from source syntactic parse trees. This is usually ... annotators tend to align function words which might be left un- aligned by automatic word aligner. 5.6 Effect of Ranking Features Here we examine the effect of features for ranking reorder model....

Ngày tải lên: 19/02/2014, 19:20

9 616 0
Tài liệu Báo cáo khoa học: "HITS-based Seed Selection and Stop List Construction for Bootstrapping" doc

Tài liệu Báo cáo khoa học: "HITS-based Seed Selection and Stop List Construction for Bootstrapping" doc

... neighboring con- texts: collocational features and bag-of-words fea- tures. For collocational features, we set a window of three words to the right and left of the target word. 4.2 Evaluation methodology We ... 2008). Each word comes with a number of instances (context sentences) in which the target word occur, and some of these sentences are manually labeled with the cor- rect sense of the target word in ... We lowercased words in the sentence and pre-processed them with the Porter stemmer (Porter, 1980) to get the stems of words. Following (Komachi et al., 2008), we used two types of features extracted...

Ngày tải lên: 20/02/2014, 04:20

7 383 0
Tài liệu Báo cáo khoa học: "A Novel Feature-based Approach to Chinese Entity Relation Extraction" ppt

Tài liệu Báo cáo khoa học: "A Novel Feature-based Approach to Chinese Entity Relation Extraction" ppt

... approaches include feature -based and kernel -based classification. Feature -based approaches transform the context of two entities into a liner vector of carefully selected linguistic features, varying ... context information. 3.1 Classification Features The classification is based on the following four types of features. z Entity Positional Structure Features We define and examine nine finer ... merged into three coarser structures. z Entity Features Entity types and subtypes are concerned. z Entity Context Features These are character -based features. We consider both internal and external...

Ngày tải lên: 20/02/2014, 09:20

4 480 0
Tài liệu Báo cáo khoa học: "A Feature Based Approach to Leveraging Context for Classifying Newsgroup Style Discussion Segments" pptx

Tài liệu Báo cáo khoa học: "A Feature Based Approach to Leveraging Context for Classifying Newsgroup Style Discussion Segments" pptx

... corpus. We also included a feature that indicated the number of words in the segment. Thread Structure Features. The simplest context- oriented feature we can add based on the threaded structure ... single message in our evaluation below. 3 Feature Based Approach In previous text classification research, more atten- tion to the selection of predictive features has been done for text classification ... base features, we began with typical text features ex- tracted from the raw text, including unstemmed uni- grams and punctuation. We did not remove stop words, although we did remove features...

Ngày tải lên: 20/02/2014, 12:20

4 519 0
Asset based approaches to poverty reduction moser 2

Asset based approaches to poverty reduction moser 2

... paper provides a brief introduction to asset -based approaches to poverty reduction in a globalized context.  e aim is to show the added value of asset -based approaches, in terms of both bet- ter ... asset -based ap- proaches, for both better understanding poverty and developing appropriate long-term poverty reduc- tion solutions.  e paper discusses asset -based approaches to poverty reduction ... contributions to the recent Brookings Institution/Ford Foundation Workshop on Asset -based approaches to poverty reduction in a globalized context held in Washington DC on 27–8 June 2006.  e paper...

Ngày tải lên: 21/02/2014, 00:21

41 404 1
Tài liệu Báo cáo khoa học: "AN EXTENDED LR PARSING ALGORITHM FOR GRAMMARS USING FEATURE-BASED SYNTACTIC CATEGORIES " pot

Tài liệu Báo cáo khoa học: "AN EXTENDED LR PARSING ALGORITHM FOR GRAMMARS USING FEATURE-BASED SYNTACTIC CATEGORIES " pot

... hand, in a grammar with feature- based categories, as proposed by most recent syntactic theories, it is no longer the case. 3 Construction of the GOTO Table for Feature -Based Categories: A ... whose feature specification within the depth allowed by the resu'ictor is identical to, or subsumed by, a previous one. In addition to the halting problem, the incorporation of feature -based ... Information -Based Syntax and Semantics VoI.1. CSLI Lecture Notes 13. Stanford: CSLI. Shieber, S. 1985. "Using Restriction to Extend Parsing Algorithms for Complex- Feature -Based Formalisms"...

Ngày tải lên: 22/02/2014, 10:20

6 334 0

Bạn có muốn tìm thêm với từ khóa:

w