document classification using a finite mixture model

Scientific report: "Document Classification Using a Finite Mixture Model" pdf

Scientific reports

... Classification Using a Finite Mixture Model Hang Li Kenji Yamanishi C&C Res. Labs., NEC 4-1-1 Miyazaki Miyamae-ku Kawasaki, 216, Japan Email: {lihang,yamanisi}@sbl.cl.nec.co.jp Abstract ... crucial to the accuracy of document classification. Guthrie et al. have devised a way suitable to documentation classification. Suppose that there are two categories c1 = 'tennis' and ... may individually appear in the category only very rarely; polysemy problem: how to determine that a word like 'ball' in a document refers to a 'tennis ball' and not a 'soccer...
  • 9
  • 189
  • 0
Document: Scientific report: "A Finite-State Model of Human Sentence Processing" docx

Scientific reports

... more information in lexical entries and increasing ambiguity so that other ambiguity types also can be disambiguated in a similar way via lexical category disambiguation. This idea has been ... Press, Cambridge, Massachusetts, 1999. S. Narayanan and D. Jurafsky. A Bayesian model predicts human parse preference and reading times in sentence processing. Proceedings of Advances in Neural Information ... friend accepted the man who was very impressed, the tagger showed a repair since it initially preferred a past-participle analysis for accepted and later it had to reanalyze. This is a limitation...
  • 8
  • 446
  • 0
Document: Scientific report: "Lexical transfer using a vector-space model" doc

Scientific reports

... using matrix PRICAI-00, 2000, (to appear). Tanaka H. (1995) Statistical Learning of “Case Frame Tree” for Translating English Verbs, Journal of NLP, 2/3, pp. 49-72, (in Japanese). Yamada, ... generalization (Akiba et al., 1996 and Tanaka, 1995); (2) approaches using structural matching: to obtain transfer rules, several search methods have been proposed for maximal structural matching between ... Laboratories 2-2 Hikaridai, Seika, Soraku Kyoto 619-0288, Japan sumita@slt.atr.co.jp Abstract Building a bilingual dictionary for transfer in a machine translation system is conventionally...
  • 7
  • 654
  • 0
Scientific report: "Automated Whole Sentence Grammar Correction Using a Noisy Channel Model" pptx

Scientific reports

... sentence. 3 Base Model Our noisy channel model consists of two main components, a base language model and a noise model. The base language model is a probabilistic language model which generates an ... the twelfth national conference on Artificial intelligence (vol. 1), AAAI ’94, pages 779–784, Menlo Park, CA, USA. American Association for Artificial Intelligence. Lavie, A. and Agarwal, A. (2007). ... speakers for grammar correction. Using only these data sets, we can train our noisy channel model, as we have shown using a bigram language model, and a wFST for our noise model. We have also...
  • 11
  • 367
  • 0
Scientific report: "Multi-Document Summarization using Sentence-based Topic Models" docx

Scientific reports

... Bayesian sentence-based topic model for summarization by making use of both the term-document and term-sentence associations. An efficient variational Bayesian algorithm is derived for model parameter ... document summarization has found wide-ranging applications in information retrieval and web search. Many multi-document summarization methods have been developed to extract the most important ... measure and latent semantic analysis. In Proceedings of SIGIR. Daniel D. Lee and H. Sebastian Seung. Algorithms for non-negative matrix factorization. In Advances in Neural Information Processing...
  • 4
  • 381
  • 0
Scientific report: "Dictionary Definitions based Homograph Identification using a Generative Hierarchical Model" docx

Scientific reports

... USA, June 2008.c2008 Association for Computational LinguisticsDictionary Definitions based Homograph Identification using a Generative Hierarchical Model Anagha Kulkarni Jamie Callan ... MVN mixture model. The results for the semi-supervised models are non-conclusive. Our post-experimental analysis reveals that the parameter updation process using the unlabeled data has an ... Homo-graphs and Non-homographs. 2.2 Models We formulate the homograph detection process as a generative hierarchical model. Figure 2 provides the plate notation of the graphical model. The la-tent...
  • 4
  • 282
  • 0
Chemistry report: "Research Article: Using a State-Space Model and Location Analysis to Infer Time-Delayed Regulatory Networks" potx

Chemistry - Oil and Gas

... data. The tool has been demonstrated on artificial data and yeast cell-cycle gene-expression data. Using the yeast microarray data, we have illustrated that our model can help identify regulatory ... resampling with replacement as many as 200 times. This approach, however, has a severe limitation for application to microarray data because most currently available time-course microarray data ... pages doi:10.1155/2009/484601 Research Article Using a State-Space Model and Location Analysis to Infer Time-Delayed Regulatory Networks Chushin Koh,1 Fang-Xiang Wu,2,3 Gopalan Selvaraj,4 and Anthony J. Kusalik1,...
  • 14
  • 545
  • 0
Document: STRATEGIC PLANNING FOR PROJECT MANAGEMENT USING A PROJECT MANAGEMENT MATURITY MODEL ppt

Project management

... companies. What senior managers needed was a plan expressed in terms of three broad, critical success factors: qualitative factors, organizational factors, and quantitative factors. To take advantage ... company as a whole ORGANIZATIONAL FACTORS Coordination of organizational behavior in project management is a delicate balancing act, something like sitting on a bar stool. Bar stools usually come ... [Figure labels: Environmental Opportunities and Threats; Organizational Strengths and Weaknesses; Gathering of Information; Firm’s Social Responsibility; Managerial Values of Management; Evaluation of Information; Strategy Evaluation; Strategy Selection; Strategy Implementation; External Analysis; Internal Analysis] FIGURE...
  • 271
  • 464
  • 3
Document: Scientific report: "Enhanced word decomposition by calibrating the decision threshold of probabilistic models and using a model ensemble" pdf

Scientific reports

... boundaries as hidden variables and include probabilities for letter transitions within segments. The advantage of this model family is that it can learn from small datasets and easily generalises ... MIT Press, Cambridge, MA, USA. K. Shalonova, B. Golénia, and P. A. Flach. 2009. Towards learning morphology for under-resourced fusional and agglutinating languages. IEEE Transactions on Audio, ... terms of training set size. We want to remind the reader that our two algorithms are aimed at small datasets. We randomly split each dataset into 10 subsets where each subset was a test set and the...
  • 9
  • 557
  • 0
Document: Scientific report: "A Hybrid Hierarchical Model for Multi-Document Summarization" ppt

Scientific reports

... 4: Manual Evaluations Here, we manually evaluate quality of summaries, a common DUC task. Human annotators are given two sets of summary text for each document set, generated from two approaches: ... using a classifier model. HIERSUM: (Haghighi and Vanderwende, 2009) A generative summarization method based on topic models, which uses sentences as an additional level. Using an approximation ... 12 Overall 24 66 2 Table 4: Frequency results of manual quality evaluations. Results are statistically significant based on t-test. Tie indicates evaluations where two summaries are rated equal. according...
  • 10
  • 559
  • 0
