... Classification Using a Finite Mixture Model Hang Li Kenji Yamanishi C&C Res. Labs., NEC 4-1-1 Miyazaki Miyamae-ku Kawasaki, 216, Japan Email: {lihang,yamanisi}@sbl.cl.nec.co.jp Abstract ... crucial to the accuracy of document classification. Guthrie et al. have devised a way suitable to documentation classification. Suppose that there are two categories cl ='tennis' and ... may individually appear in the category only very rarely; polysemy problem how to determine that a word like 'ball' in a document refers to a 'tennis ball' and not a 'soccer...
... more information in lexical entries and increasing ambiguity so that other ambiguity types also can be disambiguated in a similar way via lexical category disambiguation. This idea has been ... Press, Cambridge, Massachusetts, 1999. S. Narayanan and D. Jurafsky. A Bayesian model predicts human parse preference and reading times in sentence processing. Proceedings of Advances in Neural Information ... friend accepted the man who was very impressed, the tagger showed a repair since it initially preferred a past-participle analysis for accepted and later it had to reanalyze. This is a limitation...
... using matrix PRICAI-00, 2000, (to appear). Tanaka H. (1995) Statistical Learning of “Case Frame Tree” for Translating English Verbs, Journal of NLP, 2/3, pp. 49-72, (in Japanese). Yamada, ... generalization (Akiba et al., 1996 and Tanaka, 1995); (2) approaches using structural matching: to obtain transfer rules, several search methods have been proposed for maximal structural matching between ... Laboratories 2-2 Hikaridai, Seika, Soraku Kyoto 619-0288, Japan sumita@slt.atr.co.jp Abstract Building a bilingual dictionary for transfer in a machine translation system is conventionally...
... sentence. 3 Base Model Our noisy channel model consists of two main components, a base language model and a noise model. The base language model is a probabilistic language model which generates an ... the twelfth national conference on Artificial intelligence (vol. 1), AAAI ’94, pages 779–784, Menlo Park, CA, USA. American Association for Artificial Intelligence. Lavie, A. and Agarwal, A. (2007). ... speakers for grammar correction. Using only these data sets, we can train our noisy channel model, as we have shown using a bigram language model, and a wFST for our noise model. We have also...
... Bayesian sentence-based topic model for summarization by making use of both the term-document and term-sentence associations. An efficient variational Bayesian algorithm is derived for model parameter ... document summarization has found wide-ranging applications in information retrieval and web search. Many multi-document summarization methods have been developed to extract the most important ... measure and latent semantic analysis. In Proceedings of SIGIR. Daniel D. Lee and H. Sebastian Seung. Algorithms for non-negative matrix factorization. In Advances in Neural Information Processing...
... USA, June 2008. ©2008 Association for Computational Linguistics Dictionary Definitions based Homograph Identification using a Generative Hierarchical Model Anagha Kulkarni Jamie Callan ... MVN mixture model. The results for the semi-supervised models are non-conclusive. Our post-experimental analysis reveals that the parameter updation process using the unlabeled data has an ... Homographs and Non-homographs. 2.2 Models We formulate the homograph detection process as a generative hierarchical model. Figure 2 provides the plate notation of the graphical model. The latent...
... data. The tool has been demonstrated on artificial data and yeast cell-cycle gene-expression data. Using the yeast microarray data, we have illustrated that our model can help identify regulatory ... resampling with replacement as many as 200 times. This approach, however, has a severe limitation for application to microarray data because most currently available time-course microarray data ... pages doi:10.1155/2009/484601 Research Article Using a State-Space Model and Location Analysis to Infer Time-Delayed Regulatory Networks Chushin Koh,1 Fang-Xiang Wu,2,3 Gopalan Selvaraj,4 and Anthony J. Kusalik1,...
... companies. What senior managers needed was a plan expressed in terms of three broad, critical success factors: qualitative factors, organizational factors, and quantitative factors. To take advantage ... company as a whole ORGANIZATIONAL FACTORS Coordination of organizational behavior in project management is a delicate balancing act, something like sitting on a bar stool. Bar stools usually come ... 13 Environmental Opportunities and Threats; Organizational Strengths and Weaknesses; Gathering of Information; Firm’s Social Responsibility; Managerial Values of Management; Evaluation of Information; Strategy Evaluation; Strategy Selection; Strategy Implementation; External Analysis; Internal Analysis FIGURE...
... boundaries as hidden variables and include probabilities for letter transitions within segments. The advantage of this model family is that it can learn from small datasets and easily generalises ... MIT Press, Cambridge, MA, USA. K. Shalonova, B. Golénia, and P. A. Flach. 2009. Towards learning morphology for under-resourced fusional and agglutinating languages. IEEE Transactions on Audio, ... terms of training set size. We want to remind the reader that our two algorithms are aimed at small datasets. We randomly split each dataset into 10 subsets where each subset was a test set and the...
... 4: Manual Evaluations Here, we manually evaluate quality of summaries, a common DUC task. Human annotators are given two sets of summary text for each document set, generated from two approaches: ... using a classifier model. HIERSUM: (Haghighi and Vanderwende, 2009) A generative summarization method based on topic models, which uses sentences as an additional level. Using an approximation ... 12 Overall 24 66 2 Table 4: Frequency results of manual quality evaluations. Results are statistically significant based on t-test. Tie indicates evaluations where two summaries are rated equal. according...