... algorithm for sentiment summarization that takes account of informativeness and readability, simultaneously. To summarize reviews, the informativeness score is based on sentiments and the readability ... except for its calculation of the informativeness score and size limitation. Therefore, when a new sentence is added to a hypothesis, both the informativeness and the read-...
Uploaded: 23/03/2014, 16:20
... the hierarchical rule table, and introduces a new feature to enhance the translation performance. We employ the relaxed-well-formed dependency structure to constrain both sides of the rule, and about 40% ... Association for Computational Linguistics, Companion Volume: Short Papers, pages 17–20. Saša Hasan, Juri Ganitkevitch, Hermann Ney, and Jesús Andrés-Ferrer. 2008. Triplet lex...
Uploaded: 20/02/2014, 04:20
Document - Scientific report: "Word Vectors and Two Kinds of Similarity" pptx
... words and their meanings, and proven to be highly useful both for many NLP applications associated with semantic processing (Widdows, 2004) and for human modeling in cognitive science (Gärdenfors, ... Furnas, Thomas L. Landauer, and Richard Harshman. 1990. Indexing by latent semantic analysis. Journal of the American Society For Information Science, 41(6):391–407. Peter G...
Uploaded: 20/02/2014, 12:20
Document - Scientific report: "Universal Grammar and Lexis for Quick Ramp-Up of MT Systems" doc
... Principles and Parameters of Syntactic Saturation. New York and Oxford: Oxford University Press. 979 tactic and morphological parameters activated in English. Thus, for morphology and syntax, ... total is not very high: verbs, around 30 episodes for the finite forms, and about 40 for the non-finite forms; nouns, around 20; adverbs and adjectives, under 5. Morphology...
Uploaded: 20/02/2014, 18:20
Document - Scientific report: "Investigating GIS and Smoothing for Maximum Entropy Taggers" pot
... 00-18 for training and WSJ 22-24 for testing, and Toutanova and Manning (2000) use WSJ 00-20 for training and WSJ 23-24 for testing. Collins uses a linear perceptron, and Toutanova and Manning (T&M) ... the 91 Kamal Nigam, John Lafferty, and Andrew McCallum. 1999. Using maximum entropy for text classification. In Proceedings of the IJCAI-99 Workshop on...
Uploaded: 22/02/2014, 02:20
Scientific report: "Endocentric Constructions and the Cocke Parsing Logic" ppt
... construction formed by the IC's is added to the list of codes to be offered for testing when iterations are performed on longer strings. This interaction between a parsing logic and a routine for ... code-matching routines for connectability. However, if carried out fully and consistently, it greatly increases the length and complexity of both the codes and the rules,...
Uploaded: 07/03/2014, 18:20
Scientific report: "Alternative Phrases and Natural Language Information Retrieval" pptx
... Phrases and Natural Language Information Retrieval Gann Bierner Division of Informatics University of Edinburgh gbierner@cogsci.ed.ac.uk Abstract This paper presents a formal analysis for a large ... Eugenio, and J. D. Moore. 1999. A dialogue based tutoring system for basic electricity and electronics. In Proceedings of AI in Education. Earl Sacerdoti. 1977. A Structure for Plan...
Uploaded: 23/03/2014, 19:20
Scientific report: "Ill-Formed and Non-Standard Language Problems" ppt
... Tokyo, October, 1980, 190-201. Weischedel, R.M., and N.K. Sondheimer, "A Framework for Processing Ill-Formed Input," Technical Memorandum H-00519, Sperry-Univac, Blue Bell, PA, October ... "A Personal View of Natural Language Understanding," SIGART Newsletter, No. 61, February, 1977, 17-18. 166 Ill-Formed and Non-Standard Language Problems Stan Kwasny...
Uploaded: 24/03/2014, 01:21
Scientific report: "Unsupervised Discrimination and Labeling of Ambiguous Names" ppt
... features and then these vectors are grouped using agglomerative clustering. (Pantel and Ravichandran, 2004) have proposed an algorithm for labeling semantic classes, which can be viewed as a form ... cluster is found using point-wise mutual information (PMI) and their average is used to group and rank the clusters to form a grammatical template or signature for the class. Th...
Uploaded: 31/03/2014, 03:20
Scientific report: "Integrating cohesion and coherence for Automatic Summarization" doc
... WordNet and tools for POS tagging and Named Entity recognition and classification. It can also be parametrised for obtaining summaries of various lengths and at granularity levels. As for relevance ... mainly used for discourse segmentation. The information stored in this DM lexicon was used for identifying inter- and intra-sentential discourse segments (Alonso and Caste1...
Uploaded: 31/03/2014, 20:20