Scientific report: "Identifying Word Translations in Non-Parallel Texts" potx
... co-occurrences of German word pairs in the German corpus. As a starting point, word order in the two matrices was chosen such that word n in the German matrix was the translation of word n in the English ... German words are in corresponding order. Word n in the English matrix is then the translation of word n in the German matrix. 3 Simulation A simulation e...
Uploaded: 08/03/2014, 07:20
... which determine how words in documents might be generated. Fitting a generative model means finding the best set of those latent variables in order to explain the observed data. Within that setting, ... knowledge from word-topic distributions outperform methods based on similarity measures in the original word-document space. The best results, obtained by combining knowledge from...
Uploaded: 23/03/2014, 16:20
... different vowels found in the data 204 vowel positions are investigated, where a vowel position is, e.g., the first vowel in the word ’Washington’ or the second vowel in the word ’thirty’. Factor ... of linguistic structure in the aggregate analysis is based on the analysis of the pronunciation of the vowels found in the data set. In work presented in this paper the identifi...
Uploaded: 20/02/2014, 12:20
Scientific report: "Identifying Repair Targets in Action Control Dialogue" pdf
... which we adopted in our framework. We will extend the grounding act model by introducing degree of groundedness that have a quaternary distinction instead of the original binary distinction. The ... focused on the dialogue involving only utterances. In this paper, we discuss misunderstanding problem in the dialogue involving participant’s actions as well as utterances. In partic...
Uploaded: 08/03/2014, 21:20
Document: Scientific report: "User Participation Prediction in Online Forums" potx
... using the bag-of-word model since that would require the exact words to appear in the training set. In order to take advantage of the topic level information while not losing the “fine-grained” ... capture word relationship semantically. To illustrate the words inside latent topics in the LDA model inferred from online forums, we show in Table 2 the top words in 3 out of...
Uploaded: 22/02/2014, 03:20
Scientific report: "Unsupervised Word Alignment with Arbitrary Features" potx
... NeurAlign: combining word alignments using neural networks. In Proc. of HLT-EMNLP. T. Berg-Kirkpatrick, A. Bouchard-Côté, J. DeNero, and D. Klein. 2010. Painless unsupervised learning with features. In ... Lin. 2010. Discriminative word alignment by linear modeling. Computational Linguistics, 36(3):303–339. A. Lopez. 2008. Tera-scale translation models via pattern matching....
Uploaded: 17/03/2014, 00:20
Scientific report: "Classifying Semantic Relations in Bioscience Texts" pot
... surrounding forms of the stem bind which signify entities that can enter into molecular binding relationships. In Srinivasan and Rindflesch (2002) MeSH term co-occurrences within MEDLINE articles ... containing it. We used a large domain-specific lexical hierarchy (MeSH, Medical Subject Headings) to map words into semantic categories. There are about 19,000 unique terms in MeSH a...
Uploaded: 17/03/2014, 06:20
Scientific report: "Using Linguistic Knowledge in Automatic Abstracting" potx
... topics according to the reader's interest by motivating the topics, describing entities and defining concepts. We have defined our method of automatic abstracting by studying a corpus ... topics according to the reader's interest by motivating the topics, describing entities, defining concepts and so on. This kind of abstract could be used in tasks such as accessing...
Uploaded: 17/03/2014, 07:20
Scientific report: "Hedge classification in biomedical texts with a weakly supervised selection of keywords" doc
... keyword) selection To handle the inherent noise in the training dataset that originates from its weakly supervised construction, we applied the following feature selection procedure. The main ... very few useful keywords were eliminated and this indicated that our feature selection procedure was capable of distinguishing useful keywords from noise (i.e. keywords having a very high specu...
Uploaded: 31/03/2014, 00:20
Scientific report: "Dynamic Strategy Selection in Flexible Parsing" potx
... interface, including the invocation of parsing strategies, dictionaries and concepts, rather than requiring any domain adaptations by the interface system itself. With these goals in mind, we ... clustering domain concepts into functionally useful categories for user interaction. Semantic grammars, like case systems, can bring domain knowledge to bear in disambiguating word meanings...
Uploaded: 31/03/2014, 17:20