named entities in text

Scientific report: "Detecting Semantic Relations between Named Entities in Text Using Contextual Features"

Uploaded: 17/03/2014, 04:20
... use centering theory (Kameyama, 1986) to determine how easily a noun phrase can be referred to in the following context. 2.2 Centering Theory Centering theory is an empirical sorting rule used ... pronoun in the text, noun phrases that are in the previous context of the pronoun are sorted in order of likelihood of being the antecedent. The sorting algorithm has two steps. First, from the beginning ... asu Figure 2: Centering Structure ing referred to in the context with the following NE “amerika 32”. Whether or not the antecedent NE is referred to in the context with the following NE is used as...
Scientific report: "Annotating and Recognising Named Entities in Clinical Notes"

Uploaded: 08/03/2014, 01:20
... clinical named entities in 11 entity types. This paper reports on the challenges involved in creating the annotation schema, and recognising and annotating clinical named entities. The information ... are only used once. The clinical information extraction problem is addressed in this work by applying machine learning methods to a corpus annotated for clinical named entities. The data selection ... bio-textmining. Journal of Bioinformatics, 19(1), 180–182. J. Lafferty et al. 2001. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data Machine learning-international workshop...
Scientific report: "Recognizing Named Entities in Tweets"

Uploaded: 17/03/2014, 00:20
... 2010. Annotating named entities in Twitter data with crowdsourcing. In CSLDAMT, pages 80–88. Jenny Rose Finkel and Christopher D. Manning. 2009. Nested named entity recognition. In EMNLP, pages 141–150. Jenny ... 2009. Annotating and recognising named entities in clinical notes. In ACL-IJCNLP, pages 18–26. Dan Wu, Wee Sun Lee, Nan Ye, and Hai Leong Chieu. 2009. Domain adaptive bootstrapping for named entity ... transcripts. In EMNLP, pages 320–327. Jing Jiang and ChengXiang Zhai. 2007. Instance weighting for domain adaptation in NLP. In ACL, pages 264–271. Dan Klein and Christopher D. Manning. 2003....
Effectiveness with in-text online advertising

Uploaded: 16/01/2013, 14:11
... through an in-text advertisement. Advertisers, for their part, pay Vibrant only according to the number of times readers actually click on the ad. Statistics show that in-text advertising attracts a volume of ... However, the list of these words also changes constantly. The Indianapolis Star began using in-text advertising this past August. Partricia Miller, the director in charge of advertising ... while the rate of readers who hover over and click on in-text ads ranges from 3% to 10%, depending on the type of product advertised. (Source: Vneconomy/BusinessWeek) ...
Document: New product development in textiles

Uploaded: 21/02/2014, 18:20
... Woodhead Publishing Limited, 2012 The Textile Institute and Woodhead Publishing The Textile Institute is a unique organisation in textiles, clothing and footwear. Incorporated in England by ... discount merchandising, opening its first Target discount store in 1962. In 1966 it branched out into discount book selling. Through growing expertise in sourcing and buying in bulk, strong fi ... example, IBM (International Business Machine), 3M (Minnesota and Mining) or even retailers such as the Dayton-Hudson Corporation, now operating as Target. The International Business Machine Company...
Document: Scientific report: "DISCOURSE ENTITIES IN JANUS"

Uploaded: 21/02/2014, 20:20
... Webber's rules to detect modal and other index-binding contexts. In representing DEs for indefinites (appearing as existential formulae in our meaning representation), we replaced Webber's ... DEs originating from non-linguistic sources, such as pointing actions, were taken into account. The discourse entities are used in intra- and extra-sentential pronoun resolution in BBN Janus. ... for generating DEs for dependent quantifiers stemming from indefinite and definite NPs which overcomes some difficulties in capturing dependencies between discourse entities. In our multi-modal...
Scientific report: "Probabilistic Document Modeling for Syntax Removal in Text Summarization"

Uploaded: 07/03/2014, 22:20
... B. Tenenbaum. 2005. Integrating topics and syntax. In Advances in Neural Information Processing Systems 17, pages 537–544. MIT Press. 646 to an approximation. Following Griffiths et al. (2005), ... Some future work includes applying this model to areas such as topic tracking and text segmentation, and coherently adjusting it to fit an n-gram modeling approach. Acknowledgments William Darling is supported ... statistically significant increases. 5 Conclusions and Future Work This paper has described using a domain-independent document modeling approach of avoiding low-content syntax words in an NLP task where...
Scientific report: "Using Syntax to Disambiguate Explicit Discourse Connectives in Text"

Uploaded: 08/03/2014, 01:20
... improvement over those obtained by Marcu (2000) in his corpus-based approach which achieves an f-score of 84.9% 3 for identifying discourse connectives in text. While bearing in mind that the evaluations ... it appears. For training and testing, we used explicit discourse connectives annotated in the PDTB as positive examples and occurrences of the same strings in the PDTB texts that were not ... high baseline, with an f-score of 75.33% and an accuracy of 85.86%. Interestingly, using only the syntactic features, ignoring the identity of the connective, is even better, resulting in an...
Scientific report: "A Study on Automatically Extracted Keywords in Text Categorization"

Uploaded: 08/03/2014, 02:21
... machines in text categorization. In Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), pages 487–493. Yiming Yang and Xin Liu. 1999. A re-examination of ... to the baseline. The best performance was obtained when using a boolean feature value, and setting the minimum number of occurrences in training data to three (giving an F-measure of 56.9%). In the ... text categorization methods. In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 42–49. 544 uments in the training...
Scientific report: "Translating Named Entities Using Monolingual and Bilingual Resources"

Uploaded: 08/03/2014, 07:20
... part of the FBIS 2001 Multilingual corpus. Translating Named Entities Using Monolingual and Bilingual Resources Yaser Al-Onaizan and Kevin Knight Information Sciences Institute University of Southern ... because many are domain specific, not to be found in bilingual dictionaries. We present a novel algorithm for translating named entity phrases using easily obtainable monolingual and bilingual resources. We ... to identify all named entities in the top retrieved documents for each sub-phrase. All named entities of the type of the named entity in question (e.g., PERSON) found in the retrieved documents...
Scientific report: "A multi-staged approach to identifying complex events in textual data"

Uploaded: 08/03/2014, 21:20
... important contextual information using text classification methods. We also use text classification methods to help users to more quickly focus on an area where interesting transactions exist in an interactive ... Workshop on Information Filtering. Lodhi, H., Saunders, C., Shawe-Taylor, J., Cristianini, N., and Watkins, C. 2002. Text classification using string kernels. Journal of Machine Learning Research, ... 419-444. Vilain, M. and Day, D. 1996. Finite-state Phrase Parsing by Rule Sequences, Proc. of COLING-96. Vilain, M. 1999. Inferential information extraction. In Pazienza, M.T. & Basili, R., Information...
Scientific report: "Finding Contradictions in Text"

Uploaded: 17/03/2014, 02:20
... polarity for textual inference. In Proceedings of ICoS-5. Olivia Sanchez-Graillet and Massimo Poesio. 2007. Discovering contradiction protein-protein interactions in text. In Proceedings of BioNLP ... applied to intelligence reports, demonstrating which information may need further verification. In bioinformatics where protein-protein interaction is widely studied, automatically finding conflicting ... as shown in the last two lines of table 5. 4 This stands in contrast with the low inter-annotator agreement reported by Sanchez-Graillet and Poesio (2007) for contradictions in protein-protein interactions....
Scientific report: "Using Readers to Identify Lexical Cohesive Structures in Texts"

Uploaded: 17/03/2014, 06:20
... Hearst. 1997. TextTiling: Segmenting text into multi-paragraph subtopic passages. Computational Linguistics, 23(1):33–64. Lynette Hirschman, Patricia Robinson, John D. Burger, and Marc Vilain. 1998. ... annotations. The wordlist contained words from the text, in their appearance order, excluding verbatim and inflectional repetitions 3. People were instructed to read the text first, and then go through ... one fiction story were taken in full; others were cut at a meaningful break to stay within the 1000-word limit. The texts were in English - original language for all but two texts. Our subjects were...
Scientific report: "Discovering Relations among Named Entities from Large Corpora"

Uploaded: 17/03/2014, 06:20
... of named entities and their context 3. measuring context similarities among pairs of named entities 4. making clusters of pairs of named entities 5. labeling each cluster of pairs of named entities We ... corpora. The key idea is clustering pairs of named entities according to the similarity of context words intervening between the named entities. Our experiments using one year of newspapers reveal ... little higher in the PER-GPE domain than in the COM-COM domain, perhaps because there were more NE pairs with high cosine similarity in the PER-GPE domain than in the COM-COM domain. However,...
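
The steps quoted in the excerpt above (measuring context similarity among entity pairs, clustering the pairs, labeling each cluster) can be illustrated with a minimal Python sketch. Only the use of cosine similarity over intervening context words comes from the excerpt. Everything else is an assumption made for illustration: the greedy threshold clustering, the `threshold` value, the tiny `STOPWORDS` set, the most-frequent-word labeling rule, and the toy entity pairs. It is not the paper's implementation.

```python
from collections import Counter
from math import sqrt

# Hypothetical stop-word list, used only when choosing a cluster label.
STOPWORDS = {"was", "to", "in", "near"}


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words context vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_pairs(contexts: dict, threshold: float = 0.5) -> list:
    """Greedy clustering (an assumed, simplified scheme): a pair joins the
    first cluster containing a member whose context similarity reaches
    `threshold`; otherwise it starts a new cluster."""
    clusters = []
    for pair, vec in contexts.items():
        for cluster in clusters:
            if any(cosine(vec, contexts[other]) >= threshold for other in cluster):
                cluster.append(pair)
                break
        else:
            clusters.append([pair])
    return clusters


def label_cluster(cluster: list, contexts: dict) -> str:
    """Label a cluster with its most frequent non-stop-word context word."""
    total = Counter()
    for pair in cluster:
        total.update(w for w in contexts[pair].elements() if w not in STOPWORDS)
    return total.most_common(1)[0][0] if total else ""


# Invented toy data: entity pair -> words found between the two entities.
contexts = {
    ("Alice", "Paris"): Counter("was born in".split()),
    ("Bob", "Tokyo"): Counter("was born near".split()),
    ("AcmeCorp", "Globex"): Counter("agreed to acquire".split()),
    ("Initech", "Hooli"): Counter("to acquire".split()),
}

clusters = cluster_pairs(contexts)
labels = [label_cluster(c, contexts) for c in clusters]
print(list(zip(labels, clusters)))
```

On this toy input the two person-place pairs end up in one cluster labeled "born" and the two company-company pairs in another labeled "acquire". A real system would need a proper named-entity tagger, a much larger corpus, and a more careful clustering method.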
