... considering moderate features that are not particularly ... Web mining in Thematic Search Engines. I. Introduction. Recently, improvements in search engine technology have been able to give Internet users a ... optimization techniques, which are often called Web mining. Here, we describe a method for improving the standard search results of a search engine, for documents and pages that have ... many different ways. Currently, most people use search engines that provide the ability to search databases of billions of Web pages, where queries are executed instantly....
... Lexicon engines. 1.1 Web Search Engines Scaling Up: 1994 - 2000 Search engine technology has had to scale dramatically to keep up with the growth of the web. In 1994, one of the first web search engines, ... large-scale search engines on the web, very little academic research has been done on them. Furthermore, due to rapid advance in technology and web proliferation, creating a web search engine ... repository. 3 Related Work Search research on the web has a short and concise history. The World Wide Web Worm (WWWW) [McBryan 94] was one of the first web search engines. It was subsequently...
... in information retrieval 1.1.1 Information retrieval systems (Information Retrieval, IR) An information retrieval system is a system that searches for documents (usually ... without Information Retrieval, IROB PQEM) - A query expansion model based on ontology and combined with an information retrieval system (Query Expansion Model with Ontology-Based with Information ... information retrieval system (Query Expansion Model with Ontology-Based and Probability with Information Retrieval, +IROB PQEM) 2.4.2.2 Steps for performing ontology-based query expansion...
... Campaign: overview of the Web People Search Clustering Task. In 2nd Web People Search Evaluation Workshop (WePS 2009), 18th WWW Conference. 2009. T. Brants and A. Franz. 2006. Web 1T 5-gram, version 1. ... conference on Research and development in information retrieval, pages 139–146. ACM. C. Carpineto, S. Osinski, G. Romano, and Dawid Weiss. 2009. A Survey of Web Clustering Engines. ACM Computing ... 2009) for image search and (Artiles et al., 2009) for person name search. We see our testbed as complementary to these ones, and expect that it can contribute to foster research on search results...
... weighting functions is a very active research area in information retrieval and it is outside the scope of this paper to provide an in-depth analysis but significant research can be found in Salton ... classification in web forums. ACM Trans. Inf. Syst., 26(3):1–34. Timothy G. Armstrong, Alistair Moffat, William Webber, and Justin Zobel. 2009. Improvements that don’t add up: ad-hoc retrieval results ... weighting approaches in automatic text retrieval. Technical report, Ithaca, NY, USA. Gerard Salton and Michael J. McGill. 1986. Introduction to Modern Information Retrieval. McGraw-Hill, Inc., New...
... is indispensable in cross language information retrieval nowadays. We propose an approach of combining lexical information, web statistics, and inverse search based on Google to backward ... translations in Chinese web page snippets. We thus base our system on web search engine: retrieving candidates from returned snippets, combining both linguistic and statistical information to find ... NEs per user. These users are asked to simulate a scenario of using web search machine to perform cross-lingual information retrieval. The proportion of different types of NEs is roughly conformed...
... others), however, the research described in this paper uses the information retrieval (IR) paradigm which has also been used by some researchers. Several sentiment information retrieval models were ... Ming Zhou. 2000. On the use of words and n-grams for Chinese information retrieval. In Proceedings of the 5th International Workshop Information Retrieval with Asian Languages, pages 141–148. ACM Press, November. Bo ... the paper will consist of four main modules: 1. Query translation 2. Opinionated Information Retrieval 3. Opinionated Information Extraction 4. Results presentation. The OIR module will process complex...
... Corpora to Cross-Language Information Retrieval: Hybrid Statistics-based and Linguistics-based Approach. In Proc. IRAL 2003, Sapporo, Japan. G. Salton. 1971. The SMART Retrieval System, Experiments ... 2001. Overview of the Second NTCIR Workshop. In Proc. Second NTCIR Workshop on Research in Chinese and Japanese Text Retrieval and Text Summarization. P. Koehn and K. Knight. 2002. Learning a Translation Lexicon ... Uemura. 2002. Exploiting and Combining Multiple Resources for Query Expansion in Cross-Language Information Retrieval. IPSJ Transactions of Databases, TOD 15, 43(SIG 9):39–54. F. Sadat, M. Yoshikawa...
... from searching all streams. This allows for an easy combination of alternative retrieval methods, creating a meta-search strategy which maximizes the contribution of each stream. Different information ... "Natural Language Information Retrieval: TREC-5 Report." Proceedings of TREC-5 conference. Strzalkowski, Tomek. 1995. "Natural Language Information Retrieval" Information Processing ... of summarization in context of an information retrieval system, where the user needs to rapidly and efficiently review the documents returned from search for an indication of relevance...
... content. For example, the pair retrieve+information is extracted from any of the following fragments: information retrieval system; retrieval of information from databases; and informa- ... Michael. 1991. "Retrieval Performance in Ferret: A Conceptual Information Retrieval System." Proceedings of ACM SIGIR-91, pp. 347-355. Sager, Naomi. 1981. Natural Language Information Processing. ... to as 'conceptual retrieval'. The conceptual retrieval systems, though quite effective, are not yet mature enough to be considered in serious information retrieval applications,...
... 1979. Representation and classification of knowledge and information for use in interactive information retrieval. In Human Aspects of Information Science. Oslo: Norwegian Library School. 148...
... Relations. In the proc. of SIGIR 94, the 17th International Conference on Research and Development in Information Retrieval. Berlin: Springer-Verlag, 61-69. Voorhees, E. M. (1998). Using WordNet ... analysis. In Proc. of the 19th annual international ACM SIGIR conference on Research and development in information retrieval. Veale and Hao (2007) exploit the simile frame “as X as Y” to harvest ... creative information retrieval to explore how an IR system can itself provide a degree of creative anticipation, acting as a mediator between the literal specification of a meaning and the retrieval...
... is a common behaviour in web search queries as shown in figure (1). It means that even ID rules are not powerful enough to model the free-word-order nature of web search queries. This leads ... 2005: 330-337. Xue, GR, HJ Zeng, Z Chen, Y Yu, WY Ma, WS Xi, WG Fan, (2004), Optimizing web search using web click-through data, Proceedings of the thirteenth ACM international conference. ... of web search queries. The model’s power, however, comes at the cost of increased time complexity, which is exponential in the length of the query. This is less of an issue for parsing web...
... trigram information into our training procedure. Corpus information might be used in more delicate way to improve the performance. References A. Berger and J. Lafferty, "Information retrieval ... new language models of information retrieval to the traditional retrieval models: University of Twente [Host]; University of Twente, Centre for Telematics and Information Technology, 2000. ... Croft, "A general language model for information retrieval," 1999, pp. 316-321. G. Salton and M. J. McGill, Introduction to Modern Information Retrieval: McGraw-Hill, Inc. New York,...
... Agency. 3 Document Retrieval Since information retrieval systems supply the initial set of documents on which a question answering system operates, it makes sense to optimize document retrieval performance ... techniques introduced in this paper are applicable to the different types of information needs discussed above. While information retrieval techniques form a strong baseline for answering relationship ... determine the statistical significance of the results. This test is commonly used in information retrieval research because it makes minimal assumptions about the underlying distribution of differences. Significance...