Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống
1
/ 125 trang
THÔNG TIN TÀI LIỆU
Thông tin cơ bản
Định dạng
Số trang
125
Dung lượng
2,02 MB
Nội dung
Acknowledgements First of all, I feel deeply indebted to my supervisors Associate Professor Pung Hung Keng and Dr Ng See-Kiong, without whom the completion of this thesis could not have been possible I would like to take this opportunity to express my deepest appreciation and sincere gratitude to them for their inspiring guidance, advice and kindly patience I am grateful to the team-workers in M-Comm group of NSS (Network Systems & Services) lab for their innovative thinking and works during the project I am also grateful to Haman Lee, Zhang Zhuo and all my colleagues in the Knowledge Discovery Department of the Institute for Infocomm Research (I2R) for their valuable instruction and generous assistance, which have been a great source of help I am grateful to He Jun, Zhou Lifeng, Peng Bin and Gu Tao in NSS lab who have been always encouraging, supporting and helping me during my postgraduate study I gratefully acknowledge the financial support from I2R, the Agency for Science, Technology and Research and National University of Singapore for the duration of this project Otherwise, I would not be able to undertake my further study on this project Finally, I want to show my deep appreciation to my boy friend for his constant caring and support throughout my life There are many others who have assisted me in various ways during this project I gratefully acknowledge their help I Table of Content ACKNOWLEDGEMENTS I TABLE OF CONTENT II LIST OF TABLES V LIST OF FIGURES .VI SUMMARY .VIII CHAPTER INTRODUCTION 1.1 RESEARCH PROBLEM 1.2 PROPOSED APPROACH 1.3 CONTRIBUTIONS 1.4 THESIS OVERVIEW CHAPTER BACKGROUND AND RELATED WORK 2.1 WEB SERVICE ARCHITECTURE AND STANDARDS 2.1.1 Web Service Architecture 2.1.2 Web Service Standards 11 2.2 WEB SERVICE SELECTION 15 2.2.1 UDDI for Service Selection 16 2.2.2 Service Selection in B2C 17 2.3 RELATED WORK 18 2.3.1 Semantic Annotations to Web Service 18 2.3.2 Semantic Annotations to UDDI 23 2.3.3 Semantic Annotations to WSDL 24 2.3.4 Semantic Annotations to Ontology 26 2.3.5 User-Oriented Web Service Selection 27 II 2.4 OUR APPROACH – A PREVIEW 28 CHAPTER SERVICE SELECTION IN MOBILE COMMERCE 30 3.1 3.1.1 3.2 THE M-COMM SYSTEM 30 M-Comm User Scenarios 33 SERVICE SELECTION IN M-COMM 34 3.2.1 Service Selection System Architecture 35 3.2.2 Discussions 38 CHAPTER INTELLIGENT WEB SERVICE SELECTION ENGINE (ISSE) 40 4.1 SERVICE DESCRIPTION BY TEXTUAL INFORMATION 44 4.1.1 Text-based Web Service Representation 45 4.1.2 Representing Web Services by Words 47 4.2 DOMAIN IDENTIFICATION USING SVM 49 4.2.1 Motivation 49 4.2.2 Domain Modeling 51 4.2.3 Domain Identification 59 4.3 SERVICE SELECTION 62 4.3.1 Ontology Concept Annotation 62 4.3.2 Target Service Template Construction 64 4.4 CONCLUSION 66 CHAPTER EVALUATION 68 5.1 SYSTEM IMPLEMENTATION 68 5.2 DATA 69 5.3 EVALUATION 73 5.3.1 Selection accuracy 73 5.3.2 Time latency 78 5.4 CONCLUSION 79 III CHAPTER CONCLUSIONS 80 APPENDIX A: UDDI DATA STRUCTURE 83 A.1 DATA STRUCTURES IN UDDI 83 A.2 AN EXAMPLE 90 APPENDIX B SAMPLE OWL FOR POSTAL CODE SERVICE 92 APPENDIX C THE M-COMM SYSTEM 97 C.1 SYSTEM ARCHITECTURE 97 C.2 SYSTEM DEVELOPMENT MODEL AND METHODS 99 APPENDIX D M-COMM SERVICE DISCOVERY SUB-SYSTEM DATA FLOW 103 REFERENCE 107 IV List of Tables Table4.1 Presidential Election Results 2004 – Service Presentation 47 Table4.2 Two Web Service Presented in iSSE 57 Table 4.3 Sample Features in Postal Code Service 58 Table 4.4 Mapping Question Words to Noun 60 Table 5.1 System Environment 68 Table5.2 Domain Categories and number of services in each domain 70 Table 5.3 OWL design for Postal Code Service 71 Table5.4 Precision and Recall in Service Selection Process of iSSE 74 V List of Figures Fig 2.1, Service – Oriented Web Services Architecture 10 Fig2.2 Web Service Technology Stack 12 Fig3 M-Comm System for Shopping - General Structure 32 Fig3 Components of M-Comm Service Discovery Engine 36 Fig4.1 Web Service Discovery in M-Comm System 42 Fig 4.2 iSSE System Process and Components 42 Fig 4.3 Distribution of # of Words and Occurrence 58 Fig5.1 Distribution of Words and Occurrences from the Testing Data 72 Fig 5.2 Precision Comparison between iSSE and SalCentral.com 75 Fig5.3 Recall Comparison between iSSE and SalCentral.com 75 Fig A.1 Five Data Structures defined in UDDI Data Structure Specification 2.03 84 Fig A.2 BusinessEntity data Structure 84 Fig.A.3 BusinessEntity Example 85 Fig A.4 BusinessService Data Structure Specification 85 FigA.5 BusinessService Example 86 Fig A.6 BindingTemplate structure specification 87 FigA.7 BindingTemplate Example 87 Fig A.8 tModel Data Structure Specification 88 FigA.9 tModel Example 89 Fig A.10 CategoryBag Data Structure Specification 90 FigA.11 CategoryBag Example 90 VI FigA.12 Example of UDDI data structure 91 Figure C.1 System architecture diagram 97 Figure C.2 Technology vs System architecture 101 Figure C.3 Deployment View of M-Comm System 102 Fig D.1 Level 0: DFD Diagram for Service Discovery Sub-system 103 Fig D.2 Level DFD Diagram for Service Invoking 104 Fig D.3 Level DFD Diagram for Query Process 105 VII Summary The World Wide Web, once just a simple repository of hyper-linked web pages, is now rapidly evolving into a provider of services Although web services were originally designed for B2B scenarios, as the popularity of the web increases, there is a growing need to adapt web services for B2C scenarios Current web service standards require the user to be conversant with the high technicalities associated with web services in order to select the relevant web services to apply However, B2C scenarios are inherently dynamic and heterogeneous—useroriented processes are necessary in order for web services to be useful In this thesis, we investigate the challenging problem of providing an intelligent engine for user-oriented web service selection in a B2C environment Our main contribution in this thesis is a novel light-weight semantic approach that utilizes machine learning and information retrieval techniques to identify web services that are relevant to a user’ query in free text Our approach treats web services as textual web pages from which their semantic content can be extracted using natural language text processing methods We employ a Support Vector Machine to identify the domain of the web services requested by a user’s query, and we then map the semantic content of the user’s free text query to the semantic content of the domain’s web services’ ontology concepts to select a list of relevant web services in the identified domain for the user We have implemented VIII our intelligent web service selection method as part of the service discovery engine in the M-Comm system Using actual web services from xMethods, we show that the iSSE (Intelligent Service Selection Engine) can accurately select relevant web services for different domains IX Chapter Introduction To begin, we provide the motivation of our thesis work in this introductory chapter We also give an overview of our proposed approach and our research contributions in this work 1.1 Research Problem The World Wide Web—once just a simple repository of hyperlinked web pages— is now evolving into a provider of services, or more specifically, “web services”.[1] Web services are automated resources that can be accessed via the Internet—they have been hailed as the next wave of internet based applications that will dramatically change the use of internet An increasing number of web services have now been published to several service registries Web users can use these published web services to perform many everyday information retrieval tasks and access the information providers’ technology platform directly For example, we can automate our access of catalog data at the online bookstore Amazon.com, create and populate an Amazon online shopping cart (a book sale platform), and even initiate the electronic checkout process by using the web services provided by Amazon (AWS, or Amazon.com Web Service) [4] The Figure C.3 Deployment View of M-Comm System 102 Appendix D M-Comm Service Discovery Sub- System Data Flow Figure D.1 shows the DFD diagram of the M-Comm Service Discovery Subsystem Fig D.1 Level 0: DFD Diagram for Service Discovery Sub-system Process in Figure D.1 is the process that receives service parameters from M-Comm, package as SOAP message and invoke the services on shop-side and return the result back to M-Comm (Figure D.2) 103 Fig D.2 Level DFD Diagram for Service Invoking • The Message Parser receives the parameters from M-Comm server and translates into XML-format parameters that the Service Invoker can handle At the same time, the Message Parser is in charge of translating the returned service results into XML formats and returns the XML formatted results to the M-Comm server • The Service Invoker gets the service parameters from message parser, invokes the web services at the shop-side, and gets the result back and transfer the result the Message Parser Process in Figure D.1 is the process to receive the query XML string from MComm, and return the candidate services back to M-Comm server (Fig D.3) 104 Fig D.3 Level DFD Diagram for Query Process • The query message is parsed by the Message Parser Process At the same time, the Message Parser is in charge of packaging and returning the candidate services to M-Comm • After parsing, the parsed query message is passed to the Query Preprocess to the traditional information retrieval process • The query string is then passed on to the Inference Process The Inference Process retrieves the rules and entities from knowledge base, and use them to retrieve candidate service information from the service storage database and then returns the candidate services to the Message Parser 105 Process in Figure D.1 is the process to receive the registry request and information from shop side and store the information into Service Registry Database 106 References [1] Sheila A McIlraith, Tran Cao Son, Honglei Zeng: Semantic Web Services IEEE Intelligent Systems 16(2): 46-53 (2001) [2] Web Services Description Language (WSDL) 1.1 http://www.w3.org/TR/wsdl [3] UDDI Technical White Paper http://www.uddi.org/pubs/Iru_UDDI_Technical_White_Paper.pdf [4] Amazon.com Web Service Home Page: http://www.amazon.com/gp/browse.html/002-63742896209619?node=3435361 [5] Google Web APIs Home Page http://www.google.com/apis/ [6] Christiane Fellbaum: WordNet: An Electronic Lexical Database The MIT Press, 1998 [7] Stans Kleijnen, Srikanth Raju: An Open Web Services Architecture ACM Queue 1(1): (2003) [8] Martin Gudgin: Secure, Reliable, Transacted; Innovation in Web Services Architecture SIGMOD Conference 2004: 879-880 107 [9] David Booth, Hugo Haas, Francis McCabe,et.al.: Web Services Architecture W3C Working Group Note, 11 February 2004 [10] Frank Leymann: Web Service Flow Language (WSFL 1.0) IBM, May 2001 [11] SOAP Version 1.2 Specification Assertions and Test Collection http://www.w3.org/TR/soap12-testcollection/ [12] SOAP Version 1.2 Part 0: Primer http://www.w3.org/TR/2003/REC-soap12- part0-20030624/#L1092 [13] SOAP Version 1.2 Part 1: Messaging Framework http://www.w3.org/TR/2003/REC-soap12-part1-20030624/ [14] SOAP Version 1.2 Part 2: Adjuncts http://www.w3.org/TR/2003/REC-soap12-part2-20030624/ [15] Using WSDL in a UDDI Registry, Version 1.07,UDDI Best Practice, May 21, 2002 [16]Michael C Daconta, Leo J Obrst, Kevin T Smith: The Semantic Web: A Guide to the Future of XML, Web Services, and Knowledge Management, John Wiley & Sons, ISBN 0-471-43257-1 [17]Dieter Fensel, Wolfgang Wahlster, Henry Lieberman, James Hendler: Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential, MIT Press, ISBN 0-262-06232-1 108 [18]Vladimir Geroimenko, Chaomei Chen: Visualizing the Semantic Web, Springer Verlag, ISBN 1-85233-576-9 [19].Gruber, T.R (1993): A translation approach to portable ontology specifications Knowledge Acquisition, 5, 199-220 [20] B Mahony and J.S Dong: Deep Semantic Links of TCSP and Object-Z: TCOZ Approach Formal Aspects of Computing journal, 13:142-160, Springer, 2002 [21] S C Qin, J S Dong and W N Chin: A Semantic Foundation of TCOZ in Unifying Theory of Programming FM'03 LNCS, Springer-Verlag, Pisa, Italy, Sep 2003 [22] Semantic Web Services Language (SWSL) Function Requirement http://www.daml.org/services/swsl/ [23] David Martin, Massimo Paolucci, Sheila McIlraith, Mark Burstein, Drew McDermott, Deborah McGuinness, Bijan Parsia, Terry Payne, Marta Sabou, Monika Solanki, Naveen Srinivasan, Katia Sycara: Bringing Semantics to Web Services: The OWL-S Approach, Proceedings of the First International Workshop on Semantic Web Services and Web Process Composition (SWSWPC 2004), July 6-9, 2004, San Diego, California, USA 109 [24] Massimo Paolucci, Takahiro Kawamura, Terry R Payne, Katia P Sycara: Semantic Matching of Web Services Capabilities International Semantic Web Conference 2002: 333-347 [25] T Kawamura, J.-A De Blasio, T Hasegawa, Massimo Paolucci, Katia P Sycara: Preliminary Report of Public Experiment of Semantic Service Matchmaker with UDDI Business Registry ICSOC 2003: 208-224 [26] M Paolucci, T Kawamura, T R Payne, and K Sycara: Importing the semantic web in uddi In Proceedings of E-Services and the Semantic Web Workshop, 2002 [27] Anupriya Ankolenkar, Mark H Burstein, Jerry R Hobbs, Ora Lassila, David L Martin, Sheila A McIlraith, Srini Narayanan, Massimo Paolucci, Terry R Payne, Katia P Sycara, Honglei Zeng: DAML-S: Semantic Markup for Web Services The Emerging Semantic Web 2001 [28] Anupriya Ankolenkar, Mark H Burstein, Jerry R Hobbs, Ora Lassila, David L Martin, Sheila A McIlraith, Srini Narayanan, Massimo Paolucci, Terry R Payne, Katia P Sycara, Honglei Zeng: DAML-S: Semantic Markup for Web Services SWWS 2001: 411-430 110 [29] K Sycara, J Lu, M Klusch, and S Widoff Matchmaking Among Heterogeneous Agents in the Internet Proceedings AAAI Spring Symposium on Intelligent Agents in Cyberspace , Stanford, USA, 1999 [30] K Sycara, J Lu, M Klusch, and S Widoff Dynamic Service Matchmaking among Agents in Open Information Environments Journal ACM SIGMOD Record , Special Issue on Semantic Interoperability in Global Information Systems, A Ouksel, A Sheth (Eds.), 1999 [31] Katia Sycara, Seth Widoff, Matthias Klusch and Jianguo Lu: LARKS: Dynamic Matchmaking Among Heterogeneous Software Agents in Cyberspace Autonomous Agents and Multi-Agent Systems, 5, 173–203, 2002 [32] Sivashanmugam, K (2003): The METEOR-S Framework for Semantic Web Process Composition , M.S Thesis , Department of Computer Science, University of Georgia, Athens, GA [33] Verma, K., Sivashanmugam, K , Sheth, A., Patil, A., Oundhakar, S and Miller, J METEOR-S WSDI: A Scalable Infrastructure of Registries for Semantic Publication and Discovery of Web Services , To Appear in Journal of Information Technology and Management [34] XML Topic Maps (XTM 1.0) http://www.topicmaps.org/xtm/ 111 [35] Steve Pepper: The TAO of Topic Maps: Find the Way in the Age of Infoglut, XML Europe 2000 [36] Value Set Overview Documents, Technical Overview http://www.oasis-open.org/committees/uddi-spec/doc/tn/uddi-spec-tc-tnvaluesetoverviewdocument-20040316.htm [37] Max Voskob: Taxonomies and Value Set Discussion Paper, November, 2003 [38] Asuman Dogac, Gokce Laleci, Yildiray Kabak, Ibrahim Cingil: ExploitingWeb Service Semantics: Taxonomies vs Ontologies CoRR cs.DB/0212051: (2002) [39] Sivashanmugam, K., Sheth A., Miller J., Verma K., Aggarwal R., Rajasekaran P., Metadata and Semantics for Web Services and Processes , Book Chapter, Datenbanken und Informationssysteme, Festschrift zum 60 Geburtstag von Gunter Schlageter, Publication Hagen, October 2003-09-26 [40] Andreas Heß, Nicholas Kushmerick: Automatically attaching semantic metadata to Web Services IIWeb 2003: 111-116 [41] Andreas Heß, Nicholas Kushmerick: Learning to Attach Semantic Metadata to Web Services International Semantic Web Conference 2003: 258-273 112 [42] Heinecke, Johannes; Toumani, Farouk: A natural language mediation system for e-Commerce applications: An ontology based approach Paper to read at the Workshop Human Language Technology for the Semantic Web, 2nd International Semantic Web Conference, 20th-23rd October 2003, Sanibel Island [43] B Magnini and M Speranza: Merging Global and Specialized Linguistic ontologies, In Proceedings of the Workshop Ontolex-2002 Ontologies and Lexical Knowledge Bases, LREC-2002,, Las Palmas, Canary Islands - Spain, 2002 [44] DoubleClick 2001, European Digital Marketing Survey http://www.doubleclick.com/us/knowledge_central/documents/research/dc _emea_digital_marketing_0112.pdf [45] M-Comm System, Project Report in NSS Lab of School of Computing, NUS [46] UDDI Specification V2, http://www.oasis-open.org/committees/uddi-spec/doc/tcspecs.htm#uddiv2 [47] K Bontcheva, H Cunningham The Semantic Web: A New Opportunity and Challenge for Human Language Technology Workshop on Human Language Technology for the Semantic Web and Web Services Held in conjunction with the Second International Semantic Web Conference (ISWC'03) H Cunningham, Y Ding, A Kiryakov (eds) Florida, USA October 2003 113 [48] Scott C Deerwester, Susan T Dumais, Thomas K Landauer, George W Furnas, Richard A Harshman: Indexing by Latent Semantic Analysis JASIS 41(6): 391-407 (1990) [49] D Maynard, V Tablan, K Bontcheva, and H Cunningham Rapid customization of an Information Extraction system for surprise languages Special issue of ACM Transactions on Asian Language Information Processing: Rapid Development of Language Capabilities: The Surprise Languages, 2003 [50] B Schölkopf, A Smola, R Williamson, and P L Bartlett New support vector algorithms Neural Computation, 12, 2000, 1207-1245 [51] B Schölkopf, J Platt, J Shawe-Taylor, A J Smola, and R C Williamson Estimating the support of a high-dimensional distribution Neural Computation, 13, 2001, 1443-1471 [52] Thorsten Joachims: A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization ICML 1997: 143-151 [53] Lewis, D Representation and Learning in Information Retrieval PhD thesis, Department of Computer and Information Science, University of Massachusetts 114 [54] Yang, Y: An evaluation of statistical approaches to text classification Information Retrieval, 1(1-2):69-90, 1999 [55] Thorsten Joachims: A Statistical Learning Model of Text Classification for Support Vector Machines SIGIR 2001: 128-136 [56] Thorsten Joachims: Learning To Classify Text Using Support Vector Machines, Kluwer Academic Publishers, 2001 [57] C.-W Hsu and C.-J Lin A comparison of methods for multi-class support vector machines , IEEE Transactions on Neural Networks, 13(2002), 415-425 [58] M Wagner, W.-T Balke, R Hirschfeld, W Kellerer: A Roadmap to Advanced Personalization of Mobile Services, in Proc International Federated Conference DOA, ODBASE, CoopIS 2002, Irvine, CA, USA, October 30 November 1, 2002 [59] W.-T Balke, M Wagner: Towards Personalized Selection of Web Services, in Proc WWW 2003, Budapest, Hungary, May 2003 [60] W.-T Balke, M Wagner: Cooperative Discovery for User-centered Web Service Provisioning, in Proc International Web Services Conference, Las Vegas, NV, USA, June 2003 115 [61] Heaps, H S Information Retrieval: Computational and Theoretical Aspects Academic Press, New York, 1978 [62] Araujo, M D., Navarro, G., and Ziviani, N Large text searching allows errors In Baeza-Yates, R., editor Proceedings of the 4th south Americam Workshop on String Processing, Page2-20, Valparaiso, Chile Carleton University Press [63] Brill, Eric 1995 Universal learning of disambiguation rules for part of speech tagging [64] Brill, Eric & Marcus, M 1993 Tagging an unfamiliar text with minimal human supervision ARPA Technical Report [65] N F Noy, M Sinteck, S Decker, M Crubezy, R.W Fergerson, & M A Musen Creating Semantic Web Contents with Protege-2000 IEEE Intelligent Systems 16(2):60-71, 2001 [66] Protégé website: http://protege.stanford.edu/index.html [67] S T Dumais, J Platt, D Heckerman, and M Sahami Inductive learning algorithms and representations for text categorization In CIKM-98: Proceedings of the Seventh International Conference on Information and Knowledge Management, 1998 116 [...]... selection engines, and then return (or even invoke) 15 a list of relevant candidate web services for the user Some efforts have been attempted in adapting the current B2B-focuses web services standards for useroriented service selection [58, 59, and 60] In the next section, we discuss the pros and cons of using the current web service protocol—UDDI for service selection tasks 2.2.1 UDDI for Service Selection. .. with web services, an efficient and precise service discovery and selection mechanism for web services is necessary Web service selection is a process to automatically select one web service or a list of services according to users’ requirement So far, research in the area of web services mainly focuses on B2B web services with well defined interfaces and terminologies defining a shared meaning for. .. syntactic meaning in web service, there are also some semantic descriptions about the web service that can be used for intelligent service selection For example, the names of the operations, the structures of the web services, and the relationships between the various parameters can be very informative for service selection There are approaches in semantic web services that employ WSDL as a means for applying... that semantic annotations can be introduced into UDDI to enable common semantic understanding between the service providers and services requesters 23 These approaches to semantically extend UDDI for enhanced service selection ability try to seek a balance between the UDDI compliance and semantic annotation On one hand, they share the same disadvantages from semantic web services On the other hand, they... key standards in Web service architecture They define the protocol of Web Services for messaging, service description and service registry & selection respectively 2.1.1 Web Service Architecture The web service architecture is a Service -Oriented Architecture (SOA for short), which is also the structure that Grid computing is now based on Three key roles – Service Registry, Service Provider and Service. .. provide services that facilitate the development, deployment, and usage of web services and web service network 3 1.2 Proposed Approach We present a novel automated web service selection method that utilizes machine learning and information retrieval techniques to identify relevant web services based on a users’ service query in free text format Our approach is to treat web services as textual web pages... keywords and the search engine searches the web pages stored in their database to find the web sites that contain some or all of the user query keywords The new user- oriented web service selection engines will have to parse users’ input requirements in free text for some specific task, and translate them into appropriate service requirement specifications that can be understood by the web service selection. .. uniform presentation of the services, as in the case of B2B scenarios Domain selection is the first step of B2C web service selections o Lacking common language between service providers and human users: In B2B scenario, a common language, such as taxonomies and specifications, is introduced to seek a common understanding between service providers and service requesters In a B2C scenario, human users... XML-based messaging exchanging protocol, and the underlying network protocol can be HTTP, SMTP, and so on 2.1.2 Web Service Standards To ensure interoperability, there exist published standards for performing the actions on web service architectures for operations involved in business interactions The standard web service stack is shown in Figure 2.2 The various web service standards are mainly categorized... Chapter 4 for details) 2.2.2 Service Selection in B2C Initially, web services are deployed as one of the most important standards in SOA (Service Oriented Architecture) to seamlessly integrate in B2B scenario Thus major research emphasis and board interest in standardlizing in web services are in applying web services in B2B scenario, whist human user interaction are omitted or taken for granted With ... been developed for semantic web services: OWL-S (formerly DAML-S) [24, 25, 26, 27, 28], and LARKS [29, 30, 31] OWL-S OWL-S (DAML-S) is a web service ontology language developed for describing... matched service from the storage In this thesis, we implement an intelligent web service selection engine for M-Comm’s service discovery subsystem We call our system iSSE for intelligent Service Selection. .. of relevant candidate web services for the user Some efforts have been attempted in adapting the current B2B-focuses web services standards for useroriented service selection [58, 59, and 60]