
Towards a framework for building an annotated named entities corpus: Master's thesis, Information Technology 1 01 10


Towards a Framework for Building an Annotated Named Entities Corpus

Hoang Huu Son
Faculty of Information Technology
University of Engineering and Technology
Vietnam National University, Hanoi

Supervised by Doctor Pham Bao Son

A thesis submitted in fulfillment of the requirements for the degree of Master of Information Technology

June, 2010

Table of Contents

1 Introduction
  1.1 Overview
  1.2 Named entity recognition (NER), including NER approach (1.2.1 to 1.2.3)
  1.3 Thesis contribution
  1.4 Thesis structure
2 Related Work
  2.1 Overview of our problem
  2.2 Research on building NER corpora
  2.3 Research on the corpus building process
  2.4 Overview of annotation tools
  2.5 Summary
3 Corpus building process
  3.1 Corpus building process (3.1.1 to 3.1.4)
  3.2 Building Vietnamese NER corpus by off-l... (3.2.1 to 3.2.3)
  3.3 Discussion about Vietnamese NER corpus bu...
  3.4 Conclusion
4 Online Annotation Framework
  4.1 Introduction
  4.2 Training section
  4.3 Annotation documents (4.3.1 to 4.3.3)
  4.4 Quality control (4.4.1 to 4.4.3)
  4.5 Conclusion
5 Evaluation
  5.1 Introduction
  5.2 Corpus evaluation (5.2.1 to 5.2.3)
  5.3 Time costing (5.3.1 to 5.3.3)
  5.4 Named entity recognition system (5.4.1 to 5.4.4)
  5.5 Summary
6 Conclusion and Future Work
  6.1 Conclusion
  6.2 Future work (6.2.1 to 6.2.3)
A Named Entity guideline
  A.1 Basic concepts
    A.1.1 Entity and Entity Name
    A.1.2 to A.1.4
  A.2 Entity classification
    A.2.1 to A.2.3
    A.2.4 Facility
    A.2.5

Toward a Framework for Building Named Entity Corpus

Hoang Huu Son
University of Engineering and Technology
Vietnam National University, Hanoi
144, Xuan Thuy, Cau Giay, Hanoi, Vietnam

Abstract

Named entity recognition (NER) is one of the most interesting problems in natural language processing. However, a main barrier to NER research is the difficulty of building an NER corpus, and no NER corpus has been published so far. In this thesis, we therefore present a corpus building process and a framework for building NER corpora, in particular a Vietnamese named entity corpus.
Introduction

Please note the following points:
- The context of the research and its role/importance
- Related studies and their methods/solutions/approaches
- The remaining problems and the objective of this study/thesis
- Your proposal. What will be carried out?

Jana Kravalová and Zdeněk Žabokrtský built the Czech Named Entity Corpus, presented in [?]. This recently released corpus of Czech sentences has manually annotated named entities and uses a rich two-level classification scheme.

- You can arrange one or more sections after the Introduction
- You can use subsections
- Show how the problem is formulated. You may give some foundations if necessary
- Show different aspects of the problem, for example feature selection, learning algorithms, etc.
- Show your proposal. It is good if you can present the differences between your proposal and previous studies. It is also important to show/analyze the solution in a reasonable way. Show how features are selected/built and which algorithms/methods you will use

Experiments

You should give the following information:
- How are the models designed? You can design different models/parameters, so please describe them in detail
- How are the data prepared?
- The results should be presented in tables and graphs
- It is important to give a discussion after obtaining experimental results

Conclusions

- With regard to the objective of this study as stated in the introduction, what has been done?
- The contribution of your work and the meaning of the obtained results
- Present future work if needed

Publications

Give here your publications during this master course
- You can also give here your submissions and their status (e.g. submitted, revising, in press)

Date posted: 11/11/2020, 22:19
