
Information Extraction for Vietnamese Real-Estate Advertisements


DOCUMENT INFORMATION

Structure

  • Contents

  • List of Figures

  • List of Tables

  • Chapter 1 Introduction

  • 1.1 Problem and Idea

  • 1.2 Scope of the thesis

  • 1.3 Thesis structure

  • Chapter 2 Related Work

  • 2.1 Approaches

  • 2.1.1 Rule-based approach

  • 2.1.2 Machine-learning approach

  • 2.1.3 Hybrid approach

  • 2.2 GATE framework

  • 2.2.1 Introduction

  • 2.2.2 General Architecture of GATE

  • 2.2.3 An example: ANNIE - A Nearly-New Information Extraction System

  • 2.2.4 Working with GATE

  • 2.2.5 Gazetteers

  • 2.2.6 JAPE

  • Chapter 3 Our Vietnamese Real-Estate Information Extraction system

  • 3.1 Template Definition

  • 3.2 Corpus Development

  • 3.2.1 Criteria for data collection

  • 3.2.2 Data collection

  • 3.2.3 Data normalization

  • 3.2.4 Corpus Annotation

  • 3.3 System Development

  • 3.3.1 Tokenizer

  • 3.3.2 Gazetteer

  • 3.3.3 JAPE Transducer

  • 3.3.3.1 Remove incorrect Lookup annotations

  • 3.3.3.2 Recognizing <TypeEstate> entities

  • 3.3.3.3 Recognizing <CategoryEstate> entities

  • 3.3.3.4 Recognizing <Zone> entities

  • 3.3.3.5 Recognizing <Area>, <Price> and <Telephone> entities

  • 3.3.3.6 Recognizing <Fullname> entities

  • 3.3.3.7 Recognizing <Address> entities

  • 3.3.3.8 Recognizing <Email> entities

  • 3.4 Summary

  • Chapter 4 Experiments and Error Analysis

  • 4.1 Evaluation metrics

  • 4.2 Experimental result

  • 4.3 Error Analysis

  • Chapter 5 Conclusion and Future Work

  • 5.1 Conclusion

  • 5.2 Future Work

  • Appendix A

  • Appendix B

  • Bibliography

Content

Date posted: 25/03/2015, 09:44
