Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống
1
/ 27 trang
THÔNG TIN TÀI LIỆU
Thông tin cơ bản
Định dạng
Số trang
27
Dung lượng
546,97 KB
Nội dung
1 I H C QU C GIA HÀ N I TR NGă IăH CăCÔNGăNGH PHAN V N THU N NH N D NG CÁC BI UăM U TÀI LI U ChuyênăngƠnh:ăCôngăngh ăthôngătin Mƣăs :ă1.01.10 LU NăV NăTH CăS NG IăH NGăD NăKHOAăH C:ăăPGS.TS.ăNGÔ QU C T O HÀăN Iăậ 2007 L IăC Mă N! Lu n v n lƠ k t qu đ t đ tr ng i h c Công Ngh - c sau hai n m h c cao h c (2004-2006) t i i h c Qu c Gia HƠ N i vƠ trình nghiên c u, công tác t i Trung tơm nghiên c u vƠ phát tri n Công ngh ph n m m, ih c Công ngh , HQGHN Tr c tiên xin g i l i bi t n chơn thƠnh nh t t i PGS TS Ngô Qu c T o vƠ Th c s Nguy n Ti n S , ng i th y đƣ truy n đ t cho nh ng ki n th c đ u tiên v X lý nh vƠ nh n d ng, ng i th y đƣ dìu d t, h ng d n vƠ cho nh ng ý ki n quý báu, nh ng l i khuyên b ích su t trình tìm hi u vƠ hoƠn thƠnh lu n v n nƠy Tôi xin bƠy t lòng bi t n đ n th y cô Tr ng i h c Công ngh đƣ d y d su t hai n m h c cao h c v a qua Tôi xin g i l i c m n đ n Trung tơm nghiên c u vƠ phát tri n Công ngh Ph n m m, n i đƣ t o u ki n cho h c t p su t th i gian h c cao h c Tôi xin bƠy t lòng cám n sơu s c đ n anh ch , b n bè l p K11T2 đƣ cho nh ng l i đ ng viên, góp ý vƠ giúp đ trình h c t p vƠ th c hi n lu n v n Tôi xin c m n th y, anh ch vƠ b n Trung tơm nghiên c u vƠ phát tri n Công ngh ph n m m cho nh ng l i đ ng viên, khích l , nh ng góp ý sơu s c vƠ nh ng đ nh h ng đ s m hoƠn thƠnh lu n v n nƠy Xin c m n Khoa Công ngh thông tin, Tr ng i h c Công ngh , HQGHN t o u ki n t t cho h c t p vƠ công tác su t trình h c t p vƠ lƠm lu n v n Cu i xin bƠy t lòng bi t n sơu s c t i gia đình, n i ng h , giúp đ vƠ bên m i hoƠn c nh khó kh n HƠ N i, tháng 11/2007 Phan V n Thu n L IăCAMă OAN Tôi xin cam đoan k t qu đ t đ nhơn tôi, không chép l i c a ng c lu n v n lƠ s n ph m c a riêng cá i khác Lu n v n lƠ k t qu c a trình h c t p, nghiên c u vƠ lƠm vi c nghiêm túc su t h n hai n m cao h c Trong toƠn b n i dung c a lu n v n, nh ng u đ c trình bƠy ho c lƠ k t qu nghiên c u c a cá nhơn ho c lƠ k t qu t ng h p t nhi u ngu n tƠi li u khác Nh ng k t qu nghiên c u nƠo c a cá nhơn đ u đ c ch rõ rƠng lu n v n Các thông tin t ng h p hay k t qu l y t nhi u ngu n tƠi li u khác đ c trích d n m t cách đ y đ vƠ h p lý T t c tƠi li u tham kh o đ u có xu t x rõ rƠng vƠ đ c trích d n h p pháp Tôi xin hoƠn toƠn ch u trách nhi m vƠ ch u m i hình th c k lu t theo quy đ nh cho l i cam đoan c a HƠ N i, tháng 11/2007 Phan V n Thu n M CăL C M U CH NG - T ng quan 1.1 t v n đ 1.2 N i dung vƠ c u trúc c a lu n v n 10 CH NG - Khái quát m t s ph ng pháp phơn vùng nh vƠ kh nghiêng Error! Bookmark not defined 2.1 T ng quan v x lý bi u m u Error! Bookmark not defined 2.1.1 Các đ c tr ng c a bi u m u Error! Bookmark not defined 2.1.2 Quá trình x lý bi u m u d a Ngôn ng mô t bi u m u Error! Bookmark not defined 2.1.3 Quá trình x lý bi u m u d a đ ng ký bi u m u (Form registration) Error! Bookmark not defined 2.1.4 H th ng x lý bi u m u Error! Bookmark not defined 2.2 Các k thu t th ng đ c s d ng nh n d ng c u trúc bi u m u Error! Bookmark not defined 2.2.1 Ch nh đ nghiêng c a v n b n bi u m u Error! Bookmark not defined 2.2.1.1 2.2.1.2 Phơn tích hình nh c a phép chi u Error! Bookmark not defined Xác đ nh góc nghiêng d a vƠo bi n đ i Hough Error! Bookmark not defined 2.2.1.3 2.2.1.4 2.2.1.5 defined 2.2.1.6 Phơn c m hƠng xóm g n nh t Error! Bookmark not defined S t ng quan c a đ ng th ng Error! Bookmark not defined M t s k thu t tính góc nghiêng khác Error! Bookmark not B ng t ng k t v thu t toán xác đ nh góc nghiêng .Error! Bookmark not defined 2.2.2 Phơn tích trang Error! Bookmark not defined 2.2.2.1 2.2.2.2 defined 2.2.2.3 defined 2.2.2.4 2.2.2.5 Các k thu t phơn vùng ký t Error! Bookmark not defined Các k thu t phơn vùng trang v n b n Error! Bookmark not Các k thu t k t h p phơn vùng/phơn l p Error! Bookmark not Các k thu t phơn l p kh i Error! Bookmark not defined B ng t ng k t v k thu t phơn tích trang Error! Bookmark not defined CH NG - xu t gi i pháp s d ng bi u m u đ ng Error! Bookmark not defined 3.1 H th ng x lý bi u m u đ ng Error! Bookmark not defined 3.1.1 Gi i thi u v h th ng x lý bi u m u Error! Bookmark not defined 3.1.2 So sánh v i ph ng pháp truy n th ng Error! Bookmark not defined 3.1.3 Ph ng pháp nhơn d ng c b n Error! Bookmark not defined 3.1.4 Thi t k h th ng Error! Bookmark not defined 3.1.4.1 3.1.4.2 3.1.4.3 3.1.4.4 Ki n trúc h th ng Error! Bookmark not defined H th ng chu n b bi u m u Error! Bookmark not defined H th ng x lý bi u m u Error! Bookmark not defined ng d ng Error! Bookmark not defined 3.2 Các thu t toán x lý nh áp d ng cho x lý bi u m u đ ng Error! Bookmark not defined 3.2.1 Khái ni m bi u m u đ ng Error! Bookmark not defined 3.2.2 L c đ x lý bi u m u đ ng Error! Bookmark not defined 3.2.3 Các ph ng pháp tách ch vi t tay kh i khung n .Error! Bookmark not defined 3.2.3.1 3.2.3.2 Tìm hi u thu t toán tách c b n Error! Bookmark not defined Tách ch vi t tay kh i khung n Error! Bookmark not defined 3.2.4.1 3.2.4.2 Ph Ph 3.2.4 Xác đ nh v trí khung n Error! Bookmark not defined ng pháp chi u Error! Bookmark not defined ng pháp b qua kho ng tr ng Error! Bookmark not defined 3.2.5 Xác đ nh thông tin n ch a khung n Error! Bookmark not defined 3.3 Các thu t toán khác áp d ng cho bi u m u đ ng Error! Bookmark not defined 3.3.1 Thu t toán bóc vi n Error! Bookmark not defined 3.3.2 Thu t toán xác đ nh góc nghiêng Error! Bookmark not defined 3.3.2.1 3.3.2.2 Thu t toán s d ng phép chi u Error! Bookmark not defined Thu t toán s d ng đ ng th ng dƠy Error! Bookmark not defined 3.4 Th c nghi m Error! Bookmark not defined 3.4.1 Môi tr ng th c nghi m Error! Bookmark not defined 3.4.2 Th c nghi m đ i v i thu t toán c b n bi u m u đ ng Error! Bookmark not defined 3.4.2.1 Th c nghi m tách ch vi t tay kh i khung n Error! Bookmark not defined 3.4.2.2 Th c nghi m xác đ nh v trí khung n thông tin .Error! Bookmark not defined 3.4.2.3 Th c nghi m xác đ nh c u trúc m nh t ng khung Error! Bookmark not defined 3.4.3 Th c nghi m đ i v i thu t toán khác Error! Bookmark not defined 3.4.3.1 Xác đ nh góc nghiêng c a nh theo ph ng pháp chi u Error! Bookmark not defined 3.4.3.2 Th c nghi m nh n d ng đ ng th ng dƠy Error! Bookmark not defined K t lu n Error! Bookmark not defined o TÀI LI U THAM KH O 12 DANHăM CăHÌNHă NH ảình 2.1: L c đ trình x lý bi u m u d a Ngôn ng mô t bi u m u Error! Bookmark not defined ảình 2.2: M t h th ng x lý bi u m u v n b n Error! Bookmark not defined ảình 2.3: (a) nh tr c kh nghiêng; (b) nh sau kh nghiêng Error! Bookmark not defined ảình 2.4: (a), (b), (c) : Các đ i t ng nh ; (d), (e), (f) : Phân c m đ i t ng [17] Error! Bookmark not defined ảình 3.1 Phân tách ch vi t tay khung Error! Bookmark not defined ảình 3.2 ả th ng bi u m u đ ng Error! Bookmark not defined ảình 3.3: Data set Error! Bookmark not defined ảình 3.4: Vùng mã hóa thông tin Error! Bookmark not defined ảình 3.5: M t minh h a v bi u m u đ ng Error! Bookmark not defined ảình 3.6: Lu ng x lý Error! Bookmark not defined ảình 3.7: ảình nh c a ch m nh Error! Bookmark not defined ảình 3.8: M t c u trúc c a khung Error! Bookmark not defined ảình 3.9: M t bi u m u đ ng Error! Bookmark not defined ảình 3.10: M t s c u trúc ch m nh Error! Bookmark not defined ảình 3.11: M t l c đ x lý bi u m u đ ng Error! Bookmark not defined ảình 3.12 : Khung tr c tách Error! Bookmark not defined ảình 3.13 : Khung sau tách ch vi t tay Error! Bookmark not defined ảình 3.14 : Ch vi t tay sau tách khung Error! Bookmark not defined ảình 3.15 : Phép chi u theo chi u ngang Error! Bookmark not defined ảình 3.16 : Phép chi u khung theo chi u d c Error! Bookmark not defined ảình 3.17 : Phép chi u theo chi u d c m nh Error! Bookmark not defined ảình 3.18: nh scan có đ ng vi n màu đen Error! Bookmark not defined ảình 3.19: Các ki u hình d ng khác c a c m, m i màu t ng ng v i m t c m Error! Bookmark not defined ảình 3.20: Mô t m lân c n tr c ti p Error! Bookmark not defined ảình 3.21: Mô t m lân c n Error! Bookmark not defined ảình 3.22: Mô t m n i Error! Bookmark not defined ảình 3.23: nh scan có đ ng vi n màu đen nh sau bóc vi n đen Error! Bookmark not defined ảình 3.24: (a) nh scan, đ ng màu đ đ ng chi u góc nghiêng (b) nh đ c x p x b i hình bình hành Error! Bookmark not defined ảình 3.25: (a) nh scan (b) (c) (d) nh đ ng chi u đen tr ng góc nghiêng -2.50, -3.50, 20 Error! Bookmark not defined ảình 3.26: (a) nh scan (b) nh sau xác góc nghiêng Error! Bookmark not defined ảình 3.27 nh tr c sau xoay chi u Error! Bookmark not defined ảình 3.28 Phi u tr l i đ c quét ng c chi u Error! Bookmark not defined M U Nh p d li u t đ ng lƠ bƠi toán ngƠy cƠng thu hút nhi u s ý vƠ đ u t nghiên c u b i đơy th t s lƠ m t v n đ quan tr ng, c n thi t kh n ng áp d ng r ng rƣi vƠo th c t c ng nh hi u qu mƠ mang l i Trong bƠi toán nƠy, kh nhi u, kh nghiêng vƠ phơn vùng nh lƠ m t ph n có vai trò đ c bi t quan tr ng Ch c n ng c a lƠ xác nh, tách vùng đ c nh p thông tin, tách ch vi t tay kh i khung n thông tin đ lƠm đ u vƠo cho module nh n d ng ch Trong lu n v n nƠy trình bƠy m t ph ng pháp s d ng “c u trúc ch m nh ” đ t o khung n thông tin C u trúc ch m nh nƠy đ c t o b i r t nhi u ch m nh ho c đo n th ng nh Ph ng pháp c a có nh ng đ c m n i b t sau: Không c n thi t k bi u m u có mƠu s c mƠ v n tách ký t ch vi t tay kh i khung n D dƠng tách ph n ch vi t tay kh i khung n m t cách nhanh chóng vƠ d dƠng b ng thu t toán x lý nh đ n gi n, đ c bi t tr ng h p ch vi t tay đè lên khung Chi phí cho x lý bi u m u lƠ th p Không c n bi t tr c v trí logic c a khung n ch vƠ tìm cách x lý t đ ng ký t khung Chúng đƣ ti n hƠnh th c nghi m nhi u m u bi u m u đ ng vƠ thu đ c nh ng k t qu r t kh quan T khóa – X lý nh, Phân tích trang tài li u, Nh n d ng, Bi u m u, Phép toán hình thái, Active form 10 CH 1.1 NGă1ă- T ngăquan tăv năđ Trong th c t , công vi c nh p d li u chi m m t chi phí l n kh i l ng d li u ph i nh p vƠ công s c b đ đ m b o vi c nh p có đ xác cao Vì th t lơu v n đ nh p li u t đ ng đƣ đ c đ u t nhi u Ngu n d li u ph bi n nh t lƠ v n b n gi y, gi i pháp ch y u lƠ ph i l y nh vƠ nh n d ng Nh n d ng lƠ bƠi toán đƣ xu t hi n lơu vƠ đƣ đ t đ c nhi u thƠnh t u to l n Tuy nhiên nh n d ng m t v n b n b t kì bao g m c v n b n có l n ch vi t tay hay hình nh lƠ m t bƠi toán khó vƠ hi n v n ch a th t s có gi i pháp hoƠn ch nh Trên th gi i, hi n đƣ có nhi u ng d ng liên quan đ n v n đ nh n d ng v n b n hay nh p d li u t đ ng Có th k đ n nh : s n ph m FineReader, Scan To Office c a hƣng ABBYY, Smart scan Xpress c a Pegasus Image, ng d ng ch m thi t đ ng ầ Vi t Nam c ng đƣ có ng d ng nh n d ng v n b n nh VNDocR c a Vi n Công ngh Thông tin hay ImageScan c a CardPro ơy lƠ ng d ng nh n d ng ch in Vi c nh n d ng ch vi t tay lƠ m t thách th c M t s nghiên c u v nh n d ng ch vi t tay đƣ đ c th c hi n t i Vi n CNTT vƠ B môn Công ngh Ph n m m Tuy nhiên ng d ng nƠy hi n v n r t nhi u h n ch kh n ng nh n d ng ch vi t tay ch a đ t đ c đ xác c n thi t đ có th áp d ng r ng rƣi th c t Có hai đ i t ph ng v n b n c n nh n d ng có đ t tr ng khác đòi h i có ng pháp x lý khác ó lƠ nh n d ng v n b n phi c u trúc vƠ nh n d ng v n b n ki u bi u m u v i d li u ch đ c s p x p nh ng vùng xác đ nh Cùng v i s phát tri n c a công ngh x lý nh hi n nay, thu t toán nh n d ng ngƠy cƠng xác vƠ đ a đ ch vi t tay c ng có th đ t đ c k t qu đáng tin c y Ngay c đ i v i c đ xác cao v i u ki n lƠ ch nh n d ng t ng ch riêng bi t vƠ ch vi t đ p V i v n b n thông th ng, ta khó có th đ t 13 Ch ng nƠy trình bƠy v ph n vi c mƠ đƣ th c hi n đ tƠi chung: Gi i pháp s d ng h th ng x lý bi u m u đ ng cho vi c phơn vùng nh vƠ tách ch vi t tay N i dung c a ch ng t p trung vƠo: a khái ni m v bi u m u đ ng vƠ h th ng x lý bi u m u đ ng Trình bƠy thu t toán c b n ng d ng cho vi c nh n d ng bi u m u đ ng ánh giá u vƠ nh c m c a thu t toán nƠy Mô t chi ti t trình th c nghi m thu t toán đƣ đ xu t Ch ngă4:ăK tălu nă Ch ng b n t ng k t l i nh ng k t qu đ t đ ti p t c th c hi n t ng lai c vƠ nh ng vi c c n đ c 14 o TÀIăLI UăTHAMăKH O TƠiăli uăti ngăVi t STT [1] Phan V n Thu n & Ngô Qu c T o:“ K t h p m t s k thu t x lý nh nơng cao cho x lý bi u m u đ ng vƠ ng d ng”, ả i th o khoa h c qu c gia l n th 3-Nghiên c u c b n ng d ng công ngh thông tin, 10-2007 [2] Phan V n Thu n & Ơo Thanh Khi t: “X lý t đ ng phi u u tra”, Khóa lu n t t nghi p, đ i h c Công ngh , đ i h c Qu c gia ảà n i, 5-2004 [3] Tr n Thanh Phúc & inh V n Ph ng: “ Nghiên c u vƠ xơy d ng h th ng nh p tƠi li u t đ ng b ng nh n d ng quang h c”, Khóa lu n t t nghi p , đ i h c Công ngh , đ i h c Qu c gia ảà n i, 5-2006 TƠiăli uăti ngăAnh STT [4] Yoji Maeda, Masaki Nakagawa: Design of paper based user interface for editing document, Proc SPIE, Vol.4307, pp184- 192 (2001) [5] http://www.parc.com/solutions/dataglyphs/ [6] Koichi Kise, Yasuo Miki, Keinosuke Matsumoto: Backgrounds as Information Carriers for Printed Documents, Proc.15th ICPR’2000, Vol.4, pp380-384 (2000.9) [7] Bilan Zhu, Masaki Nakagawa: Informationd Encoding Into and Decoding From Dot Texture for Active Form, Proc ACM Symposium on Document Engineering, pp105- 144 (2003.11) [8] Taro SHIMAMURA Bilan ZHU Atsushi MASUDA Motoki ONUMA Takeshi SAKURADA Masaki NAKAGAWA: “A Prototype of An Active Form System”, Seventh International Conference on Document Analysis and Recognition (ICDAR), Edinburgh, Scotland, pp921-925 (2003.8) 15 [9] Koichi Kise, Yasuo Miki, Keinosuke Matsumoto: Backgrounds as Information Carriers for Printed Documents, Proc.15th ICPR’2000, Vol.4, pp380-384 (2000.9) [10] Bilan Zhu, Taro Shimamura, Masaki Nakagawa:Document processing methods for active form, Proc of the 3rd IASTED International Conference Visualation, Image and Image processing, September 8-10 -2003, Spain [11] S.V Rice, F.R Jenkins, and T.A Nartker The Fifth Annual Test of OCR Accuracy Technical Report TR-96-01, Information Science Research Institute, University of Nevada, Las Vegas, April 1996 [12] RM Haralick Document Image Understanding: Geometric and Logical Layout In IEEE Computer Society Con] Computer Vision and Pattern Recognition (CVPR), pages 385390, Seattle, Washington, 1994 [13] Y.Y Tang, S.\\I Lee, and C.Y Suen Automatic Document Processing: a Survey Pattern Recognition, 29(12):19311952, 1996 [14] A.K Jain and B Yu Document Representation and Its Application to Page Decomposition Technical Report MSUCPS:TR96-63, Michigan State University, East Lansing, MI, December 1996 [15] L O'Gorman and R Kasturi Dowment Image Analysis IEEE Computer Society Press, Los Alamitos, CA, 1995 [16] F Fignoni, S Messelodi, and C.M Modena Review of the State of the Art in Optical Character Recognition Part 1: Machine Printed Documents Technical Report #9607-03, IRST, Trento, Italy, June 1996 [17] J.M White and G.D Rohrer Image Thresholding for Optical Character Recognition and Other Apllications Requiring Character Image Extraction IBM Journal of Besercl; and Development, 27(4):400411, July 1983 [18] T Taxt, P.J Flynn, and A.K Jain Segmentation of Document Images IEEE Transaction on Pattern Analysis and Machine Intelligence, 1(12):13221329, December 1989 [19] L O'Gorman Binarization and Multithresholding of Document Images 16 Using Connectivity CVGIP: Graphical Models and Image Processing, 56(6):494506, 1994 [20] H.-S Don A Noise Attribute Thresholding Method for Document Image Binarization In Proc of the 3th International Conference on Document Analysis and Recognition, pages 231234, Montreal, Canada, August 1995 [21] Y Liu and S.N Srihari Document Image Binarization Based on Texture Features IEEE Transactions on Pattern Analysis and lldachine Intelligence, 19(5):540544, May 1997 [22] J Sauvola, T Seppanen, S Haapakoski, and M Pietikainen Adaptive Document Binarization In Proc of the 4th International Conference on Dowment Analysis and Recognition, pages 147152, Ulm, Germany, August 1997 [23] P.W Palumbo, P Swarninathan, and S.N Srihari Document image binarization: Evaluation of algorithms In Proc of SPIE Symposium Applications of Digital Image Processing IX, volume 697, pages 278 285, San Diego, California, August 1986 [24] O.D Trier and T Taxt Evaluation of Binarization Methods for Document Images IEEE Transactions on Pattern Analysis and lldachine Intelligence, 17(3):312315, March 1995 [25] A.T Abak, U Baris, and B Sankur The Performance Evaluation of Thresholding Algorithms for Optical Character Recognition In Proc of the 4th International Conference on Dowment Analysis and Recognition, pages 697700, Ulm, Germany, August 1997 [26] \"1 Post! Detection of linear oblique structures and skew scan in digitized documents In Proc of the 8th International Conference on Pattern Recognition, pages 687689, Paris, France, 1986 [27] H.S Baird The skew angle of printed documents In Proc of the Conference Society of Photographic Scientists and Engineers, volume 40, pages 2124, Rochester, NY, May, 20-21 1987 17 [28] G Ciardiello, G Scafuro, M.T Degrandi, M.R Spada, and M.P Roccotelli An experimental system for office document handling and text recognition In Proc of the 9th International Conference on Pattern Recognition, volume 2, pages 739743, Roma, Italy, November, 14-17 1988 [29] Y Ishitani Document Skew Detection Based on Local Region Complexity In Proc of the 2nd International Conference on Dowment Analysis and Recognition, pages 4952, 'I'sukuba, Japan, October 1993 IEEE Computer Society [30] A Bagdanov and J Kanai Projection Profile Based Skew Estimation Algorithm for JBIG Compressed Images In Proc of the 4th International Conference on Dowment Analysis and Recognition, pages 401405, Ulm, Germany, August 1997 [31] S.N Srihari and V Govindaraju Analysis of Textual Images Using the Hough Transform Machine Vision and Applications, 2(3):141 153, 1989 [32] S Hinds, J Fisher, and D D'Amato A document skew detection method using run-length encoding and the Hough transform In Proc of the 10th International Conference on Pattern Recognition, pages 464468, Atlantic City, NJ, June, 17-21 1990 [33] A.L Spitz Skew Determination in CCITT Group Compressed Document Images In Proc of the Symposium on Document Analysis and Information Retrieval, pages 1125, Las Vegas, 1992 [34] D.S Le, G.R Thoma, and H Wechsler Automated Page Orientation and Skew Angle Detection for Binary Document Images Pattern Recognition, 27(10):13251344, 1994 [35] Y Min, S.-B Cho, and Y Lee A Data Reduction Method for Efficient Document Skew Estimation Based on Hough Transformation In Proc of the 13th International Conference on Pattern Recognition, pages 732736, Vienna, Austria, August 1996 IEEE Press [36] U Pal and B.B Chaudhuri An improved document skew angle estimation 18 technique Pattern Recognition Letters, 17(8):899904, July 1996 [37] B Yu and A.K Jain A Robust and Fast Skew Detection Algorithm for Generic Documents Pattern Recognition, 29(10):15991629, 1996 [38] A Hashizume, P.S Yeh, and A Rosenfeld A method of detecting the orientation of aligned components Pattern Recognition Letters, 4:125132, 1986 [39] L O'Gorman The Document Spectrum for Page Layout Analysis IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(11):11621173, 1993 [40] R Smith A Simple and Efficient Skew Detection Algorithm via Text Row Accumulation In Proc of the 3th International Conference on Dowment Analysis and Recognition, pages 11451148, Montreal, Canada, August 1995 [41] T Akiyama and N Hagita Automated Entry System for Printed Documents Pattern Recognition, 23(11):11411154, 1990 [42] H Yan Skew Correction of Document Images Using Interline CrossCorrelation CVGIP: Graphical Models and Image Processing, 55(6):538543, November 1993 [43] B Gatos, N Paparnarkos, and C Charnzas Skew Detection and Text Line Position Determination in Digitized Documents Pattern Recognition, 30(9):15051519, 1997 [44] J Sauvola and M Pietikainen Skew Angle Detection Using Texture Direction Analysis In Proc of the 9th Scandinavian Conference on Image Analysis, pages 10991106, Uppsala, Sweden, June 1995 [45] C Sun and D Si Skew and Slant Correction for Document Images Using Gradient Direction In Proc ofthe 4th International Conference on Document Analysis and Recognition, pages 142146, Ulm, Germany, August 1997 [46] S Chen and RM Haralick An Automatic Algorithm for Text Skew 19 Estimation in Document Images Using Recursive Morphological transforms In Proc of the first IEEE International Conference on Image Processing, pages 139143, Austin, Texas, 1994 [47] H K Aghajan, B H Khalaj, and T Kailath Estimation of skew angle in text-image analysis by SLIDE: subspace-based line detection Machine Vision and Applications, 7:267276, 1994 [48] H.S Baird Anatomy of a Versatile Page Reader Proc of the IEEE, 80(7):10591065, 1992 [49] P.V.C Hough Methods and means for recognizing complex patterns US Patent #3,069,654, December 18, 1962 [50] E.R Davies Machine Vision: Theory, Algorithms, Practicalities Academic Press, 1992 [51] S Chen and RM Haralick Recursive Erosion, Dilation, Opening and Closing Transforms IEEE Transaction on Image Processing, 4(3):335345, March 1995 [52] A.P Dias Minimum Spanning Trees for Text Segmentation In Proc of Fifth Annual Symposium on Dowment Analysis and Information Retrieval, Las Vegas, Nevada, 1996 [53] H.S Baird Global-tn-Local Layout Analysis In Proc of the IAPR Workshop on Syntactic and Structnrol Pattern Recognition, pages 136 147, Pont-a-Mousson, France, September 1988 [54] H.S Baird, S.E Jones, and S.J Fortune Image Segmentation using ShapeDirected Covers In Proc of the 10th International Conference on Pattern Recognition, Atlantic City, NJ, June, 17-211990 [55] H.S Baird Background Structure in Document Images In Advances in Stmctural and Syntactic Pattern Recognition, pages 253269 World Scientific, Singapore, 1992 [56] J Ha, RM Haralick, and LT Phillips Document Page Decomposition by the Bounding-Box Projection Technique In Proc 0/ the 3th International 20 Conference on Dowment Analysis and Recognition, Montreal, Canada, August 1995 [57] S Chen, RM Haralick, and LT Phillips Extraction of Text Layout Structures on Document Images based on Statistical Characterization In ISe1T/SPIE Symposium on Electronic Imaging Science and Technology, Dowment Recognition II, pages 128139, San Jose', USA, 1995 [58] E.G Johnston SHORT NOTE: Printed Text Discrimination Computer Graphics and Image Processing, 3:8389, 1974 [59] F Heines and J Lichter Layout extraction of mixed mode documents Machine Vision and Applications, 7:237246, 1994 [60] O Deforges and D Barba Segmentation of Complex Documents Multilevel Images: a Robust and Fast Text Bodies-Headers Detection and Extraction Scheme In Proc 0/ the 3th International Conference on Document Analysis and Recognition, pages 770773, Montreal, Canada, August 1995 [61] P Parodi and G Piccioli An Efficient Pre-Processing of MixedContent Document Images for OCR Systems In Proc 0/ the 13th International Conference on Pattern Recognition, pages 778782, Vienna, Austria, August 1996 IEEE Press [62] M.B.H Ali, F Fein, F Heines, T Jager, and A Weigel Document Analysis at DFKL Part 1: Image Anlysis and Text Recognition Technical Report RR95-02, German Research Center for Artificial Intelligence (DKFI), Kaiserslautern, Germany, March 1995 [63] K.J Wong, RG Casey, and F.M Wahl Document Analysis System IBM Journal 0/ Resercli and Development, 26(6):647656, 1982 [64] G Nagy and S.C Seth Hierarchical Representation of Optically Scanned Documents In Proc 0/ the 7th International Conference on Pattern Recognition, pages 347349, Montreal, Canada, 1984 [65] D Wang and S.N Srihari Classification of Newspaper Image Blocks Using Texture Analysis Computer Vision, Graphics and Image Processing, 21 47:327352, 1989 [66] G Nagy, S Seth, and M Viswanathan A Prototype Document Image Analysis System for Technical Journals Computer, 25(7):1022, 1992 [67] M Krishnamoorthy, G Nagy, S Seth, and M Viswanathan Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals IEEE Transactions on Pattern Analysis and lldachine Intelligence, 15(7):737747, 1993 [68] D Sylwester and S Seth A Trainable, Single-Pass Algorithm for Column Segmentation In Proc of the 3th International Conference on Dowment Analysis and Recognition, pages 615618, Montreal, Canada, August 1995 [69] T Pavlidis and J Zhou Page Segmentation and Classification CVGIP: Graphical Models and Image Processing, 54(6):484496, 1992 [70] A.K Jain and S Bhattacharjee Text Segmentation using Gabor filters for automatic document processing lldachine Vision and Applications, 5(3):169184, 1992 [71] Y.Y Tang, H Ma, X Mao, D Liu, and C.Y Suen A New Approach to Document Analysis Based on Modified Fractal Signature In Proc of the 3th International Conference on Dowment Analysis and Recognition, pages 567570, Montreal, Canada, August 1995 [72] N Normand and C Viani-Gaudin A Background Based Adaptive Page Segmentation Algorithm In Proc of the 3th International Conference on Dowment Analysis and Recognition, pages 138141, Montreal, Canada, August 1995 [73] K Kise, O Yanagida, and S Takarnatsu Page Segmentation Based on Thinning of Background In Proc of the 13th International Conference on Pattern Recognition, pages 788792, Vienna, Austria, August 1996 IEEE Press [74] O.T Akindele and A Belaid Page Segmentation by Segment Tracing In Proc of the 2nd International Conference on Dowment Analysis and 22 Recognition, pages 341344, 'Tsnkuba, Japan, October 1993 IEEE Computer Society [75] L.A Fletcher and R Kasturi A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images IEEE Transactions on Pattern Analysis and Machine Intelligence, 10(6):910918, 1988 [76] A.A Zlatopolsky Automated document segmentation Pattern Recognition Letters, 15(7):699704, July 1994 [77] S.-Y Wang and T Yagasaki Block Selection: A Method for Segmenting Page Image of Various Editing Styles In Proc of the 3th International Conference on Dowment Analysis and Recognition, pages 128133, Montreal, Canada, August 1995 [78] A Simon, J.-C Pret, and A.P Johnson A Fast Algorithm for BottomUp Document Layout Analysis IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(3):273277, 1997 [79] T Saitoh and T Pavlidis Page Segmentation without Rectangle Assumption In Proc of the 11th International Conference on Pattern Recognition, pages 277280, The Hague, 1992 [80] Y Hirayama A Block Segmentation Method For Document Images with Complicated Column Structures In Proc of the 2nd International Conference on Dowment Analysis and Recognition, pages 91 94, T'sukuba, Japan, October 1993 IEEE Computer Society [81] S Tsujimoto and H Asada Major components of a Complete Text Reading System Proceedings of the IEEE, 80(7):11331149, 1992 [82] F Lebourgeois, Z Bublinski, and H Emptoz A Fast and Efficient Method For Extracting Text Paragraphs and Graphics from Unconstrained Documents In Proc of the 11th International Conference on Pattern Recognition, pages 272276, The Hague, 1992 [83] W Scherl, F Wahl, and H Fuchsberger Automatic Separation of Text, Graphic and Picture Segments in Printed Material In E.S Gelsema and L.N 23 Kanal, editors, "Pattern Recognition in Practice", pages 213221 NorthHolland, Amsterdam, 1980 [84] J Sauvola and M Pietikainen Page Segmentation and Classification using fast Feature Extraction and Connectivity Analysis In Proc of the 3th International Conference on Dowment Analysis and Recognition, pages 11271131, Montreal, Canada, August 1995 [85] A.K Jain and Y Zhong Page Layout Segmentation based on Texture Analysis Pattern Recognition, 29(5):743770, 1996 [86] K Etemad, D.S Doermann, and R Chellappa Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(1):9296, January 1997 [87] F.Y Shih and S.S Chen Adaptive Document Block Segmentation and Classification IEEE Transactions on Systems, Man, and Cybernetics, 26(5):797802, 1996 [88] R Sivararnaakrishnan, LT Phillips, J Ha, S Subrarnanium, and RM Haralick Zone Classification in a Document using the Method of Feature Vector Generation In Proc of the 3th International Conference on Document Analysis and Recognition, pages 541544, Montreal, Canada, 1995 [89] D.X Le, G.R Thoma, and H Wechsler Classification of Binary Document Images into Textual or Nontextual Data Blocks using Neural Network Models Machine Vision and Applications, 8:289304, 1995 [90] O Deforges and D Barba A Fast Multiresolution Text-line and Non Textline Structure Extraction and Discrimination Scheme for Document Image Analysis In Proc of the International Conference on Image Processing, pages 134138, Austin, Texas, 1994 [91] D Ittner Automatic Inference of Textline Orientation In Proc of the Second Annual Symposium on Document Analysis and Information 24 Retrieval, Las Vegas, Nevada, 1993 [92] J Fisher, S Hinds, and D D'Amato A Rule-Based System For Document Image Segmentation In Proc of the 10th International Conference on Pattern Recognition, pages 567572, Atlantic City, NJ, June, 17-21 1990 [93] N Amamoto, S Torigoe, and Y Hirogaki Block segmentation and Text Area Extraction of Vertically/Horizontally Written Document In Proc of the 2nd International Conference on Dowment Analysis and Recognition, pages 341344, Tsukuba, October 1993 IEEE Computer Society [94] G Nagy, S.C Seth, and S.D Stoddard Document analysis with an expert system In Proceedings Pattern Recognition in Practice II, Amsterdam, The Neaderlands, June, 19-21 1985 [95] K.S Fu Applications of Pattern Recognition CRC Press, Boca Raton, FL, 1982 [96] LT Phillips, S Chen, and R.M Haralick English Document Database Standard In Proc of the 2nd International Conference on Dowment Analysis and Recognition, pages 478483, Tsukuba, Japan, October 1993 IEEE Computer Society [97] G Nagy At the Frontiers of OCR Proceedings of the IEEE, 80(7):10931100, 1992 [98] J Kanai, S.V Rice, and T.A Nartker A Preliminary Evaluation of Automatic Zoning Technical Report TR-93-02, Information Science Research Institute, University of Nevada, Las Vegas, April 1993 [99] J Kanai, T.A Nartker, S.V Rice, and G Nagy Performance Metrics for Document Understanding Systems In Proc of the 2nd International Conference on Dowment Analysis and Recognition, pages 424427, 'Tsnkuba, Japan, October 1993 IEEE Computer Society [100] J Kanai, S.V Rice, T Nartker, and G Nagy Automated Evaluation of OCR Zoning IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(1):8690, January 1995 25 [101] S Randriamasy, L Vincent, and B Wittner An Automatic Benchmarking Scheme for Page Segmentation In L Vincent and T Pavlidis, editors, SPIE/SPSE Dowment Recognition, volume 2181, San Jose', CA, February 1994 SPIE [102] B.A Yanikoglu and L Vincent Ground-truthing and Benchmarking Document Page Segmentation In Proc of the 3th International Conference on Dowment Analysis and Recognition, pages 601604, Montreal, Canada, August 1995 [103] B.A Yanikoglu and L Vincent Pink Panther: A Complete Environment for Ground-truthing and Benchmarking Document Page Segmentation Technical report, Xerox Desktop Document Systems, February 1996 [104] J Ha, LT Phillips, and R.M Haralick Document Page Decomposition Using Bounding Boxes of Connected Components of Black Pixels In ISf?1T/SPIE Symposium on Electronic Imaging Science and Technology, Dowment Recognition II, pages 140151, San Jose', USA, 1995 [105] J Liang, R Rogers, RM Haralick, and LT Phillips UW-ISL Document Image Analysis Toolbox: An Experimental Environment In Proc of the 4th International Conference on Dowment Analysis and Recognition, pages 984988, Ulm, Germany, August 1997 [106] E Ukkonen Algorithms for approximate string matching Information and Control, 64:100118, 1985 [107] S Latifi How can permutations be used in the evaluation of zoning algorithms? International Journal of Pattern Recognition and Artificial Intelligence, 10(3):223237, 1996 [108] Adobe Systems Inc Post.Script Language Reference Manual, 2nd edition, December 1990 [109] J.M Smith and RS Stutely SGML: The Users' Guide to ISO 8879 Chichester/New York: Ellis Horwood/Halsted, 1988 [110] Horak Office document architecture and office document interchange 26 formats: current status of international standardization IEEE Computer, 18(10):5060, October 1985 [111] P Mancino Can the Open Document Architecture (ODA) Standard Change the World of Information Technology? A study of the Documentation Standard Open Document Architecture (ODA, ISO 8613) for Information Technology Master's thesis, Politecnico di Torino, Rijen, The Netherlands/Stockholm, Sweden: Ericsson Telecom, September 1994 [112] Adobe Systems Inc Portable Dowment Format Reference Manual, November 1996 [113] A Dengel, R Bleisinger, R Hoch, F Fein, and F Hoenes From Paper to Office Document Standard Representation Computer, 25(7):6367, July 1992 [114] RAF Technology Inc DAFS: Dowment Attribute Format Specification, 1995 [115] J Liang, R Rogers, B Chanda, LT Phillips, and R.M Haralick From Image to Desktop Publishing: a Complete Document Understanding System Working paper, 1995 [116] J Schiirmann, N Bartneck, T Bayer, J Franke, E Mandler, and M Oberlander Document Analysis -From Pixels to Contents Proceedings of the IEEE, 80(7):11011119, 1992 [117] J Higashino, H Fujisawa, Y Nakano, and M Ejiri A knowledgebased segmentation method for document understanding In Proc of the 8th International Conference on Pattern Recognition, pages 745 748, Paris, France, 1986 [118] D Derrien-Peden Frame-based System for Macro-typographical Structure Analysis in Scientific Papers In Proc of the lst International Conference on Docurneni Analysis and Recognition, pages 311319, France, September 1991 [119] S.N Srihari, C Wang, P Palumbo, and J Hull Recognizing Address 27 Blocks on Mail Pieces: Specialized Tools and Problem-Solving Architecture AI Magazine, 8(4):2540, Winter 1987 [120] C Wang and S.N Srihari A Framework for Object Recognition in a Visual Complex Environment and its Application to Locating Address Blocks on Mail Pieces International Journal of Computer Vision, 2:125151, 1988 [121] P.S Yeh, S Antoy, A Litcher, and A Rosenfeld Address Location on Envelopes Pattern Recognition, 20(2):213227, 1987 [122] G.E Kopec and P.A Chou Document Image Decoding Using Markov Source Models IEEE Transactions on Pattern Analysis and lldachine Intelligence, 16(6):602617, June 1994 [123] A.C Kam and G.E Kopec Document Image Decoding by Heuristic Search IEEE Transactions on Pattern Analysis and Machine Intelligence, 18(9):945950, September 1996 [124] F Esposito, D Malerba, and G Semeraro Multistrategy Learning for Document Recognition Applied Artificial Intelligence, 8(1):3384, 1994 [125] F Bapst, R Brugger, and R Ingold Towards an Interactive Document Structure Recognition System Internal working paper, University of Fribourg (Switzerland), April 1995 [126] Yuan Y.Tang: Document analysis and recognition by computers, Handbook of Pattern recognition and Computer Vision, pp 000-000 @ Word Scientific Publishing Company ... 2.2.2.5 Các k thu t phơn vùng ký t Error! Bookmark not defined Các k thu t phơn vùng trang v n b n Error! Bookmark not Các k thu t k t h p phơn vùng/phơn l p Error! Bookmark not Các k thu... nghiên c u nƠo c a cá nhơn đ u đ c ch rõ rƠng lu n v n Các thông tin t ng h p hay k t qu l y t nhi u ngu n tƠi li u khác đ c trích d n m t cách đ y đ vƠ h p lý T t c tƠi li u tham kh o đ u có xu... khung n ch vƠ tìm cách x lý t đ ng ký t khung Chúng đƣ ti n hƠnh th c nghi m nhi u m u bi u m u đ ng vƠ thu đ c nh ng k t qu r t kh quan 9 T khóa – X lý nh, Phân tích trang tài li u, Nh n d ng,