Tìm hiểu các kỹ thuật phân loại văn bản tiếng việt

99 22 0
Tìm hiểu các kỹ thuật phân loại văn bản tiếng việt

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

Thông tin tài liệu

TRlTdNG DAI HOC AN GIANG KHOA KY THUAT - CONG NGHE - MOI TRlTOfNG DlTONG THANH TRlTC - DTH082062 KHOA LUAN TOT NGHIfiP DAI HOC NGANH CU* NHAN TIN HOC TIM HIEU CAC KY THUAT PHAN LOAI VAN BAN TIENG VIET Giang vien huong dan TS Nguyen Van Hoa TRUdNG DAI HQC AN GIANG thUvien An Giang, 05/2012 LCfl CAM ON- Truac tien, em muon giii lai cam on sau sac nhat den thay giao, Tien si Nguyen Van Hoa nguoi da tan tinh huang dan em suot qua trinh thuc hien khoa luan tot nghiep Em xin bay to lai cam an sau sac nhat den thay Ths Ho Nha Phong, co giao Ths Nguyen Thi My Truyen cimg nhung thay co giao da tan tinh giang day em bon nam qua, nhung kien thiic ma em nhan duac tren giang dirang dai hoc se la hanh trang giiip em vung buac tuang lai Cu6i cimg, em muon gui lai cam on chan den tat ca ban be, va dac biet la cha me va em gai, nhung nguai luon kip thai dong vien va giiip da em vuat qua nhung kho khan cupc song Sinh vien Duong Thanh True >'•' TOM TAT Phan loai van ban la nhan biet npi dung van b^n can phan loai ndi ve cac van de thupc chu de nao Day la van de rat dom gian doi vdi chung ta, nhung lai la mot viec rat khd khan ddi vdi may Viec hun luyen dS mot cai may hidu va phan loai dupe cac van ban (vdi ti le chinh xac cao) hien dang la mpt van de nan giai, dele biet doi vdi van ban tieng Viet Hien nay, da cd kha nhieu nghien era ve van de va dat dupe cac ket qua kha quan, nhien gap phai nhieu-vudng mac can duac giai quydt Trong khda luan toi se gidi thieu so lupc \h cac phuong phap phan loai van ban da duac sii dung va tap trung vao hai phuong phap phan loai la su dung b6 phan loai vector ho tra (SVM) va phuang phap phan loai theo hudng thong ke vi chung la cac phuang phap duac danh gia cao va duac su dung nhieu luih vuc (chi tiet v6 hai phuang phap se duac gidi thieu cac ph3n sau) Qua trinh phan lo^i van ban duac chia lam giai doan chinh: -Giai doan chuan bi du lieu: Thu thap tap dO lieu mau, tach tir, tinh cac so tu, loai bo cac tu tarn thudng khong cd y nghia phan loai, lira chpn cac dac trung -Giai doan huan luyen du lieu: Xay dimg cac mo hinh phan loai, theo phuang phap duac chpn ma se cd each xay drag bp phan loai khac -Giai doan phan lap danh gia: Thu nghiem cac mo hinh phan loai da xay dung tren nhrag van ban mdi, tinh toan dp chinh xac phan loai tir tim each cai tien cac mo hinh phan loai Toi da thtrc nghiem tren chu de: giao due, phap luat, sue khde, thl thao, vi tinh Vdi mdi chu dd toi thu thap 200 van ban mau lam tap du lieu hoc va kidm tra (tdng cpng 1000 van ban) Sau su dung phuang phap hold-out (lay ngau nhien A/3 t|p du lieu de hoc va 1/3 tap du lieu lai dung cho kiem tra, lap lai qua trinh Ian rdi lay gia tri trung binh) de danh gia hieu qua cua cac bp phan loai theo hai phuang phap SVM va phuang phap thong ke thu dupe ket qua: Phuong phap Dp chinh xac SVM 96.67% Thdng ke 96.26% MUCLUC CHUONGUTONGQUAN'.1 1.1 BatvindS1 , 1.2 Lich su giai quyet van de.1 1.3.Pham vi cua de tai2 1.4.Phuang phap nghien cuu/ hudng giai quyet van d2 CHUONG 2: CO SCS LY THUYET4 2.1.Gidi thieu bai toan phan Ioai van ban tiSng Viet4 2.2.Mo hinh phan Ioai van ban4 2.1.1.Giai doan chuan bi dii lieu4 2.1.2.Giai doan hu^n luyen5 2.1.3.Giai doan phan lop vadanhgia5 2.3.C^c cong viec chinh qua trinh phan Ioai6 2.3.1.Chuin hoa van ban6 2.3.2.Tachtir6 2.3.3.Bieu dien van ban8 2.3.4.Trich chpn dac tnmg9 2.4.Cdc phuang phap phan Ioai van ban11 2.4.1.Phuang phap k lang giSng gn nhlt (kNN)11 2.4.2.Phuang phap Naive bayes12 2.4.3.Phuang phap cay quyet dinh:13 2.4.4.Phuang phap may hoc vecta ho tra (SVM)14 CHl/ONG 3: N0I DUNG VA K3ET QUA NGHIEN CU"U18 3.1.Qua trinh xay dung bo phan Ioai18 3.1.1.Mo hinh cac buac thuc hienphan Ioai18 3.1.2.Xay dvrng tap du lieu18 3.1.3.TiSnxulyvanban18 3.1.4.Lua chon dac trung20 3.1.5.Mo hinh hoa khong gian vector20 3.1.6.Xay dung bp phan Ioai21 3.1.7.Thu nghiem va danh gia21 3.2.Xay dung he thong phan Ioai van ban21 3.2.1.YeucSu—.21 3.2.2.Phantich22 3.2.3.ThiStkS128 3.3 Kit qua tare nghiem' : .55 3.3.1.Banh gia cac giai thuat55 3.3.2.So sanh cac giai thuat57 KET LUAN VA HUCJNG PHAT TRIEN61 TAI LIEU THAM KHAO'.62 PHU LUC A: DAC TA USECASE63 PHU LUC B: DANH SACH TIT THUC5NG88 DANH SACH HINH VE Hinh 1: Gdn nhdn cho cac tdi lieu van ban4 Hinh 2: Mo hinh giai doqn chudn bi die lieu5 Hinh 3: Mo hinh giai doqn hudn luyen5 Hinh 4: Mo hinh giai doqnphdn lop6 Hinh 5: Biiu diin van ban„8 Hinh 6: Mat sieu phdng phdn tdch cac mdu duong khoi cac mdu dm14 Hinh 7: Mo hinh cac btcoc thuc hienphdn loqi van ban18 Hinh 8: Usecase tdng quan23 Hinh 9: So Usecacse chitc nangphdn loqi24 Hinh 10: So Usecase chitc ndng quan ly die lieu24 Hinh 11: So Usecase chiec ndng quan ly dqc trieng van ban25 Hinh 12: So Usecase chic ndng quan ly dqc trieng chu di25 Hinh 13: So Usecase chic ndng quan ly ti thieong26 Hinh 14: So Usecase chic ndng quan ly ti biiu diin26 Hinh 15: So Usecase chic ndng quan ly tap die lieu hoc27 Hinh 16: So Usecase chic ndng quan ly bo phdn loqi27 Hinh 17: So Usecase nhom chic ndng dang nhdp hi thong28 Hinh 18: Kiin true hi thing28 Hinh 19: So chic ndng he thing29 Hinh 20: So d6 giao dien he th6ng30 Hinh 21:Giao diin chinh chitong trinh30 Hinh 22: Giao diin phdn loqi van ban31 Hinh 23: So hoqt dqng chic nangphdn loqi van ban32 Hinh 24: Giao dienphdn loqi van ban32 Hinh 25:So hoqt dqng chic nangphdn loqi van ban33 Hinh 26: Giao dien thim chu di moi33 Hinh 27:So hoqt dqng chic ndng thim chu di34 Hinh 28: Giao diin quan ly chu di35 Hinh 29:Scr d6 hoat dong chuc nang quan ly chu de35 Hinh 30: Giao diin thim van ban moi36 Hinh 31: So hoqt dqng giao diin thim van ban37 Hinh 32: Giao diin quan ly van ban38 Hinh 33:So hoqt dqng giao diin quan ly van ban39 Hinh 34: Giao diin quan lyddc trieng van ban40 Hinh 35: So hoqt dqng giao diin quan lyddc trieng van ban41 Hinh 36:Giao diin quan ly dqc trieng chu di41 Hinh 37: So hoqt dqng giqo dien quan lyddc trieng chu di42 Hinh 38: Giao diin quan ly titthitdng43 Hinh 39: So hoqt dqng giao diin quan ly tie thieong44 Hinh 40:Giao diin quan ly tie biiu dien van ban44 Hinh 41: So hoqt dqng giao diin quan ly tit biiu diin45 Hinh 42: Giao diin quan ly tap die lieu hoc46 Hinh 43: So hoqt dqng giqo diin quan ly tap die lieu hoc47 Hinh 44: Giao diin xudt tap die lieu47 Hinh 45:Sodo hoqt dqng giao dien xudt tap die lieu hoc48 Hinh 46: Giao dien xdy dung bo phan loai tuddng49 Hinh 47: Sa dS hoat donggiao dien xdy dung bophdn loai tu dong.-50 Hinh 48: Giao dien qudn ly bophdn loai50 Hinh 49: Scr hoat donggiao dien xdy dung bqphdn loai tu dong51 Hinh 50: Sadd quan he (CSDL)51 '•*DANH SACH BANG BIEU Bang 1: Chudn hoa bo ddu19 Bang 2: Danh sack cdc Actor23 Bang 3: CdU hinh he thong=29 Bang 4: Sir dung du lieu giao dien phdn loaivan ban31 Bang 5: Sir dung du lieu giao dien phdn loai thu muc32 Bang 6: Sir dung du lieu giao dien them chu di mai33 Bang 7: Sir dung du lieu giao dien qudn ly chu de35 Bang 8: Sir dung du lieu giao dien them van ban mai36 Bang 9: Sir dung du lieu giao dien qudn ly van ban38 Bang 10: Sir dung du lieu giao dien dqc trung van ban40 Bang 11: Sir dung du lieu giao dien dqc trung chu de42 Bang 12: Sir dung du lieu giao dien qudn ly tir thuang43 Bang 13: Sir dung du lieu giao dien tir biiu dien van ban:.45 Bang 14: Sir dung du lieu giao dien qudn ly tap du lieu hoc46 Bang 15: Sir dung dulieu giao dien xudt tap du lieu hoc48 Bang 16: Sir dung du lieu giao dien tudqngxdy dung bo phdn loai49 Bang 17: Su dung du lieu giao dien qudn lybqphdn loai51 Bang 18: Cdu true bang chu de52 Bang 19: Cdu true bdngvdn ban52 Bang 20: Cdu true bang dqc trung chu de53 Bang 21: Cdu true bang dqc trung van ban53 Bang 22: Cdu true bang tir bieu dien54 Bang 23: Cau true bang tir thuang54 Bang 24: Cau trite bang bq phdn loai54 Bang 25: Cdu true bang tdi khodn55 Bang 26: Ma trdn confusion trinh bay kit qua phdn loai gidi thudt SVM Ian 155 Bang 27: Ma trdn confusion trinh bay kit qua phdn loai gidi thudt SVM Idn 256 Bang 28: Ma trdn confusion trinh bay kit ^ua phdn loai gidi thudt SVM Ian 356 Bang 29: Ma trdn confusion trinh bay kit qua phdn loai gidi thudt Thong ki Idn 56 Bang 30: Ma trdn confusion trinh bay kit qua phdn loai gidi thudt Thong ki Idn 51 Bang 31: Ma trdn confusion trinh bay kit qudphdn loai gidi thudt Thongki Idn 51 Bdng,32: Usecasephdn loaivan ban64 Bang 33: Usecasephdn loai thu muc65 Bang34: Usecase thim chu de.' 65 Bang 35: Usecase xoa chu di67 Bang 36: Usecase thim van ban68 Bang 37: Usecase xoa van ban69 Bang 38: Usecase tdch tir70 Bang 39: Usecase Chudn hoa van ban71 Bang 40: Usecase loai tir thuang van ban72 Bang 41: Usecase tim dqc trung van ban73 Bdng42: Usecasexoa dqc trungvdn ban73 Bang 43: Usecase tim dqc trung chu di74 Bang 44: Usecasexoa dqc trung chu di75 Bang 45: Usecase thim tir thuang76 Bang 46: Usecase Xda ticthiccmg:.•:77 Bang 47: Usecase Xoa tie biiu diin78 Bang 48: Usecase xem danhsdch tap die lieu hoc79 Bang 49: Usecase tao tap die lieu hoc80 Bang 50: Usecase Iteu tap die lieu hoc81 Bang 51: Usecase phuc hoi tap die lieu hoc82 Bang 52: Usecase xudt tap die lieu hoc83 Bang 53: Usecase xay dung bo phan loai tie dong84 Bang 54: Usecase tao bo phan loai85 Bang 55: Usecase ddnh gid bo phan loai86 Bang 56: Usecase thie nghiem bo phan loai87 Bang 57: Danh sdeh tic thitcmg89 DANH SACH CAC TIT VIET TAT Tit viet tSt Tfr day du SVM Support Vector Machine VB Van ban CD ChudS DTVB Dae tnmg van ban TDLH Tap da lieu hoc DTCD Dae tnmg chii dS BD Bieu diln BPL Bo phan loai CSDL Co so du lieu Tim hieu cac ky thuat phan lotii van ban tieng Viet Primary actor: Quan ly Usecase type: Chi tiet Stackehoders anh interestes: Quan ly: muon xoa mot tir khoi danh sach dac trung chu de Brief description:i Khi mot tir khong co y nghTa phan loai cho mot chu de nao thi no can bi loai bo khoi danh sach dac tnmg cua chii de Trigger: Khi co yeu cau xoa mot dac trung chu de Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generalixation: Normal flow of event: 1.Chpn chuc nang (juan ly dac trung chu de 2.Chpn dac trung cfin xoa 3.NhSn niit xoa Sub 1: Xoa dac trung chu de 4.Nhan nut luu lai Sub 2: cap nhat CSDL 5.Thoat Sub 3: Kilm tra luu Subflows: Sub 1: Xoa dac trung chu de a.Neu khong xoa dupe thi thong bao loi va dimg lai b.Nguoc lai cap nhat lai giao dien Sub 2: Cap nhat CSDL a N6u khong cap nhat duac thi thong bao l6i Sub 3: Ki&n tra luu a.Neu chua luu lai thi yeu cau xac nhan b.Neu chpn luu lai thay doi thi thuc hien sub c.N^u chpn khong luu thi k^t thuc Alternate/Exceptional flows: Neu co loi qua trinh cap nhat CSDL thi kiem tra lai ket noi va thong bao Bang 44: Usecase xoa d^ c trung chu de 14 Loai tit thuarng chu de Chuc nang giong voi chuc nang loai tir thucmg van ban, nhung tir can xoa chfnh la cac dac trung chu dS SV: Duong Thanh True - DTH082062 Trang 75 Tim hieu cac ky thuat phan loai van ban tieng Viet 15 Them tie thteceng Usecase name: ThemTuThuong ID: 15 Primary actor: Quan ly Usecase type: Chi tiet Importance: Cao Stackehoders anh interestes: Quan ly: Muon them mot tit thucmg vao danh sach Brief description: Chuc nang cho phep them mot tit thuong vao danh sach tvr thucmg Tit thucmg moi them phai khong co danh sach Trigger: Khi co y^u cau them mot tit thucmg moi Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generalixation: Normal flow of event: 1.Chon chiic nang them tit thucmg 2.Nhap tit thucmg cSn them 3.Nhdn nut thuc hien Sub 1: Them tit thucmg moi 4.Nhdn niit thoat Subflows: Sub 1: Them tit thuong m6i a.N6u chua co tit thuomg thi them vao CSDL b.N^u chua nhap tit can them thi yeu cdu nhap c.N^u tit cin them da co CSDL thi thong bao 16i va yeu ciu nhap tit khac Alternate/Exceptional flows: NSu co 16i xay cap nhat CSLD thi thong bao 16i va phuc h6i trang thai truoc Bang 45: Usecase them tie thie&ng 16 Xda til• thucmg Usecase name: XoaTuThuong • ID: 16 Primary actor: Quan ly Usecase type: Chi tiet Importance: Cao Stackehoders anh interestes: Quan ly: muon xoa mot tit thucmg khoi danh sach ttr thucmg Brief description: Chiic nang cho phep xoa mot tit thucmg khoi danh sach tit thucmg SV: Duong Thanh True - DTH082062Trang 76 Tim hieu cac ky thuat phan loai van ban tieng Viet Trigger: Khi co yeu cau xoa tir thuang Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generaiixation: Normal flow of event: 1.ChQn chuc nang quan ly tit thuang 2.Chpn tir thuang can xoa 3.Nhan nut x6a Sub 1: Kiem tra viec xoa 4.Nhan nut luu lai Sub 2: Cap nhat CSDL 5.Nhan hut thoat Sub 3: Ki&n tra luu Subflows: Sub 1: Kiem tra xoa a.NSu chua chpn ttr din xoa thi yeu ciu chpn tir b.N6u khong co tir thucmg nao danh sach thi thong ^ao 16i c.Neu thoa dieu kien thi xoa tir dupe chpn va cap nhat hien thi Sub 2: Cap nhat CSDL Sub 3: Kilm tra luu a.Neu chua luu lai thi yeu cau xac nhan b.N8u chpn luu lai thay doi thi thuc hien sub c.NSu chpn khong luu thi k^t thuc Alternate/Exceptional flows: Neu co loi xay cap nhat CSLD thi thong bao loi va phuc hoi trang thai trudc Bang 46: Usecase Xoa tir thudng 17 Xoa tir biiu diin Usecase name: XoaTuBieuDien ID: 17Importance: Cao Primary actor: Quan ly Usecase type: Chi tiet Stackehoders anh interestes: Quan ly: muon xoa mot tir khoi danh sach tir bieu dien van ban Brief description: Chuc nang cho phep xoa mot tir khoi danh sach tir bieu dien van ban Trigger: Khi co y^u cau xoa tir bieu dien SV: Duong Thanh True-DTH082062Trang 77 Tim hilu cac ky thuat phan loai van ban tilng Viet Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generalixation: Normal flow of event: 1.Chon chiic nang quan ly tir bidu dien 2.Chpn tir can xoa 3.Nhan nut xoa Sub 1: Kiem tra viec xoa 4.Nhan nut luu lai Sub 2: Cap nhat CSDL 5.Nhan niit thoat Sub 3: Kiem tra luu Subflows: Sub 1: Kiem tra xoa d.Neu chua chon tir can xoa thi yeu cau chon tir e.Neu khong co tir nao danh sach thi thong bao loi f.Neu thoa dieu kien thi xoa tir dupe chon va cap nhat hien thi Sub 2: Cap nhat CSDL Sub3:Ki^mtraluu d.Neu chua luu lai thi yeu cau xac nhan e.Neu chon luu lai thay d6i thi thuc hien sub f.N8u chon khong luu thi ket thuc Alternate/Exceptional flows: N8u co 16i xay cap nhat CSLD thi thong bao 16i va phuc h6i trang thai truac Bang 47: Usecase Xoa tit bieu dien 18 Xem danh sdeh TDLH Usecase name: ID: 18 Importance: Trung binh XemDanhSachTapDuLieuHoc Primary actor: Quan ly Usecase type: Chi tilt Stackehoders anh interestes: Quan ly: mudn xem lai tap dO lieu hoc Brief description: Chuc nang cho phep nguai quan ly co thi xem lai tap du lieu hoc da dupe bieu dien he thing, tir co thi chinh sira, xoa cac phan tu du lieu nhieu SV: Duong Thanh True - DTH082062 Trang 78 Tim hieu cdc ky thuat phan lo^.i van ban tieng Viet Trigger: KM co nhu cau xem lai tap dfl lieu hoc Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generaiixation: Normal flow of event: 1.Chpn chuc nang quan ly tap du lieu hoc 2.Hien thi danh sach tap du lieu hpc Sub 1: Ki^m tra hiln thi tap du lieu hpc 3.thoat Subflows: Sub 1: Kiem tra hien thi tap du lieu a.Neu tap du lieu chua dupe xay dung thi thong bao b.Ngupc lai lay tat ca tap du lieu va hien thi len man hinh Alternate/Exceptional flows: Neu gap loi qua trinh truy xuat CSDL thi kiem tra lai ket noi va thong bao & Bang 48: Usecase xem danh sach tap du lieu hoc 19 T^o TDLH Usecase name: TaoTapDuLieuHoc ID: 19 Primary actor: Quan ly Usecase type: Chi tiet Importance: Cao Stackehoders anh interestes: Quan ly: mu6n mo hinh hoa lai cac van ban mlu he thdng Brief description: Chiic nang thuc hien viec mo hinh hoa lai toan bp van ban mau he thong va luu vao bang tap du lieu hpc CSDL Trigger: KM co nhu cfiu mo hinh hoa l^i toan bp van ban mau Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generaiixation: Normal flow of event: Chpn chuc nang quan ly tap du lieu hpc SV: Duong Thanh True - DTH082062Trang 79 Tim hieu cac ky thuat phan loai van ban tieng Viet 2.Nhan nut tao tap du lieu hoc Sub 1: Tao tap du lieu hoc 3.Nhn niit luu lai Sub 2: Cap nhat CSDL 4.Thoat Sub3:KiSmtraluu Subflows: Sub 1: Tao tap du lifu hoc a.Xda tam thfri tap du lieu hoc da co b.Ndu danh sach tu bidu din van ban thi thong bao 16i va dimg lai c.NSu khong co van ban nao he thOng thi thong bao va dimg lai d.Bi^u diin van ban theo danh sach tur bi^u diln Sub 2: Cap nhat CSDL a Neu khong cap nhat ctugc thi thong bao loi Sub 3: Kiem tra luu a.Neu ngudi dimg chua luu lai nhung thay doi thi yeu cau xac nhan b.Neu ngudi dimg chpn luu lai thay doi thi thuc hien sub c.Neu cac thay doi da dugc luu lai hoac ngudi dung khong muon luu lai cac thay doi thi thoat Alternate/Exceptional flows: Neu gap loi qud trinh truy xuat CSDL thi kiem tra lai ket noi va thong bao loi Bang 49: Usecase tao tap die lieu hoc 20 Sao lieu TDLH Usecase name: SaoLuuTDLH ID: 20 Primary actor: Quan ly Usecase type: Chi tiet Importance: Cao Stackehoders anh interestes: Quan ly: Mu6n luu lai tap du lieu mot tap tin d6c lap Brief description: Chiic nang thuc hien viec luu lai toan bp tap du lieu hoc da dupe bieu dien mot tap tin van ban doc lap Trigger: KM co nhu clu luu lai t^p du lieu hoc Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generalixation: Normal flow of event: SV: Duong Thanh True-DTH082062Trang 80 Tim hieu cac ky thuat phan loai van ban tieng Viet Chpnchiicnang quan ly tap dft lieuhpc Nhan mit hiu tSp dii lieu hoc Subl Kilm tra tap dfi lieu hoc Sub Ghpn vj tri ltru Sub Thuc hien hru Thoat Subflows: Sub 1: Ki6m tra tap du lieu hoc a b Neu tap dii lieu rdng thi thong bao 16i Ngupc lai tiep tuc sub Sub 2: Chpn vi tri luu a b Nu chua chpn vi tri luru thi dung lai Ngupc lai thuc hien sub Sub 3: Thuc hien luu a b L^^y danh sach tap du lieu hoc Ghi tap du lieu hoc vao tap tin Alternate/Exceptional flows: Neu tao neu co co loi xay qua trinh ghi tap tin thi thong bao loi va xoa tap tin da Bang SO: Usecase luu tap du lieu hoc 21 Phuc hdi TDLH Usecase name: PhucHoiTDLH ID: 21 Primary actor: Quan ly Usecase type: Chi tiet Importance: Cao Stackehoders anh interestes: Quan 1^: Muon phuc hoi lai tap du lieu hoc da luu trade Brief description: Chiic nang cho phep phuc hoi lai tap du lieu hoc tir tap tin da luu trade Trigger: Khi co yeu cau phuc hoi lai tap du lieu hoc Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generalixation: Normal flow of event: 1.Chpn chiic nang quan ly tap du lieu hoc 2.Nhan mit phuc hoi tap du lieu hoc S V: Duong Thanh True - DTH082062Trang 81 Tim hieu cac ky thuat phan loai van ban tieng Viet Sub 1: Xac nhan thuc hien Sub 2: Chon tap tin hru Sub 3: Thirc hien phgc hdi 3.Nhln nut hru lai Sub 4: Cap nhat CSDL 4.Thoat Sub 5: Kilm tra luu Subflows: Sub 1: Xac nhan thirc hien a.Neu ngirdi diing khong muon tiep tuc thuc hien thi dirng lai b.Nguac lai tigp tuc sub Sub 2: Chon tap tin luu a.Neu ngucri dung chua chon tap tin phan loai thi dirng lai b.Nguac lai ti^p tuc sub Sub 3: Thuc hien phuc h6i a.Xoa tam thai tap du lipu hoc hien tai b.Doc noi dung tap tin tap luu c.Dua phin du lieu doc duac vao CSDL va hien thi len man hinh Sub 4: Cap nhat CSDL a.Neu khong cap nhat duac CSDL thi thong bao loi b.Ngu^c lai tiep tuc Sub 5: Kilm tra luu a.Neu nguoi dung chua luu lai nhung thay doi thi yeu cau xac nhan b.Neu ngucri dung chon luu lai thay doi thi thuc hien sub c.Neu cac thay doi da duac luu lai hoac ngucri dung khong muon luu laicac thay d6i thi thoat Alternate/Exceptional flows: Neu gap loi doc tap tin luu thi thong bao loi va phuc hoi tap du lieu hoc trudcdo Neu gap loi qua trinh tray xuat CSDL thi kiem tra lai ket noi va thong bao Bang 51: Usecase phuc hoi tap die lieu hoc 22 XuSt TDLH Usecase name: XuatTDLH ID: 22 Primary actor: Quari ly Usecase type: Chi tiet Importance: Cao Stackehoders anh interestes: Quan ly: Mudn xult cac tap du lieu hoc "con" cho timg bp phan loai Brief description: Chuc nang se xuat cac tap du lieu dung lam tap du lieu hoc cho timg bp phan loai dupe xay dung he thong SV: Duong Thanh True - DTH082062Trang 82 Tim hieu cac ky thuat phan loai van ban tieng Viet Trigger: Khi co yeu cau xuat tap du lieu hoc va xay dung bo phan loai' •' f- Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generalixation: Normal flow of event: Chon chuc nang xuat tap du lieu hoc Sub 1: Tim danh sach tap du lieu hoc 2^ Nhan nut xuat tap du: lieu Sub 2: Kiem tra thuc hien Sub 3: Xuat cac tap du lieu Thoat Subflows: Sub 1: Tim danh sach tap du li^u hoc a.Tim danh sach cac t|p dft lieu can xuat cho cac bo phan loai b.Hien thi danh sach tim dupe Sub 2: Kiem tra thuc hien a.Neu chua xay dung tap du lieu hoc chung thi thong bao loi va dung lai b.Nguac lai tiep tuc sub Sub 3: Xult cac tap du lieu a.Tao thu muc chua cac tap du li^u neu can b.Trich chpn cac tap du lieu can xuat c.Ghi vao cac tap tin caii thiet Alternate/Exceptional flows: Neu gap loi qua trinh truy xudt cac tap tin van ban thi thdng bao 16i va xoa cac tap tin da xuat neu co Bang 52: Usecase xuat tap die lieu hoc 23 Xay dung BPL tu dpng Usecase name: XayDungBPLTuDong ID: 23Importance: Cao Primary actor: Quan ly Usecase type: Chi tiet Stackehoders anh interestes: Quan ly: mudn xay dung lai cac bp phan loai mot each tu dpng Brief description: Chuc nang thuc hien lai toan bp cac qua trinh trich chpn dac trung, xay dung tap du lieu hoc va xay dung cac bp phan loai mot each tu dpng Trigger: Khi co yeu cau hoc tu dpng SV: Duomg Thanh True - DTH082062Trang 83 Tim hieu cac ky thuat phan loai van ban tieng Viet Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generalixation: Normal flow of event: 1.Chon chiic nang xay dung bo phan loai tir dong 2.Cai dat cac thong s6 3.Nhan nut thuc Men Sub 1: Kidm tra thvrc Men Sub 2: Sao lira Sub 3: T6ng hop dac trung Sub 4: Chon tir bilu diln Sub 5: Xay dung tap du lieu hoc chung Sub 6: Xay cac bp phan loai 4.thoat Subflows: Sub 1: Kidm tra thuc hien a.Neu chua co van ban hoac chu de nao he thong thi thong bao va dimg lai b.Nguac lai tiep tuc sub Sub 2: Sao lira a.Yeu cau nguoi dimg xac nhan lira dfl lieu hien tai b.Thuc Men luu neu can thet, tiep tuc sub Sub 3: Tong hop dac trung a.Lay danh sach dac trung van ban va dua va danh sach dac trung chu de b.Xoa cac tir thubng va cac tir co gia tri nam ngoai nguong quy dinh Sub 4: Chon tir bieu dien a.Tinh cac gia tri tir danh sach tir bilu diln b.Xoa cac tir thucmg va cac tir co gia tri n^m ngoai nguong quy dinh Sub 5: Xay dung tap du lieu hoc chung a.Lay danh sach van ban he thdng b.Mo Mnh hoa cac van ban nhung vector theo cac tir bieu dien da chon Sub 6: Xay cac bp phan loai a.Tim danh sach cac bp phan loai can xay dung he thong b.Xuat cac tap du lieu hoc cho tirng bp phan loai c.Xay dung cac mo Mnh tuong ting cho cac tap du liSu hoc da xuat Alternate/Exceptional flows: Neu co loi qua trinh thuc Men thi dimg lai va cho phep phuc h6i lai trang thai ban dau Bang 53: Usecase xay dung bp phan loai tie dong 24 Too BPL SV: Duong Thanh True - DTH082062Trang 84 Tim hieu cac ky thuat phan loai van ban tieng Viet Usecase name: TaoBoPhanLoai ID: 24, Importance: Cao Primary actor: Quan ly Usecase type: Chi tiet Stackehoders anh interestes: Quan ly: Muon tao mot bo phan loai he thong Brief description: Chuc nang se tao cac mo hinh phan loai cho cac tap du lieu hoc da xay drag truoc Trigger: Khi co yeu cau xay drag bp phan loai cho mot tap du lieu hoc Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generaiixation: Normal flow of event: 1.Chpn chuc nang quan ly bp phan loai 2.Chpn tap du1 lieu can xiy drag bp phan loai 3.Nhan nut tao bp phan loai Sub 1: Kiem tra thuc hien Sub 2: tao bo phan loai 4.Thoat Subflows: Sub 1: Kiem tra thuc hien a.Neu khong co tap du lieu nao danh sach thi dimg lai b.Neu chua chpn tap du lieu thi thong bao loi c.Neu tap tin du lieu khong ton tai thi thong bao loi Sub 2: Tao bp phan loai a.Tao thu muc chua tap tin mo hinh ngu cin b.Gpi phucmg thuc tao mo hinh phan loai c.CapnhatCSDL Alternate/Exceptional flows: Neu co loi xay qua trinh truy cap CSDL thi kiem tra lai ket noi va thong bao loi Neu khong doc dupe tap tin du lieu hoc thi thong bao loi Bang 54: Usecase tao bo phan loai 25 Ddnh gid BPL Usecase name: DanhGiaBPL ID: 25 Importance: Cao Primary actor: Quan ly Usecase type: Chi tiet SV: Duong Thanh True-DTH082062Trang85 Tim hieu cac ky thuat phan loai van ban tieng Viet Stackehoders anh interestes: Quan ly: muon danh gia dp chinh xac ciia cac bp phan loai Brief description: Chuc nang cho phep danh gia cac bp phan loai da xay dung bang each phan loai cac van ban mau (cac van ban tham gia vao qua trinh hoc) Trigger: Khi muon danh gia dp chinh xac cac bp phan loai da xay dung Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generalisation: Normal flow of event: 1.Chpn chuc nang danh gia bp phan loai 2.Chpn cac van ban muon phan loai 3.Nhan nut danh gia Sub 1: Kilm tra thuc hien Sub 2: Phan loai cac van ban dupe chpn va tinh ti 16 chinh xac 4.Thoat Subflows: Sub 1: Kiem tra thuc hien a.N^u chua xay dung cac bp phan loai thl thong bao 16i b.NSu chua chpn van ban cSn danh gia thi yeu ciu chpn Sub 2: Phan loai cac van ban dupe chpn va tinh ti 16 chinh xac a.Lay danh sach cac van ban dupe chpn b.Phan loai timg van ban va tinh ti 16 chinh xac Alternate/Exceptional flows: NSu co loi qua trinh truy cap CSDL thi kilm tra lai k6t n6i va thong bao • Bang SS: Usecase danh gia bo phan loai 26 Thir nghiem BPL Usecase name: Thu nghiem ID: 26Importance: Cao Primary actor: Quan ly Usecase type: Chi tiet Stackehoders anh interestes: Quan ly: Muon thu nghiem cac bp phan loai Brief description: Chuc nang cho phep ngubi quan ly co the thu nghiem cac bp phan loai da xay dung tren cac van ban mod (khong tham gia vao qua trinh hoc) Trigger: Khi muon thu: nghiem cac bp phan loai SV:Dumig Thanh True -DTH082062Trang 86 Tim hieu cac ky thuat phan loai van ban tieng Viet Type: external Relationships: + Asociation: Quan ly + Include: + Extend: + Generalixation: Normal flow of event: 1.Chon chiic nang thii nghiem bo phan loai 2.Chon van ban can thii nghiem 3.Chon chu de cho cac van ban 4.Nhan nut thuc Men Sub 1: Kiem tra thuc Men Sub 2: Tien hanh phan loai va tinh dp chinh xac 5.Thoat Subflows: Sub 1: Kiem tra thuc Men c.Neu chua xay dung cac bp phan loai thi thong bao loi d.NSu chua chon van ban cin thii nghiem thi yeu cau chon Sub 2: Phan loai cac van bdn dupe chon va tinh ti le chinh xac c.Liy danh sach cac van ban dupe chon d.Phan loai timg van ban va tinh ti le chinh xac Alternate/Exceptional flows: N^u co loi qua trinh truy cap CSDL thi ki&n tra lai kSt noi va thong bao Bang 56: Usecase thii nghiem ho phan loai SV: Duong Thanh True - DTH082062Trang 87 H b K u PQ U b b x -I •>s •a l-a •a -I a •

Ngày đăng: 01/03/2021, 09:23

Tài liệu cùng người dùng

  • Đang cập nhật ...

Tài liệu liên quan