0
  1. Trang chủ >
  2. Luận Văn - Báo Cáo >
  3. Báo cáo khoa học >

Báo cáo khoa học: "Automatic training of lemmatization rules that handle morphological changes in pre-, in- and suffixes alike" docx

Báo cáo khoa học:

Báo cáo khoa học: "Automatic training of lemmatization rules that handle morphological changes in pre-, in- and suffixes alike" docx

... the algorithms and by not subdividing the training words in word classes. 4 Generation of rules and look-up data structure 4.1 Building a rule set from training pairs The training algorithm ... one of the remaining candidates instead. The training pairs that are matched by the pat-tern of the winning rule become the supporters and non-supporters of that new rule and are no longer ... schemes and thus a finite number of lemmatization rules should suffice to lemmatize indefinitely many words. In agglutinated languages, on the other hand, there are classes of words that in principle...
  • 9
  • 372
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Detection of Grammar Elements that Decrease Readability" pdf

... Automatic Detection of Grammar Elements that Decrease ReadabilityMasatoshi Tsuchiya and Satoshi SatoDepartment of Intelligence Science and Technology,Graduate School of Informatics, Kyoto Universitytsuchiya@pine.kuee.kyoto-u.ac.jp, ... unreadable.The goal of our study is to present tools that helprewriting work of improving readability in Japanese.The first tool is to help detect the sentence frag-ments (words and phrases) that should ... patterns that are defined in this first way. In the second way,a grammar element is described as a pair of its pat-terns and its examples. The following pair is an ex-ample of the grammar element that...
  • 4
  • 398
  • 0
Báo cáo khoa học: Dual effect of echinomycin on hypoxia-inducible factor-1 activity under normoxic and hypoxic conditions docx

Báo cáo khoa học: Dual effect of echinomycin on hypoxia-inducible factor-1 activity under normoxic and hypoxic conditions docx

... Effect of echinomycin on HIF-1a protein level. HepG2 and HeLa cells were incubated for 5 or 16 h under hypoxia or normoxia in thepresence or absence of increasing concentrations of echinomycin. ... effect is caused by an increase in HIF-1a pro-tein level, resulting from an increase in the transcription of the HIF-1Agene in the presence of a low concentration of echinomycin. Transfectionexperiments ... Hypoxiamarkedly increased HIF-1 DNA binding activity. Theincubation of cells with echinomycin during normoxiaor hypoxia had a minimal effect on the HIF-1 DNAbinding activity detected in the nuclear...
  • 10
  • 341
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Detection of Syllable Boundaries Combining the Advantages of Treebank and Bracketed Corpora Training" docx

... samples of wordsserve as input to the training procedure. In a treebank training step we observe for eachrule in the training grammar how often it is usedfor the training corpus. The grammar rules ... want to examine the in- fluence of the size of the training corpus on theresults of the evaluation. Therefore, we split the training corpus into 9 corpora, where the size of the corpora increases ... combining the advan-tages of treebank and bracketed corpora training. We investigate the effect of the training corpus size on the perfor-mance of our system. The evaluationshows that a hand-written...
  • 8
  • 455
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf

... adaptation of the rule set to newdomains and corpora.1 MotivationInformation Extraction (IE) systems often facethe problem of distinguishing between affirmed,negated, and speculative information in ... of phrases splitinto subsets (preceding vs. following their scope) toidentify cues using string matching. The cue scopesextend from the cue to the beginning or end of thesentence, depending ... sentence in the training dataset which contained a negation or speculationcue using the Stanford parser (Klein and Manning,2003; De Marneffe et al., 2006). Figure 1 shows theparse tree of a sample...
  • 5
  • 543
  • 1
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic learning of textual entailments with cross-pair similarities" ppt

... also that the dashed lines con-necting placeholders of two texts (hypotheses) in- dicate structurally equivalent nodes. For instance,the dashed line between3 and blinks the mainverbs both in ... the point of view of bag -of- word methods, the pairs (T1, H1) and (T1, H2) have both the same intra-pair simi-larity since the sentences of T1 and H1as well asthose of T1 and H2differ ... improvement of 4.4% over thestate -of- the-art methods.1 IntroductionRecently, textual entailment recognition has beenreceiving a lot of attention. The main reason is that the understanding of the...
  • 8
  • 413
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

... sent ences in table.We can predict that there are opinion sentences in this table, because the left column acts as aheader and there are indicators (plus and minus) in that column.3.3 Linguistic ... Learning the polarity of wordsThere are some works that discuss learning the po-larity of words instead of sentences.Hatzivassiloglou and McKeown proposed amethod of learning the polarity of ... revealed that the accuracy is quite poorwhen the training and test sets are in different do-mains. On the other hand, when Naive Bayes istrained on our corpus, there are little variance in different...
  • 8
  • 409
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt

... described in that paper. The motivation for including the list of opin-ion-bearing words as one of our features is that pro and con sentences are quite likely to contain opinion-bearing expressions ... sentence in those reviews collected from each domain with the features described in Section 3.1. We divided the data for training and testing. We then trained our model using the training set and ... researchers in Computational Lin-guistics define an opinion for their studies. It is difficult to define what an opinion means in a computational model because of the difficulty of determining the...
  • 8
  • 461
  • 1
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf

... noting that human translations are generally good and machinetranslations poor, that binary training data can becreated by taking the human translations as posi-tive training instances and ... for doing this, as we wereinterested in the level of agreement of intuitive un-derstanding of fluency. We instructed them also that they should evaluate the sentence without consider-ing its ... Connexor, the Link Grammar parser returns in- formation about word relationships, forming links,with the proviso that links cannot cross and that in a grammatical sentence all links are indirectly...
  • 8
  • 507
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

... Evaluation of Machine Translation Quality Using Longest Com-mon Subsequence and Skip-Bigram Statistics Chin-Yew Lin and Franz Josef Och Information Sciences Institute University of Southern ... using bag -of- words instead. Instead of error measures, we can also use accuracy measures that compute similarity between candidate and ref-erence translations in proportion to the number of ... subsequence (LCS) of X and Y is a common subsequence with maximum length. We can find the LCS of two sequences of length m and n using standard dynamic program-ming technique in O(mn) time....
  • 8
  • 442
  • 0

Xem thêm

Từ khóa: báo cáo khoa họcbáo cáo khoa học mẫubáo cáo khoa học y họcbáo cáo khoa học sinh họcbáo cáo khoa học nông nghiệpbáo cáo khoa học lâm nghiệpchuyên đề điện xoay chiều theo dạngNghiên cứu tổ chức pha chế, đánh giá chất lượng thuốc tiêm truyền trong điều kiện dã ngoạiGiáo án Sinh học 11 bài 13: Thực hành phát hiện diệp lục và carôtenôitGiáo án Sinh học 11 bài 13: Thực hành phát hiện diệp lục và carôtenôitGiáo án Sinh học 11 bài 13: Thực hành phát hiện diệp lục và carôtenôitĐỒ ÁN NGHIÊN CỨU CÔNG NGHỆ KẾT NỐI VÔ TUYẾN CỰ LY XA, CÔNG SUẤT THẤP LPWANPhối hợp giữa phòng văn hóa và thông tin với phòng giáo dục và đào tạo trong việc tuyên truyền, giáo dục, vận động xây dựng nông thôn mới huyện thanh thủy, tỉnh phú thọPhát triển du lịch bền vững trên cơ sở bảo vệ môi trường tự nhiên vịnh hạ longThơ nôm tứ tuyệt trào phúng hồ xuân hươngThiết kế và chế tạo mô hình biến tần (inverter) cho máy điều hòa không khíSở hữu ruộng đất và kinh tế nông nghiệp châu ôn (lạng sơn) nửa đầu thế kỷ XIXChuong 2 nhận dạng rui roBT Tieng anh 6 UNIT 2Tăng trưởng tín dụng hộ sản xuất nông nghiệp tại Ngân hàng Nông nghiệp và Phát triển nông thôn Việt Nam chi nhánh tỉnh Bắc Giang (Luận văn thạc sĩ)Giáo án Sinh học 11 bài 15: Tiêu hóa ở động vậtchuong 1 tong quan quan tri rui roGiáo án Sinh học 11 bài 14: Thực hành phát hiện hô hấp ở thực vậtBÀI HOÀN CHỈNH TỔNG QUAN VỀ MẠNG XÃ HỘIChiến lược marketing tại ngân hàng Agribank chi nhánh Sài Gòn từ 2013-2015MÔN TRUYỀN THÔNG MARKETING TÍCH HỢP