Báo cáo khoa học: "Probabilistic Text Structuring: Experiments with Sentence Ordering" docx

Báo cáo khoa học: "Probabilistic Text Structuring: Experiments with Sentence Ordering" docx

Báo cáo khoa học: "Probabilistic Text Structuring: Experiments with Sentence Ordering" docx

... Twelve texts were ran- domly selected from the 20 texts in our test data. The texts were presented to subjects with the order of their sentences scrambled. Participants were asked to reorder the sentences ... determine the order for a new text that we haven’t encountered before, since some of the features representing its sentences will be familiar. Given a text with N sentences there...

Ngày tải lên: 23/03/2014, 19:20

8 271 0
Tài liệu Báo cáo khoa học: "Hierarchical Text Classification with Latent Concepts" doc

Tài liệu Báo cáo khoa học: "Hierarchical Text Classification with Latent Concepts" doc

... subclasses share information with these d- ifferent concepts respectively. Then, we pro- pose a variant Passive-Aggressive (PA) algo- rithm for hierarchical text classification with latent concepts. ... analysis. Section 5 concludes the paper. 2 Hierarchical Text Classification In text classification, the documents are often rep- resented with vector space model (VSM) (Salton et al.,...

Ngày tải lên: 20/02/2014, 05:20

5 392 0
Báo cáo khoa học: "Boosting-based parse reranking with subtree features" docx

Báo cáo khoa học: "Boosting-based parse reranking with subtree features" docx

... x i is an in- put sentence, and y i is a correct parse associated with the sentence x i . • Let Y(x) be a function that returns a set of candi- 189 date parse trees for a particular sentence x. • ... the branch-and-bound algorithm find new features that are not in the cache. 5 Experiments 5.1 Parsing Wall Street Journal Text In our experiments, we used the same data set that used i...

Ngày tải lên: 08/03/2014, 04:22

8 317 0
Báo cáo khoa học: "A Bottom-up Approach to Sentence Ordering for Multi-document Summarization" ppt

Báo cáo khoa học: "A Bottom-up Approach to Sentence Ordering for Multi-document Summarization" ppt

... contin- uous k sentences. A text with sentences arranged in proper order does not interrupt a human’s reading while moving from one sentence to the next. Hence, the qual- ity of a sentence ordering ... consider a bottom-up approach in arrang- ing sentences. Starting with a set of segments ini- tialized with a sentence for each, we concatenate two segments, with the strongest a...

Ngày tải lên: 31/03/2014, 01:20

8 239 0
Báo cáo khoa học: "Inducing Combinatory Categorial Grammars with Genetic Algorithms" docx

Báo cáo khoa học: "Inducing Combinatory Categorial Grammars with Genetic Algorithms" docx

... turn, combines with the np “Ronaldinho” to form a sentence. The example illustrates the rule of Application, denoted with < and > in derivations. The schemata for this rule, along with the Composition ... are accessible to the language learner; the current pro- posal is preoccupied with grammar induction from unannotated text, and assumes (sentence- level) log- ical forms to...

Ngày tải lên: 31/03/2014, 01:20

6 365 0
Báo cáo khoa học: "Probabilistic Document Modeling for Syntax Removal in Text Summarization" ppt

Báo cáo khoa học: "Probabilistic Document Modeling for Syntax Removal in Text Summarization" ppt

... Python NLTK li- brary. 6 For SumBasic without stop-word removal (SB-), we obtain 3.8 R-2 and 6.2 R-SU4 (with the -s flag). 7 With stop-words removed from the sentence scoring calculation (SumBasic), ... Allocation and models long-range the- matic word dependencies with a set of topics, while short-range (sentence- wide) word dependencies are modeled with syntax classes using a Hidden...

Ngày tải lên: 07/03/2014, 22:20

6 449 0
Báo cáo khoa học: "acquiring and structuring semantic information from text" pdf

Báo cáo khoa học: "acquiring and structuring semantic information from text" pdf

... today, together with comparisons to related work. We conclude with a discussion on extending the MindNet methodology to the processing of other corpora (specifically, to the text of the Microsoft ... exploited, including domain information associated with particular senses (e.g., Baseball). In processing normal input text outside of the context of MindNet creation, WSD reli...

Ngày tải lên: 17/03/2014, 07:20

5 264 0
Tài liệu Báo cáo khoa học: "Identifying Text Polarity Using Random Walks" pptx

Tài liệu Báo cáo khoa học: "Identifying Text Polarity Using Random Walks" pptx

... the task of identifying text that present opinions as op- posed to objective text that present factual in- formation (Wiebe, 2000). Text could be either words, phrases, sentences, or any other ... phrases are identified without consider- ing their context (Wiebe, 2000; Hatzivassiloglou and Wiebe, 2000; Banea et al., 2008). In the sec- ond category, the context of subjective text is use...

Ngày tải lên: 20/02/2014, 04:20

9 450 0
Tài liệu Báo cáo khoa học: "A Text Input Front-end Processor as an Information Access Platform" doc

Tài liệu Báo cáo khoa học: "A Text Input Front-end Processor as an Information Access Platform" doc

... example sentence resources. Like a Kana-Kanji conversion front-end processor used to input Japanese language text, this tool is also implemented as a front-end processor and can be combined with ... from text as it is being input into the tool, and these words are used to locate information relevant to the input text. This information is then automatically displayed to the use...

Ngày tải lên: 20/02/2014, 18:20

5 385 0
Tài liệu Báo cáo khoa học: "Untangling Text Data Mining" ppt

Tài liệu Báo cáo khoa học: "Untangling Text Data Mining" ppt

... parts of the text manipulation process and to integrate un- derlying computationally-driven text analysis with human-guided decision making within ex- ploratory data analysis over text. References ... since this applica- tion uses metadata associated with text docu- ments, rather than the text directly, it is un- clear if it should be considered text data min- ing or st...

Ngày tải lên: 20/02/2014, 18:20

8 336 0
Từ khóa:
w