bootcat bootstrapping corpora and terms from the web

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

... indidates that the term passed the test. Twenty terms out of the thrity candidate terms passed the first techinical-term test (Tech.) and six- teen terms out of the twenty terms passed the second relation ... target terms that should be collected from each seed word, and then checked whether each of the target terms was included in the system output. We counted the number of tar- get terms in the following ... collected 610 terms in total; the average number of output terms per input is 12.2 terms. We checked whether each of the 610 terms is a correct related term of the original seed term by hand. The result...

Ngày tải lên: 20/02/2014, 16:20

4 437 0
Tài liệu Báo cáo khoa học: "Extraction and Approximation of Numerical Attributes from the Web" pdf

Tài liệu Báo cáo khoa học: "Extraction and Approximation of Numerical Attributes from the Web" pdf

... evaluation, since the nature of the data is different from that of the QA dataset. Most of the questions asked over the Web target named entities like specific car brands, places and actors. There is usually ... strong PMI with the ob- ject (this can be estimated using any fixed corpus). However, this is not essential. We then extract new terms from the retrieved web snippets and use these terms iteratively ... attributes from the Web and attempt to deal with ambiguity and noise of the retrieved attribute values. (Aramaki et al., 2007) utilize a small set of patterns to extract physical object sizes and use the...

Ngày tải lên: 20/02/2014, 04:20

10 466 0
Báo cáo y học: "introduction to special issue on Eye and Zoonosis – from the guest editors"

Báo cáo y học: "introduction to special issue on Eye and Zoonosis – from the guest editors"

... acquired and con- genital infections in presumed ocular toxoplas- mosis. Am J Ophthalmol 2008;146:851-5). ã Optical Coherence Tomography in ocular toxoplasmosis ã Usefulness of vitrectomy in the ... treatment of ocular toxoplasmosis ã Update on the treatment of ocular toxoplasmo- sis. We hope that this special issue will be interesting to readers and provides researchers with timely up- date ... researchers with timely up- date on various topics in this important field. Conflict of Interest The authors have declared that no conflict of in- terest exists. ...

Ngày tải lên: 03/11/2012, 11:11

2 617 0
Silychristin and isosilychristin from the fruits

Silychristin and isosilychristin from the fruits

... [M-H] + in the negative spectra. All NMR assignments of 2 were made carefully from HSQC and HMBC and from the comparison with those of 1 as shown in table 2. The H-C long-range correlations in the ... confirming the positions of C-, C- and C- of two oximethine and oximethylene groups. The selected H-C correlations in the HMBC spectrum of 1 were shown in Fig. 2. Furthermore, the negative ... silychristin (1) and isosilychristin (2) isolated from the fruits of Silybum marianum (L.) Gaertn. cultivated in the North of Vietnam. The structures were elucidated by analyses of the NMR and ESI...

Ngày tải lên: 07/11/2012, 15:53

5 408 0
Tài liệu Unix Use and Security From The Ground Up_ The Prophet pdf

Tài liệu Unix Use and Security From The Ground Up_ The Prophet pdf

... shows the name of the owner's group .The sixth field shows the size of the file. the seventh field shows the time and date the file was last modified. the last field shows the name of the file ... execute the program, and then disconnect from the system. Soon, some unlucky user will call the system and be switched into the detached account's tty. When they enter their username and password, ... directory, the uid shell is created. Another good idea is to set the name of the trojan to a command in the user's login file, have it make the uid shell, execute the real command, and then delete...

Ngày tải lên: 21/12/2013, 04:19

50 551 0
12 Powerful Ideas On Creativity and Business From The PSFK Conference

12 Powerful Ideas On Creativity and Business From The PSFK Conference

... Despite an understandably reduced crowd (thanks to the Hurricane), we all gathered at the Kabuki Sundance Theater in San Francisco as branding gurus, designers, architects, educators and ent repreneurs ... repreneurs all took the stage to share groundbreaking new ideas on the future of business and culture. In a few weeks, all the video from the event will be available online, but in the meant ime – ... quick glimpse behind the scenes at building both brands. We were treated to an image of the whiteboard where he and his team first laid out the principle and insight behind the “t hird place” strategy...

Ngày tải lên: 09/02/2014, 20:13

5 310 0
Tài liệu Báo cáo khoa học: "Learning to Find Translations and Transliterations on the Web" doc

Tài liệu Báo cáo khoa học: "Learning to Find Translations and Transliterations on the Web" doc

... compute the frequency of all the candidates identified in all snippets, and output the one with the highest frequency. 4 Experiments and Evaluation We extracted the Wikipedia titles of English and ... in the same way as done in the training phase. We then use the trained model to tag the snippets, and extract translation candidates by identifying consecutive Chinese tokens labeled as B and ... translations and transliterations on the Web for a given term. The approach involves using a small set of terms and translations to obtain mixed-code snippets from a search engine, and automatically...

Ngày tải lên: 19/02/2014, 19:20

5 532 1
Tài liệu Báo cáo khoa học: "Names and Similarities on the Web: Fact Extraction in the Fast Lane" ppt

Tài liệu Báo cáo khoa học: "Names and Similarities on the Web: Fact Extraction in the Fast Lane" ppt

... documents from which the candidate fact is extracted; and c) the pattern- based scores of the candidate fact. The latter fea- ture converts the scores of the patterns extracting the candidate fact into ... over the entire set of names from the gold standard. For the Gold A set, the size of the ∩Gold set of person names changes little when the facts are ex- tracted from chunk W 1 vs. W 2 vs. W 3 . The ... and [CD] for the left and right sides respec- tively. The infix occurs in all sentences. How- ever, the matching of the part-of-speech tags of the sentence sequences to the left and right of the...

Ngày tải lên: 20/02/2014, 12:20

8 489 0
AKBAR, EMPEROR OF INDIA A PICTURE OF LIFE AND CUSTOMS FROM THE SIXTEENTH CENTURY ppt

AKBAR, EMPEROR OF INDIA A PICTURE OF LIFE AND CUSTOMS FROM THE SIXTEENTH CENTURY ppt

... grandfather Baber before him, he had many bitter battles with them, for no other Indian people had opposed him so vigorously as they. Their domain blocked the way to the south, and from their ... brand Islam and its supreme contempt for followers of other faiths, with one of the greatest stains in the history of humanity. When a tax-collector gathered the taxes of the Hindus and the ... and these into provinces, administrative districts and lesser subdivisions, and governed the revenues of the empire on the basis of a uniformly exact survey of the land. He introduced a standard...

Ngày tải lên: 06/03/2014, 12:20

38 673 1
Báo cáo khoa học: The fabp4 gene of zebrafish (Danio rerio) ) genomic homology with the mammalian FABP4 and divergence from the zebrafish fabp3 in developmental expression pot

Báo cáo khoa học: The fabp4 gene of zebrafish (Danio rerio) ) genomic homology with the mammalian FABP4 and divergence from the zebrafish fabp3 in developmental expression pot

... specificity and affinity [10]. In addition to the similar three- dimensional structure and ligand-binding specificity and affinity of FABP3 and FABP4, the transcripts and proteins of the two paralogous ... fabp4) from the zebrafish genome. The polypeptide sequence encoded by zebrafish fabp4 showed highest identity to the H ad - FABP or H6-FABP from Antarctic fishes and the putative orthologs from other ... zebrafish fabp4 and fabp3 and other fish and mammalian FABP genes was performed using clustalx [43]. The Antarctic fish H6-FABP and H8- FABP sequences were included in this analysis, and the putative...

Ngày tải lên: 07/03/2014, 10:20

13 478 0
SCULPTURE AND INSCRIPTIONS FROM THE MONUMENTAL ENTRANCE TO THE PALATIAL COMPLEX AT KERKENES DAG, TURKEY pot

SCULPTURE AND INSCRIPTIONS FROM THE MONUMENTAL ENTRANCE TO THE PALATIAL COMPLEX AT KERKENES DAG, TURKEY pot

... Sculpture from Sardis: The Finds through 1975    The Highlands of Phrygia: Sites and Monuments Hostetter, ... Limestone from Crete and Mainland Greece     The ... Plastik    Gods, Demons and Symbols of Ancient Mesopotamia: An Illustrated Dictionary    The Greeks Overseas: Their Early Colonies and Trade ...

Ngày tải lên: 07/03/2014, 13:20

212 333 0
Báo cáo khoa học: Molecular cloning, expression analysis and functional confirmation of ecdysone receptor and ultraspiracle from the Colorado potato beetle Leptinotarsa decemlineata pdf

Báo cáo khoa học: Molecular cloning, expression analysis and functional confirmation of ecdysone receptor and ultraspiracle from the Colorado potato beetle Leptinotarsa decemlineata pdf

... be 64 kDa and 49 kDa, respectively, from the mobility in the gel. The 64 kDa LdEcR-A protein was consistent with the predicted size from the deduced amino acid sequence (63.4 kDa). In the lane of ... calculated from the saturation curve of the specific binding and the Scatchard plot (Fig. 5). The K D values of LdEcR-A and LdEcR-A ⁄ LdUSP cal- culated from saturation curves were 72.6 and 2.8 nm, respectively. Receptor-binding ... [29–33], and were also reported for a dipteran, the yellow fever mosquitoe Aedes aegypti [34], and lepidopterans, the tobacco hornworm Manduca sexta [35–37] and the silkworm Bombyx mori [38,39]. On the...

Ngày tải lên: 07/03/2014, 21:20

15 564 0
Báo cáo khoa học: "A DOM Tree Alignment Model for Mining Parallel Data from the Web" doc

Báo cáo khoa học: "A DOM Tree Alignment Model for Mining Parallel Data from the Web" doc

... (1) Given a web site, the root page and web pages directly linked from the root page are downloaded. Then for each of the downloaded web page, all of its anchor texts (i.e. the hyperlinked ... English-Chinese parallel data from the web. The mining procedure is initiated by acquiring Chinese website list. We have downloaded about 300,000 URLs of Chinese websites from the web directories at ... that, using the new web mining scheme, the web mining throughput is increased by 32%; (ii) The quality of the mined data is improved. By lever- aging the web pages’ HTML structures, the sen- tence...

Ngày tải lên: 08/03/2014, 02:21

8 435 0
Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx

Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx

... NP F “a(x) x and other” NP QT (,)? and other NP F “a(x) x or other” NP QT (,)? or other NP F Plural “such as p(x)” NP F such as NP QT “p(x) and other” NP QT (,)? and other NP F “p(x) or other” NP QT (,)? ... evaluation measures. Then we describe the creation of the gold standard. Further, we present the results of the com- parison of the different ranking measures with re- spect to the gold standard. Finally, ... measures. On the one hand, we explore measures which use the Web to calculate the corre- lation strength between a qualia term and its qualia elements. These measures are Web- based versions of the Jaccard...

Ngày tải lên: 08/03/2014, 02:21

8 379 0
Báo cáo khoa học: "Mining Parenthetical Translations from the Web by Word Alignment" potx

Báo cáo khoa học: "Mining Parenthetical Translations from the Web by Word Alignment" potx

... pairs, where the translation of the in-parenthesis terms is a suffix of the pre-parenthesis text. The lengths and frequency counts of the suffixes have been used to determine what is the translation ... et al. (2001) made the first proposal to mine translations from the web. Their work was concentrated on terminologies, and assumed the English terms were given as input. Wu and Chang (2007), ... + K, where C is the length of the Chinese text, E is the length of the English text in the parentheses and K is a constant (we used K=6 in our experiments). The lengths C and E are measured...

Ngày tải lên: 17/03/2014, 02:20

9 612 0
w