... dispersed liquid crystal devices.[6,7] Their work demonstrated the potential to fractionate commercially available liquid crystal mixtures. Following from this, the first carbon dioxide extractions ... dioxide extraction was carried out using a Thar Technologies SFE-500F-2-FMC System and liquid withdrawal carbon dioxide (99.9 %). The extraction vessel was charged with 300 g of glass from an LCD ... conditions of 250 bar and 40 °C the extraction recovery from complete screens was poor, 0.1% of the total liquid crystal present. The removal of a resin seal from the outer edge of the LCD did...
... the contour by scanning from outer sides towards center. Studying these background pixels will give us knowledge on which part of the histogram is from background and which from text. Then the binarization ... T Watanabe, Character Extraction from Noisy Background for an Automatic Reference System, ICDAR 1999, pp. 143-146 [4] P Matti and O Okun, Edge-Based Method for Text Detection from Complex Document ... overcoming the following difficulties for extracting text from name cards: 1) Variation of background color and text color (varying from line to line); 2) Complex graphical foregrounds like logos...
... open domain event extraction within Twitter, there are two key related strands of research: extracting specific types of events from Twitter, and extracting open-domain events from news [43] Recently ... e,i end for for each date which co-occurs with e, i = Nd d Generate ze,i from Multinomial(θe ) Generate the date de,i from Multinomial(βzn ) listed in Figure (not including the type) We then ... Dir(α) for each entity which co-occurs with e, i = Ne n Generate ze,i from Multinomial(θe ) Generate the entity ne,i from Multinomial(βzn ) TwiCal-Classify Supervised Baseline 0.0 0.2 0.4 0.6...
... sources from existing, mature components within the translation process. This paper presents a method of phrase extraction from alignment data generated by IBM Models. By working directly from alignment ... quality. We estimate translation confidence by measures from three models; the estimation from the maximum approximation (alignment map), estimation from the word based translation lexicon, and language ... level translation estimate, motivating the alignment model as a starting point for phrasal extraction. The extraction technique must be able to handle alignments that are only partially correct,...
... percent of all U.S. coal production comes from fewer than 100 mines in the West. The remaining 42 percent derives from a mix of 10 Producing Liquid Fuels from Coal: Prospects and Policy Issues Table ... 42 Timeline for Coal-to-Liquids Development 46 vii viii Producing Liquid Fuels from Coal: Prospects and Policy Issues CHAPTER ... greenhouse-gas emissions from a CTL plant would be about twice those associated with fuels produced from conventional crude oils. Slightly higher values would result from less efficient CTL plants...
... neighborhood of 60-70% (Huang et al., 2000). The task that is most similar to our work is named entity extraction from speech data (DARPA, 1999). Although the goal of the named entity task is similar - to ... stochastic transducer induction. It aims to learn rules automatically from training data instead of requiring hand-crafted rules from experts. Although the results with this system are not yet as ... identity and leave the actual specific names later for extraction T Maximum entropy modeling is a powerful framework for constructing statistical models from data. It has been used in a variety of difficult...
... Basili and Maria Teresa Pazienza 1997 Lexical acquisition for information extraction In M T Pazienza, editor, Information Extraction: A multidisciplinary approach to an emerging information technology ... Morgan Kaufmann Publishers Ralph Grishman 1997 Information extraction: Techniques and challenges In M T Pazienza, editor, Information Extraction: a multidisciplinary approach to an emerging technology ... background lexicons and word sense disambiguation for information extraction In International Workshop on Lexically Driven Information Extraction, Frascati, Italy G.A Miller 1990 Wordnet: an on-line...
... scenarios defined in the toolkit: “parallel data mining from comparable corpora” and “named entity/terminology extraction and mapping from comparable corpora”. The next section provides a general ... parallel sentence pairs are extracted from the aligned comparable corpora (section 2.2). The workflow for named entity (NE) and terminology extraction and mapping from comparable corpora extracts data ... LEXACC requires aligned document pairs (also m to n alignments) for sentence extraction. It also allows extraction from comparable corpora as a whole; however, precision may decrease due to larger...
... 2004) to extract keywords from tweets to tag users Topic discovery from Twitter is also related to our work (Ramage et al., 2010), but we further extract keyphrases from each topic for summarizing ... incorporating users’ interests for topical keyphrase extraction To the best of our knowledge, our work is the first to study how to extract keyphrases from microblogs We perform a thorough analysis ... Work Our work is related to unsupervised keyphrase extraction Graph-based ranking methods are the state of the art in unsupervised keyphrase extraction Mihalcea and Tarau (2004) proposed to use...
... of rare lexicon extraction. There are few previous works focusing on the extraction of rare word translations, especially from comparable corpora. One of the earliest works is from (Pekar et al., ... our knowledge, this is one of the first high-accuracy extractions of rare lexicon from non-parallel documents. We obtained an F-measure ranging from about 80% (French-English, Chinese-English) to 97% ... translations from less known languages, using a well known language as training. We used the same algorithms and same features as in the previous sections, but used the data computed from one pair...
... from the source to all the English words (including the empty one), edges from all the French words (including the empty one) to the sink, an edge from the sink to the source, and edges from ... (type 1), or through two edges, one from bandwidth to largeur de bande, and one from bandwidth to either largeur or hap.de (type 2), or even through the two edges from bandwidth to largeur and bande ... parameters, multiword notions, or information on part-of-speech, information derived from bilingual dictionaries or from thesauri. The integration of new parameters is in general straightforward. For...
... subsets of GI-H4 that are characterized by a different distance from the core of the lexical category of sentiment Sentiment Tag Extraction from WordNet Entries Word lists for sentiment tagging applications ... performance gain (from 66.5% to 73.4%) associated with the removal of neutrals from the evaluation set emphasizes the importance of neutral words as a major source of sentiment extraction system ... (1) describe the proposed approach used to extract sentiment information from WordNet entries using STEP (Semantic Tag Extraction Program) algorithm, (2) discuss the overall performance of STEP...
... corpus under study Results obtained from the first experiments confirm the usefulness of a morphological pattern based approach for the extraction of terms from domain-specific corpora and especially ... (“oncologist”) share an initial substring of length Moreover the terms “neuro-oncology” from F1 and “neurooncologist” from F2 contain the combining form “neuro” Families F1 and F2 are therefore united ... “volcano” 3.3 Terms The overlap percentage between the list of terms and the list of key words ranges from 38.65% (V fr) to 56.92% (V en) of the total amount of terms extracted If we compare both the...
... biologists 1.2 Information extraction We are using information extraction methods to automatically extract named entity properties, events and other domain-specific concepts from MEDLINE abstracts ... the information extraction programs. Our interface provides a link to the information extraction programs as well as clickable links to aid in querying for related information from publicly ... developing called Ontology Extraction-Maintenance System (OEMS). OEMS extracts three types of information about the domain-ontology, (Ogata, 1997), called typing information, from the abstracts: taxonomy...
... discourse analysis for information extraction Data & Knowledge Engineering, 55(1):59-83 H.L Chieu and H.T Ng 2002 A Maximum Entropy Approach to Information Extraction from Semi-Structured and Free ... Adaptive Information Extraction from Text by Rule Induction and Generalization In Proc of IJCAI-2001 A Culotta and J Sorensen J 2004 Dependency tree kernels for relation extraction In Proc of ... domain are given in Table 594 Synset 22 ID Table Linguistic features for anchor extraction Given an input phrase P from a test sentence, we need to classify if the phrase belongs to anchor cue...
... Unsupervised named-entity extraction from the Web: An experimental study Artificial Intelligence 165(1): 91-134 Feldman, R and B Rosenfeld (2006) Boosting Unsupervised Relation Extraction by Using ... (2006) Self-Supervised Relation Extraction from the Web ISMIS-2006, Bari, Italy Hasegawa, T., S Sekine and R Grishman (2004) Discovering Relations among Named Entities from Large Corpora ACL 2004 ... relation extraction accuracy However, the background techniques of our methods are relatively simple and known The validation is based on the same ideas that underlie semi-supervised entity extraction...
... problem of identifying story highlight lies somewhere between keyword extraction and single-document summarization. The KEA keyphrase extraction system (Witten et al., 1999) mainly relies on purely ... feature extraction and estimation 3.1 Training Data In order to determine the features used for predicting which sentences are the sources for story highlights, we gathered statistics from 1,200 ... can be seen in Figure Only half of the highlights stem from sentences in the first fifth of the article. Nevertheless, selecting sentences from only the first few lines is not a sure-fire approach...
... derived from phrase structure trees and dependency trees in conjunction with Support Vector Machines (SVMs) to solve the tasks For the design of structures and type of kernel, I took motivation from ... system for relation extraction I tried all the kernels and their combinations proposed by Nguyen et al (2009) I used syntactic and semantic insights to devise a new structure derived from dependency ... goal of creating a system that can extract social networks from a wide variety of texts I will then attempt to extract social networks from the increasing amount of text that is becoming machine...
... to project words from each language This sub-space plays the same role as the sub-space defined by translation pairs in the standard method, although with CCA, it is derived from the corpus via ... crosslingual information retrieval system from parallel corpora We show here that it can be used to infer language-independent semantic representations from comparable corpora, which induce a similarity ... combination we used here Conclusion We have shown in this paper how the problem of bilingual lexicon extraction from comparable corpora could be interpreted in geometric terms, and how this view led to...
... results from a linguistic point of view, is the possibility of automatically creating corpus defined thesauri, as can be seen above in the differences between relations extracted from medical and from ... information science corpora. In conclusion, we feel that this fine grained approach to context extraction from large corpora, and similarity calculation employing those contexts, even using imperfect ... Automatic acquisition of hyponyms from large text corpora COLING'92, Nantes, France, July 92 (Jacobs and Zernik 1988) P S Jacobs and U Zernik Acquiring lexical knowledge from text: A case study In...