... from the rest of the text. We therefore concentrated on building protein name extractors and interaction extractors in parallel so that the results of the former analysis could be fed into the ... model, then there is a true underlying reason for the proteins to be co-cited: they interact at the functional or pathway level, are co-localized, or physically interact. The ... using the Bayesian estimator [36] we enrich further for physical interactions, but at the expense of coverage. Among the disadvantages are that the algorithm enriches for certain types of errors (for...
Uploaded: 14/08/2014, 14:21
... Of these clones, 5,573 were sequenced in duplicate (mostly both times from the 5' end, with the exception of 77 clones that were sequenced from both the 3' and the 5' end). The primer used for the ... regions of the pre-mRNAs. Alternatively, transcripts may lack large ORFs because they are short or because they are noncoding RNAs (that is, transcripts other than rRNA or tRNA that do not code for proteins)...
Uploaded: 14/08/2014, 17:22
Scientific report (document): "Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT"
... column represents the relevance of the corresponding feature across tasks/shards. The sum of the norms enforces a selection among features based on these norms. Consider for example the two 5-feature, ... at once. We split the data into 2290 shards for the ep runs and 141 shards for the nc runs, each shard holding about 1,000 sentences, which corresponds to the dev set size of the nc data set. 5.2 ... with other sparse features. Table shows results for algorithms and on the Europarl data (ep) for different devtest and test sets. Europarl data were used in all runs for training and for setting the...
Uploaded: 19/02/2014, 19:20
Scientific report: "Leveraging Reusability: Cost-effective Lexical Acquisition for Large-scale Ontology Translation"
... lexicon for automated translation of the rest of the thesaurus. The process begins by prioritizing keyword phrases for manual translation in terms of their value in accessing the collection and the ... segments to which the concept has been assigned. For parent nodes, the thesaurus value is the number of segments (if any) to which the node has been assigned, plus the average of the thesaurus value ... count the number of segments s2 to which n1 was assigned and add that count to the average of the thesaurus values for n3 and n4. At n2 we simply average the thesaurus values for n4 and n5. The...
Uploaded: 08/03/2014, 02:21
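The thesaurus-value rule quoted in the excerpt above can be illustrated with a small sketch. It assumes, as the excerpt suggests but does not fully state, that a leaf concept's value is simply its own segment count and that a parent's value is its own count plus the average of its children's values. The function name, the dictionaries, and the segment counts are hypothetical; only the node names n1 to n5 and their parent/child relations come from the excerpt.

```python
def thesaurus_value(node, children, segment_counts):
    """Own segment count plus the average thesaurus value of the children.

    Assumption: a node without children is scored by its segment count alone.
    """
    own = segment_counts.get(node, 0)   # segments assigned directly to this node
    kids = children.get(node, [])
    if not kids:                        # leaf concept
        return float(own)
    child_avg = sum(
        thesaurus_value(k, children, segment_counts) for k in kids
    ) / len(kids)
    return own + child_avg


# Hierarchy mirroring the excerpt: n1 has children n3 and n4; n2 has n4 and n5.
children = {"n1": ["n3", "n4"], "n2": ["n4", "n5"]}
# Hypothetical segment counts; n2 has no segments assigned directly.
segment_counts = {"n1": 2, "n3": 4, "n4": 6, "n5": 2}

v1 = thesaurus_value("n1", children, segment_counts)  # 2 + (4 + 6) / 2
v2 = thesaurus_value("n2", children, segment_counts)  # 0 + (6 + 2) / 2
print(v1, v2)
```

With these made-up counts, n1 combines its own segments with the child average, while n2 is scored by the child average alone, matching the two cases the excerpt describes.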
Scientific report: "A System for Large-Scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora"
... parser. We use the standard scripts supplied with RASP to output the set of GRs for the most probable analysis returned by the parser or, in the case of parse failures, the GRs for the most likely ... corresponds to the coindexation marked by : the subject of the VP is the NP of the PP. The only part of the feature structure which is not represented by the GRs is coindexation between the omitted ... output (rather than, for example, the filtering step). This is particularly evident for nouns, for which 15 of the 27 frames exemplified in the gold standard are missing in the classifier output. For adjectives...
Uploaded: 17/03/2014, 04:20
Scientific report: "The key role of semantics in the development of large-scale grammars of natural language"
... as the instrument ("means") which is used by the actor in order to perform the action denoted by the verb. The "gegen" (against) alternant (see example (11)), on the other hand, entails that the ... used by the actor in order to perform the action denoted by the verb, while the other alternant (see example (11)) entails that the location undergoes directed motion; it is moved by the actor ... predicate in that when the location argument is realized as the direct object of the predicate the locatum argument is optional, but when the locatum argument is realized as the direct object all...
Uploaded: 17/03/2014, 22:20
Scientific report: "Efficient Inference of CRFs for Large-Scale Natural Language Data"
... appropriate values for the setting function δ of the active set and for the constant value ω of the inactive set. These two problems are closely related. The size of the active set affects both the complexity ... McCallum, 2006). We formally describe here an efficient calculation of the α and β recursions for the forward-backward procedure. The forward value α_t(i) is the sum of the unnormalized scores for all partial ... 368 s for ours) and 7∼12 times faster for decoding (2.881 ms for MALLET, 5.028 ms for CRF++, and 0.418 ms for ours). This result demonstrates that learning and decoding CRFs for large-scale...
Uploaded: 23/03/2014, 17:20
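For readers unfamiliar with the forward value mentioned in the excerpt above, here is a minimal sketch of the standard dense forward recursion for a linear-chain model. It is not the paper's accelerated method: the active/inactive-set speedup is only hinted at in the excerpt and is not reproduced here. The function name, the score tables, and the toy numbers are all assumptions made for illustration.

```python
def forward(node_scores, trans_scores):
    """Plain forward recursion over unnormalized, non-negative scores.

    node_scores[t][i]  : score of state i at position t
    trans_scores[i][j] : score of moving from state i to state j
    Returns alpha, where alpha[t][i] sums the scores of all partial
    state sequences that end in state i at position t.
    """
    n_states = len(node_scores[0])
    alpha = [list(node_scores[0])]          # base case: position 0
    for t in range(1, len(node_scores)):
        prev = alpha[-1]
        alpha.append([
            node_scores[t][j]
            * sum(prev[i] * trans_scores[i][j] for i in range(n_states))
            for j in range(n_states)
        ])
    return alpha


# Toy two-state, two-position example with made-up scores.
node_scores = [[1.0, 2.0], [3.0, 1.0]]
trans_scores = [[1.0, 0.5], [2.0, 1.0]]

alpha = forward(node_scores, trans_scores)
Z = sum(alpha[-1])   # partition function: total score of all full sequences
print(alpha, Z)
```

Each step costs time quadratic in the number of states, which is the cost that approaches restricting the computation to a smaller set of states aim to reduce.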
Mining Console Logs for Large-Scale System Problem Detection
... component y_a, the projection on the abnormal subspace. The dashed line shows the threshold Q_α. The solid line with spikes is the SPE calculated according to Eq. (1). The circles denote the anomalous ... contrast to the normal use of decision trees, in our case the decision tree is constructed to explain the underlying logic of the detection algorithm, rather than the nature of the dataset. The decision ... message types out of the 379 possible. Many of the other message types only appear in the log when exceptional behavior happens, and therefore ... Approach: There are four steps in our approach for mining console...
Uploaded: 30/03/2014, 16:20
Scientific report: "Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation"
... the partitions of the words in the vocabulary. After loading the current clustering, it then randomly chooses a subset of these words of a fixed size. For each of the selected words the worker then ... also stored. Together with the counts stored with the vocabulary partitions, this allows for efficient updating of the terms in Eq. (10). The initial clustering together with all the required counts ... a word. Thus the algorithm scales linearly in the number of classes. The second difference is that B dominates the term B + N_v for most corpora and scales far less than linearly with the vocabulary...
Uploaded: 31/03/2014, 00:20
scalable decentralized object location and routing for large scale peer to peer systems
... along the path. A publisher sends data to the rendez-vous point via Pastry, again using the topicId as the key. The rendez-vous point forwards the data along the multicast tree formed by the reverse ... in the quality of the tables. Figure 10 shows the impact of failures and repairs on the route quality. The left bar shows the average number of hops before the failures; the middle bar shows the ... node in the leaf set whose nodeId is closest to the key (possibly the present node) (line 3). If the key is not covered by the leaf set, then the routing table is used and the message is forwarded...
Uploaded: 28/04/2014, 13:40
the economics of large scale infrastructure project finance an empirical examination of the propensity to project finance
... because the market prices their debt fairly. Therefore, the diversion of payoff from the existing creditors makes stockholders better off. The use of secured debt therefore increases the incentives for ... used only for the purpose they were intended for. Additionally, these are assets which do not require extensive managerial skills for performance (Esty (2003)). I therefore do not test this theory in ... from the redeployer. Since the entrepreneur is in control of the asset when the investment is made and before the state of the world is known, he is in a position to bargain with the redeployer. The...
Uploaded: 03/06/2014, 02:15
Chemistry report: "Adaptive antenna selection and Tx/Rx beamforming for large-scale MIMO systems in 60 GHz channels"
... resort to a simple stochastic gradient method for updating the beamformers. 4.1 Stochastic gradient algorithm for beamformer update. The algorithm for the beamformer update is a generalization of [14] ... as well as the actual largest eigenvalues of the selected antenna subsets are plotted for the first 200 iterations as a zoom-in view. The number of transmissions for obtaining the smoothed estimate ... until the selected subsets do not meet the requirement for a number of iterations, e.g., 50, which means the last n_t is the desired minimum size n_t^*. Therefore, by restoring the last backup data, the...
Uploaded: 21/06/2014, 01:20
Chemistry report: "Research Article: Distributed and Cooperative Link Scheduling for Large-Scale Multihop Wireless Networks"
... state information, power allocation, and the desired data rate. As long as the amount of local information is much smaller than the amount of information in the data packets to be transmitted, the ... convergence, P_l^(i)(k) for each l becomes nonzero for some (at least one) of k = 1, 2, ..., K, and zero for the rest of k = 1, 2, ..., K. Then for each k, there is a subset S_k of the network, corresponding ... transmissions is achieved under the given value of K. It should be obvious that the larger the value of K, the larger the maximum sparseness of concurrent cochannel transmissions. Once the original set of...
Uploaded: 22/06/2014, 06:20
Medical report: "Hybrid dynamic/static method for large-scale simulation of metabolism"
... = 0.01 s^-1 and kr = 0.001 s^-1 for the other reactions in the pathway of Figure 2a. The initial metabolite concentrations were 1.0 mM for A, B and C, and 0.5 mM for the other metabolites. Metabolite ... (blue). The results of these models were also in good agreement for the erythrocyte model (e). The reaction rates of the hybrid model differed only slightly from those of the dynamic model. The lines ... The maximum errors at the first integration step after the perturbation were 3.575 × 10^-7 for the metabolite concentrations and 0.00120 for the reaction rates. In contrast, the models did not agree...
Uploaded: 13/08/2014, 23:20
Medical report: "Network security and data integrity in academia: an assessment and a proposal for large-scale archiving"
... While the majority of these viruses and worms do not cause any data loss, with many simply written for the virus- or worm-writer’s amusement or to enable the writer to use other people’s computers for ... allows information to be generated at a much faster rate, the scale of digital information is vastly greater than physically printed information ever was, a fact that is causing headaches for the ... security-event counts for the first 198 days of 2002; the expanded region had a large increase in daily counts. Attack attempts are an everyday occurrence and there can be large spikes in attack...
Uploaded: 14/08/2014, 14:22
ON THE ANALYSIS OF LARGE-SCALE DATASETS TOWARDS ONLINE CONTEXTUAL ADVERTISING
... centroids for each group, is the best among them. This is probably due to the robustness of the training set built using a search engine. For the syntactic features, they used the tf-idf score and section score for ... improve the overall performance of the framework. To better understand the content of these short ads, they also carried out an experiment that considers the page pointed to by the ads ... are close to the others. The cosine of the angle between two strings measures their similarity. For a web page p and an ad message a, let w_pi be the weight associated...
Uploaded: 20/08/2014, 09:36
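The cosine measure mentioned in the excerpt above can be sketched as follows. This is an illustrative sketch only: the excerpt is cut off before it defines the weights, so the term-to-weight dictionaries, the function name, and the example weights are assumptions, and any weighting scheme (for example tf-idf) could be substituted.

```python
import math


def cosine_similarity(page_weights, ad_weights):
    """Cosine of the angle between two sparse term-weight vectors.

    Each argument maps a term to its weight (hypothetical representation).
    Returns 0.0 when either vector is empty or all-zero.
    """
    dot = sum(w * ad_weights.get(term, 0.0) for term, w in page_weights.items())
    norm_p = math.sqrt(sum(w * w for w in page_weights.values()))
    norm_a = math.sqrt(sum(w * w for w in ad_weights.values()))
    if norm_p == 0.0 or norm_a == 0.0:
        return 0.0
    return dot / (norm_p * norm_a)


# Made-up weights for a web page p and an ad message a.
page = {"cheap": 1.0, "flights": 2.0}
ad = {"flights": 1.0, "hotel": 1.0}

score = cosine_similarity(page, ad)   # shared term: "flights"
print(score)
```

A score near 1 means the page and the ad weight the same terms in similar proportions; a score of 0 means they share no weighted terms.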
dynamic workflow management for large scale scientific applications
... he is the person who should be congratulated before me for this thesis. I wish to thank my committee members for their support during the thesis. This thesis would not be possible without the contribution ... because of the high demand for computational and data resources. Large scale scientific applications are the main drivers for this demand since they involve a large number of simulations, and these simulations ... resources, some steps of the applications need large amounts of data transfers. The time consumed in data transfers may form a large portion of the application completion time. Therefore, computational...
Uploaded: 30/10/2014, 20:07
efficient communication and coordination for large-scale multi-agent systems
Uploaded: 14/11/2014, 06:38
Essay for the Advanced Databases course: Massive Parallel Processing for Large Scale Database
... http://db.cs.yale.edu/hadoopdb/hadoopdb.html http://www.cubrid.org/blog/dev-platform/database-technology-for-large-scale-data/ http://hadoop.apache.org/ http://davidmenninger.ventanaresearch.com/2011/01/19/secrets-revealed-in-massively-parallel-processing-and-database-technology/ ... (N:N): Each segment rehashes the target data (on the join column) and redistributes the rows to the corresponding segments. Gather Motion (N:1): Each segment sends its target data to a single node (always the master). b. Aster Data database: Database ... manages the process of loading data in parallel into the database system. Catalog: tracks the locations of the different data regions, including those replicated across multiple nodes. SQL-MapReduce-SQL (SMS): ...
Uploaded: 08/07/2015, 16:05
Essay for the Advanced Databases course: Massive Parallel Processing for Large Scale Database
... presentation outline: the need for MPP in large-scale databases; approaches to implementing MPP; some database systems that implement MPP. The need for MPP in large-scale databases: there are many services in the world ... http://db.cs.yale.edu/hadoopdb/hadoopdb.html http://www.cubrid.org/blog/dev-platform/database-technology-for-large-scale-data/ http://hadoop.apache.org/ Thank you very much! ...
Uploaded: 08/07/2015, 16:08