Big Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large Scale Data Analysis

... Transform Load (ETL), 107. F: Feature extraction and transformation, 172; Feature transformer, 197; Feedforward neural network, 166; filter method, 66, 68, 87, 125; flatMap method, 73, 87; F-measure, ... http://pyml.sourceforge.net/doc/howto.pdf; Breiman, Leo, Random Forests, https://www.stat.berkeley.edu/~breiman/randomforest2001.pdf; Cassandra, http://cassandra.apache.org; Chang, Fay, Jeffrey Dean, Sanjay ... provided by Spark are useful for both troubleshooting and optimizing application performance. The monitoring web UI helps you find configuration issues and performance bottlenecks. If a job is taking...

Uploaded: 04/03/2019, 13:44

Apress Big Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large Scale Data Analysis

... Transform Load (ETL), 107. F: Feature extraction and transformation, 172; Feature transformer, 197; Feedforward neural network, 166; filter method, 66, 68, 87, 125; flatMap method, 73, 87; F-measure, ... Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large Scale Data Analysis, Mohammed Guller. Big Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large-Scale ... http://pyml.sourceforge.net/doc/howto.pdf; Breiman, Leo, Random Forests, https://www.stat.berkeley.edu/~breiman/randomforest2001.pdf; Cassandra, http://cassandra.apache.org; Chang, Fay, Jeffrey Dean, Sanjay...

Uploaded: 17/04/2017, 15:20

Document: A Comparison of Approaches to Large-Scale Data Analysis (PDF)

... read from disk. Furthermore, the column-wise storage of data results in better compression factors (approximately a factor of 2.0 for Vertica, versus a factor of 1.8 for DBMS-X and 1.25 for Hadoop); ... system for optimal performance was done through trial and error. We found that certain parameters, such as the size of the sort buffers or the number of replicas, had no effect on execution performance, ... sizes of the buffer pool and sort heaps, were too conservative for modern systems. Furthermore, DBMS-X proved to be ineffective at adjusting memory allocations for changing conditions. For example,...

Uploaded: 19/02/2014, 12:20

Rstoolbox - a Python library for large-scale analysis of computational protein design data and structural bioinformatics

... contains functions for parsing the input data. The input functions in io generate one of the three data containers defined in the components module: DesignFrame for decoy populations, SequenceFrame for ... Specifically, we analysed how the design populations performed regarding energetic sampling (Fig 5a) and the mimicry of BINDI’s conformational shift from the original scaffold (Fig ... growth of different structural databases (e.g. PDB [34], SCOP [35], CATH [31]). Conclusion: Here, we present the rstoolbox, a Python library for the analysis of large-scale structural data tailored for...

Uploaded: 25/11/2020, 12:15

Resource aware load distribution strategies for scheduling divisible loads on large scale data intensive computational grid systems

... STRATEGIES FOR SCHEDULING DIVISIBLE LOADS ON LARGE-SCALE DATA INTENSIVE COMPUTATIONAL GRID SYSTEMS. SIVAKUMAR VISWANATHAN (M.Sc., National University of Singapore). A THESIS SUBMITTED FOR THE DEGREE OF ... green field for future research activities. Bibliography: [1] Foster, I., “The Grid: A New Infrastructure for 21st Century Science,” Physics Today, vol. 55, no. 2, pp. 42-47, Feb. 2002. [2] Foster, ... Institute for Infocomm Research (I²R) for supporting me during this part-time study. I am grateful to Dr Michael Li Ming, who convinced me to pursue a Ph.D. degree, Prof Wong Wai Choong Lawrence, Prof Lye...

Uploaded: 14/09/2015, 08:25

Context data management for large scale context aware ubiquitous systems

... Figure 16: Comparison of query response time for the different schemes; Figure 17: Comparison of query response time with different answer set sizes; Figure 18: Comparison of query ... Figure 21: PSG update operations for different network sizes; Figure 22: Contribution of cumulative updates in different ranges to the total updates; Figure 23: Variation of query ... LIST OF FIGURES. Figure: Coalition System architecture; Figure: Illustration of the concept of physical space; Figure: Overview of Coalition data management layer; Figure...

Uploaded: 30/09/2015, 05:44

A forest-based feature screening approach for large-scale genome data with complex structures

... investigated the performance of a forest-based feature screening approach for detecting epistatic, correlative, and polygenic effects for large-scale genome data. Besides the difficulties caused by ... parameter β(u) is useful to explain personalized covariate effects that vary for different individuals due to different genetic information and other factors [30]. Simulation: For Sim 4, we consider ... the average rank of all five causative features. For Sim 1, the first three features have linear marginal effects but x4 and x5 have interactive effects. The marginal effect of x1 is designed to...

Uploaded: 27/03/2023, 05:18

Document: Use of Event Data Recorder (EDR) Technology for Highway Crash Data Analysis (DOC)

... remarkable new data source for improvements in highway crash data analysis and research. There are however several difficult issues which may impede the use of EDR data for highway crash data analysis ... Approach F-1. List of Figures: Figure 2-1: Example of GM EDR pre-crash information; Figure 2-2: GM EDR record of Longitudinal Velocity vs Time; Figure 2-3: Ford Longitudinal ... recommended database formats, we conclude that a significant fraction of data elements currently being collected could be provided by either existing or future EDR data elements. For example, 56 of the...

Uploaded: 19/02/2014, 03:20

Document: Scientific report: "Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT" (PDF)

... $(w_{z,t,i,j})$; end for; $w_{z,t,i+1,0} \leftarrow w_{z,t,i,P}$; end for; end for. Collect/stack weights $W \leftarrow [w_{1,t,S,0} | \ldots | w_{Z,t,S,0}]^T$. Select top $K$ feature columns of $W$ by norm and for $k \leftarrow \ldots K$: $v[k] = \ldots$; end for; end for; return ... scaling discriminative training for SMT to large training sets. We deploy generic features for SCFG-based SMT that can efficiently be read off from rules at runtime. Such features include rule ids, rule-local ... $w_{z,t,i,0}$; for all pairs $x_j$, $j \in \{0, \ldots, P-1\}$: $w_{z,t,i,j+1} \leftarrow w_{z,t,i,j} - \eta \nabla l_j(w_{z,t,i,j})$; end for; $w_{z,t,i+1,0} \leftarrow w_{z,t,i,P}$; end for; $w_{z,t+1,0,0} \leftarrow w_{z,t,S,0}$; end for; end for. Collect final weights from each...

Uploaded: 19/02/2014, 19:20

Scalable decentralized object location and routing for large-scale peer-to-peer systems

... replace a failed node in the leaf set, its neighbor in the nodeId space contacts the live node with the largest index on the side of the failed node, and asks that node for its leaf table. For instance, ... performance in terms of the expected number of routing hops and the number of messages exchanged as part of a node join operation. This section focuses on another aspect of Pastry’s routing performance, ... appropriate entries for from node , the next node encountered along the route from to , and so on. Finally, transmits a copy of its resulting state to each of the nodes found in its neighborhood set, leaf set,...

Uploaded: 28/04/2014, 13:40

Chemistry report: "Adaptive antenna selection and Tx/Rx beamforming for large-scale MIMO systems in 60 GHz channels" (PPTX)

... method for updating the beamformers. 4.1 Stochastic gradient algorithm for beamformer update: The algorithm for the beamformer update is a generalization of [14] and is described as follows. ... requirement with minimum number of selected antennas. Performance of adaptive beamforming: Figure 8 shows one run of Algorithm 2 for adaptive transmit and receive beamforming upon a selected ... beamforming for high-speed data transmission. We assume that the number of RF chains is smaller than the number of antennas, which motivates the use of antenna selection to exploit the beamforming...

Uploaded: 21/06/2014, 01:20

Scientific report: "Large-scale data integration framework provides a comprehensive view on glioblastoma multiforme" (POT)

... characterization of complex diseases calls for coordinated efforts to collect and share genome-scale data from large patient cohorts. A prime example of such a coordinated effort is The Cancer ... efforts require software and computational tools to facilitate interpretation of the data. We have developed Anduril, an efficient and systematic data integration framework, to conduct large-scale ... to effective diagnosis, treatment and prevention strategies requires computational tools that are designed for large-scale data analysis as well as for the integration of multidimensional data...

Uploaded: 11/08/2014, 12:20

Scientific report: "Evolution of the M gene of the influenza A virus in different host species: large-scale sequence analysis" (PPTX)

... M1 for all hosts. ω of the entire coding region of the M gene for human and swine influenza was significantly higher (no overlap of 95% confidence intervals) than that for the avian influenza (Figure ... avian influenza (Figure 3). ω for both M1 and M2 of human influenza are also significantly larger than that for avian influenza (Figure 3). ... according to hosts, subtypes, geographical information, or temporal information using FigTree (ver. 1.1.2). Dataset of Influenza for Each Host: Datasets for each host (avian, canine/equine, human,...

Uploaded: 12/08/2014, 04:21

Medical report: "Hybrid dynamic/static method for large-scale simulation of metabolism" (PPTX)

... condition (-) or after a two-fold increase of metabolite A (+). ... of kinetic properties for large-scale metabolic pathways. Therefore, the applicability of the dynamic ... masses of information necessary for developing dynamic cell-scale simulation models. In addition, this DFBA study did not define the criteria for segmenting a whole metabolic pathway into parts defined ... a criterion for identifying groups of enzymes that can be approximated with sufficient accuracy by static modules. However, a large amount of experimental data ... Recently, a method for high-throughput...

Uploaded: 13/08/2014, 23:20

Medical report: "Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactome" (PPT)

... these data manually is made difficult by the large number of articles, all lacking formal structure. Automated extraction of information would be preferable, and therefore, mining data from Medline ... measures of accuracy that are useful for the evaluation and integration of upcoming datasets. We established two benchmarks for assessing the quality of large-scale human protein interaction datasets, ... 7,748 human proteins, forming a framework for the interpretation of human functional genomics data. These data are collected in the ID-Serve database [37], which can be queried for protein interactions...

Uploaded: 14/08/2014, 14:21

Determination of carbamate pesticides in selected fruits and vegetables by liquid chromatography-mass spectrometry (LC-MS)

... retention is governed by the forces F1, F2, and F3. Of these, F1 and F2 play the decisive role, while F3 has only a minor influence. Here F1 is the force holding the analyte on the column, F2 is the force with which the mobile phase pulls the analyte off the column, and F3 is the interaction force between the ... phase ... continuous-flow fast atom bombardment (CF-FAB) or thermospray (TS), which require low pressure. One advantage of API is soft ionization, which does not break up the structure ... an intermediate stage called the interface. Many interface technologies, such as particle beam (FB) and continuous-flow fast atom bombardment (CF-FAB), ... were studied and applied; in the late 1980s there was a breakthrough...

Uploaded: 18/06/2015, 10:16

Master's thesis: Simultaneous determination of nitrofuran antibiotic residues in selected fresh foods in Hanoi by liquid chromatography-tandem mass spectrometry (LC-MS/MS)

... coccidiosis, blackhead disease [12]. 1.1.2 Nitrofuran compounds: The nitrofuran group includes furazolidone, furaltadone, furazolidone, and nitrofurazone; once inside an organism they form metabolites ... liquid chromatography-mass spectrometry; LOD: limit of detection; LOQ: limit of quantification; MeOH: methanol; minimum required performance limit of the method ... nitrofuran in selected fresh foods in Hanoi by liquid chromatography-mass spectrometry LC/MS/MS". CHAPTER 1: OVERVIEW. 1.1 Introduction to nitrofuran antibiotics. 1.1.1 What is the nitrofuran group? Nitrofuran...

Uploaded: 24/10/2015, 14:33

Simultaneous determination of nitrofuran antibiotic residues in selected fresh foods in Hanoi by liquid chromatography-tandem mass spectrometry (LC-MS/MS)

... (2009), Analysis of nitrofurans in animal tissues and food of animal origin by LC/MS/MS, SOP FSG341. 18. E. Horne, A. Cadogan, M. O'Keeffe, L.A.P. Hoogenboom (1996), Analysis of protein-bound metabolites of furazolidone ... monitoring for the simultaneous determination of five nitrofurans (furazolidone, furaltadone, nitrofurazone, nitrofurantoine, nifursol) in poultry muscle tissue through the detection of their five ... P. Gowik (2007), Validation of a confirmatory method for the determination of residues of four nitrofurans in egg by liquid chromatography-tandem mass spectrometry with the software interval, Analytica...

Uploaded: 10/02/2014, 20:55

Study of a method for quantifying diltiazem in human plasma by liquid chromatography-mass spectrometry (LC-MS)

... liquid-liquid extraction: + Chloroform gives high extraction efficiency for DTZ and FELO. However, chloroform readily forms an emulsion at the interface, and because its density is > 1 it settles as the lower layer, which makes ... difficult ... liquid-liquid extraction method: diethyl ether, chloroform, and mixtures of diethyl ether and chloroform in different ratios were selected to extract DTZ and FELO, following the procedure summarized in the diagram in Figure 3.11 mL plasma sample ... Alkalinize to pH ÷ 10; add 5-7 mL extraction solvent*; shake mechanically, centrifuge. * Extraction solvent: chloroform, diethyl ether, or chloroform:diethyl ether at ratios of 3:7, 2:8, ... (v/v). ... mL of the solvent layer. Evaporate the solvent (N2, t °C). Layer...

Uploaded: 27/07/2014, 07:02

Standardization of a method for analyzing β-agonists by quadrupole liquid chromatography-mass spectrometry and its application to residue analysis in pork from several provinces in northern Vietnam

... FAO: Food Agricultural Organization; HPLC: High-performance liquid chromatography; ISO: International Standard Organization; LOD: Limit of ... fenoterol, formoterol, isoproterenol, salmeterol, terbutaline, fenoterol, metaproterenol, terbutaline, isoetarine, pirbuterol, procaterol, ritodrine, broxaterol, cinaterol, denopamine, etilefrine, ... of the 1980s, the United States banned the use of clenbuterol in livestock feed. The food and drug agency FDA, FSIS (Food Safety and Inspection ... Hanoi University of Agriculture, Master's thesis in agricultural science...

Uploaded: 04/10/2014, 17:21

