• No results found

[PDF] Top 20 MapReduce for Data Intensive Scientific Analyses

Has 10000 "MapReduce for Data Intensive Scientific Analyses" found on our website. Below are the top 20 most common "MapReduce for Data Intensive Scientific Analyses".

MapReduce for Data Intensive Scientific Analyses

MapReduce for Data Intensive Scientific Analyses

... MapReduceMapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a ... See full document

25

Map Reduce for Data Intensive Scientific Analyses

Map Reduce for Data Intensive Scientific Analyses

... using MapReduce for textual data processing using Hadoop, we could not find any schemes for using MapReduce for these types of ...the data for the MapReduce tasks to be in its ... See full document

8

Map Reduce for Data Intensive Scientific Analyses

Map Reduce for Data Intensive Scientific Analyses

... two MapReduce implementations scale as the number of processing units ...the data at 100 GB, and measured the execution time by varying the number of nodes in the ...of data using a sequential ... See full document

8

MapReduce and Data Intensive Applications

MapReduce and Data Intensive Applications

... terabyte data is generated from scientific experiments, commercial online data collecting, and physical digital device ...collected data are generally analyzed according to different ... See full document

10

Energy Efficient Data-Intensive Computing With Mapreduce

Energy Efficient Data-Intensive Computing With Mapreduce

... 2.2 Related Work Profiling and Modelling Tool Physical power measurement, the only means for obtaining the first-hand power data for real systems and components, has been either coarse-grained or intrusive. Node ... See full document

75

Scientific data processing framework for Hadoop MapReduce

Scientific data processing framework for Hadoop MapReduce

... scale scientific data centers to provide stable web search services with high quality of response time and ...Currently scientific workflows assist scientists and programmers with tracking their ... See full document

5

Data Management Challenges of Data-Intensive Scientific Workflows

Data Management Challenges of Data-Intensive Scientific Workflows

... some scientific disciplines, such as astronomy, there are standard data formats [2] that include metadata about an image as part of the image file ...image data and write new images in the same ... See full document

6

A Survey of Data-Intensive Scientific Workflow Management

A Survey of Data-Intensive Scientific Workflow Management

... where scientific workflows will need to be deployed at several sites, ...the data accessed by the workflow is in different research groups’ databases in different sites or because the workflow execution ... See full document

44

Scientific data mining and processing using MapReduce in cloud environments

Scientific data mining and processing using MapReduce in cloud environments

... processing scientific data has enabled the development of digital resources and web-based services that facilitate uses of data beyond those that may have been envisioned by the original data ... See full document

7

SURVEY ON SCIENTIFIC DATA MANAGEMENT USING HADOOP MAPREDUCE IN THE KEPLER SCIENTIFIC WORKFLOW SYSTEM

SURVEY ON SCIENTIFIC DATA MANAGEMENT USING HADOOP MAPREDUCE IN THE KEPLER SCIENTIFIC WORKFLOW SYSTEM

... Keywords: Mapreduce; Hadoop; Scientific Workflow; Parallel Processing; Actor-Oriented Modeling ...Currently scientific workflows assist scientists and programmers with tracking their data ... See full document

5

Clouds and MapReduce for Scientific Applications

Clouds and MapReduce for Scientific Applications

... required data samples with a throughput of 1 trillion base pairs per day and this rate will ...and data pipeline is shown in figure 1 with sequencers producing DNA samples that are assembled and subject to ... See full document

5

Scheduling Data Intensive Workloads through Virtualization on MapReduce based Clouds

Scheduling Data Intensive Workloads through Virtualization on MapReduce based Clouds

... a MapReduce job J to process data of size Φ and be completed within the Completion time D, how many map/reduce slots need to be allocated to this job over time so that it finishes within deadline D when run ... See full document

8

DryadLINQ for Scientific Analyses

DryadLINQ for Scientific Analyses

... – Applications that can only be implemented using DryadLINQ but not with typical MapReduce. • Current release of DryadLINQ has some performance limitations[r] ... See full document

20

DryadLINQ for Scientific Analyses

DryadLINQ for Scientific Analyses

... Condor DAGMan [21] is a well-known parallel runtime that supports applications expressible as DAGs and many workflow runtimes supports DAG based execution flows. However, the granularity of tasks handled at the vertices ... See full document

8

An extensible and scalable Pilot-MapReduce framework for data intensive applications on distributed cyberinfrastructure

An extensible and scalable Pilot-MapReduce framework for data intensive applications on distributed cyberinfrastructure

... Apache Hadoop [3]. However, there are limitations to the current MR implementations: (i) They lack a modular architecture, (ii) are tied to specific infrastructure, e. g. Hadoop relies on the Hadoop File System (HDFS), ... See full document

46

Parameterized specification, configuration and execution of data-intensive scientific workflows

Parameterized specification, configuration and execution of data-intensive scientific workflows

... Effects of task granularity In these experiments, we coa- lesced components of the PIQ workflow into meta-compone- nts to produce a workflow template with a coarser task gran- ularity. Figure 8 illustrates an alternative ... See full document

19

Accelerating data-intensive scientific visualization and computing through parallelization

Accelerating data-intensive scientific visualization and computing through parallelization

... after data partitioning and distribution, a sort-last parallel visualization scheme takes two sequential steps, ...of data blocks; ii) the traditional over operator performs order-dependent composition and ... See full document

147

The Fourth Paradigm: Data-Intensive Scientific Discovery, Open Science and the Cloud

The Fourth Paradigm: Data-Intensive Scientific Discovery, Open Science and the Cloud

... digital data “as the digital recorded factual material commonly accepted in the scientific community as necessary to validate research findings including data sets used to support scholarly ... See full document

67

XML Database Support for Distributed Execution of Data-intensive Scientific Workflows

XML Database Support for Distributed Execution of Data-intensive Scientific Workflows

... in scientific data analysis ...for data exchange in Web and Grid ...for scientific workflows and can ben- efit from XML database ...based data processing middleware with a distributed ... See full document

6

Study of MapReduce for Data Intensive Applications, NoSQL Solutions, and a Practical Provisioning Interface for IaaS Cloud

Study of MapReduce for Data Intensive Applications, NoSQL Solutions, and a Practical Provisioning Interface for IaaS Cloud

... iterative MapReduce, in Proceedings of the First International Workshop on MapReduce and its Applications of ACM HPDC 2010 conference June 20-25, ...to Data Intensive problems University of ... See full document

49

Show all 10000 documents...