• No results found

Apache Spark

A Study on New Challenges of Big Data and Apache Spark

A Study on New Challenges of Big Data and Apache Spark

... frameworks. Apache Spark runs on Hadoop, Mesos, standalone, or inside the ...run Spark the use of its standalone cluster mode, on EC2, on Hadoop YARN, or on Apache ...

8

Apache Spark usage and deployment models for scientific computing

Apache Spark usage and deployment models for scientific computing

... and Spark service provided by CERN IT is used by the IT Monitoring service which is critical for CC operations and WLCG, IT Security for intrusion detection, LHC experiments (CMS, ATLAS) for the analytics on ...

10

Using Apache Spark on genome assembly for scalable overlap-graph reduction

Using Apache Spark on genome assembly for scalable overlap-graph reduction

... Apache Spark is a cluster-based engine that processes very large-scale ...computation, Apache Spark provides GraphX library that contributes a set of fundamental operations and graph ...

12

Leveraging resource management for efficient performance of Apache Spark

Leveraging resource management for efficient performance of Apache Spark

... [4], Apache Spark quickly becomes the most popular com- plement of Apache Hadoop ...and Apache Spark. In their study, they show that Spark is very con- suming memory, and ...

23

Analytics For Healthcare Using Hadoop Mapreduce, Apache Spark And In Cloud Services

Analytics For Healthcare Using Hadoop Mapreduce, Apache Spark And In Cloud Services

... use Spark easily. Apache Spark is the fast general purpose big data analytics engine and it is very suitable for any kindof big data ...of Apache spark are Low Tolerance to Latency ...

5

Time series analysis with apache spark and its applications to energy informatics

Time series analysis with apache spark and its applications to energy informatics

... There is no time series package for Apache Spark and R (R Core Team, 2018) on the market. So, another method to perform forecasts with R on a Spark-Cluster is needed. With the API SparkR the function ...

5

Partitional Based Clustering Algorithms on Big Data Using Apache Spark

Partitional Based Clustering Algorithms on Big Data Using Apache Spark

... Apache Spark, a framework similar to the Von Neumann architecture. It has an efficient implementation of in- memory computations and iterative optimization is processed to analyze large volume of data. Data ...

6

StreamAligner: a streaming based sequence aligner on Apache Spark

StreamAligner: a streaming based sequence aligner on Apache Spark

... on Spark stream- ing ...in Spark memory and can be used for an unlimited times without reload- ...alignment, Apache Spark, Hadoop, Distributed computing ...

18

Optimizing machine learning on Apache Spark in HPC environments

Optimizing machine learning on Apache Spark in HPC environments

... of Apache Spark, a distributed data-flow framework, to support the use of machine learning in HPC environments for the purposes of machine ...using Spark in this context; memory management, ...

12

Incremental-Parallel Data Stream Classification in Apache Spark Environment

Incremental-Parallel Data Stream Classification in Apache Spark Environment

... on apache spark, to address the problems on the real-time streaming data communication cost and workload imbalance problem of large scale data in a parallel and distributed ...

9

GeoMatch:Efficient Large Scale Map Matching on Apache Spark

GeoMatch:Efficient Large Scale Map Matching on Apache Spark

... Abstract—We contribute by developing GeoMatch as a novel, scalable, and efficient big-data pipeline for large-scale map matching on Apache Spark. GeoMatch improves ex- isting spatial big data solutions by ...

8

Efficient iterative virtual screening with Apache Spark and conformal prediction

Efficient iterative virtual screening with Apache Spark and conformal prediction

... Due to the recent availability of large molecule datasets (e.g., ZINC [7]) and their structure being highly paral- lelizable, parallel approaches have been used for virtual screening. In our previous studies [8, 9], we ...

8

Study of Machine Learning Techniques using Apache Spark

Study of Machine Learning Techniques using Apache Spark

... as Apache Spark MLlib have been ...of Apache Spark.One of the major libraries of Apache Spark, Spark MLlib is the most prominent platform available for big data analysis to carry ...

7

Choice of Cluster Computing System Hadoop and Apache Spark for Network Systems

Choice of Cluster Computing System Hadoop and Apache Spark for Network Systems

... internal Spark functions written in Scala running on a JVM, translation between two different environments and the selected languages can be the source of more errors, problems, and reduced ...of Apache ...

8

A comparison on scalability for batch big data processing on Apache Spark and Apache Flink

A comparison on scalability for batch big data processing on Apache Spark and Apache Flink

... Data, Apache Spark and Apache ...platforms. Apache Spark have shown to be the frame- work with better scalability and overall faster ...and Spark ML are minimal, MLlib performs ...

11

Assessing Apache Spark Streaming with Scientific Data

Assessing Apache Spark Streaming with Scientific Data

... using Spark instead. Apache Spark Streaming lets the program store results in an intermediate data-form within the memory, and when new data arrives as another discrete set, it is batched to perform ...

50

Evolutionary Optimization Using Big Data from Engineering Simulations and Apache Spark

Evolutionary Optimization Using Big Data from Engineering Simulations and Apache Spark

... within Apache Spark. The reduced order model from Spark is then utilized by anevolutionary algorithm in the optimization of an industrial system ...utilizing Spark as the common ...

14

Apache Spark based Big Data Analytics for Social Network Cybercrime Forensics

Apache Spark based Big Data Analytics for Social Network Cybercrime Forensics

... The study relied on distributed contributing framework Apache Spark and made use of Spark streaming API to stream Twitter data and Spark ML API for tweet analysis and classification. the study ...

10

A Technological Survey On Apache Spark And Hadoop Technologies.

A Technological Survey On Apache Spark And Hadoop Technologies.

... as Spark which is developed by Matei Zaharia in AMPLab of UC Berkeley in ...2009. Spark was donated to apache software foundation which was open sourced in 2010 with BSD Licensed and this ...

10

Liver Disorderprognosis With Apache Spark Random Forest And Gradient Booster Algorithms  Thari Krishna, Dr C Rajabhushanam Abstract PDF  IJIRMET1604020013

Liver Disorderprognosis With Apache Spark Random Forest And Gradient Booster Algorithms Thari Krishna, Dr C Rajabhushanam Abstract PDF IJIRMET1604020013

... the spark random forests algorithm with spark gradient booster tree ...data, apache spark concepts in section ...learning, spark machine learning code work flow, random forests ...

7

Show all 1305 documents...

Related subjects