Cloudera Announces Enterprise-Grade Support For Apache Spark
February 10, 2014Grazed from CloudComputingToday. Author: Arnal Dayaratna, PhD.
![]()
Cloudera recently announced the general availability of Apache Spark for Cloudera Enterprise. First developed at UC Berkeley, Apache Spark is a parallel data processing framework that supplements Apache Hadoop by facilitating the development of big data applications related to machine learning, interactive analytics and real-time analytics.
Spark allows users to write parallel sets of code in Java, Scala and Python that operate on Hadoop clusters with a speed up to 100 times faster than MapReduce. Moreover, applications developed in Spark tend to require 2 to 10 ten times less code than a corresponding MapReduce application…
Spark Streaming, an add-on to Spark, enables analytics to be run on streaming datasets such that developers can derive analytic insights within seconds of data ingestion…
Read more from the source @ http://cloud-computing-today.com/2014/02/10/966813/
Subscribe to the CloudCow bi-monthly newsletter @ http://eepurl.com/smZeb


