Apache Spark

Software Details:
Version: 1.3.1 (updated)
Upload Date: 12 May 15
Distribution Type: Freeware
Downloads: 45

Rating: 5.0/5 (Total Votes: 1)

Spark was designed to improve processing speeds for data analysis and manipulation programs.

It is written in Java and Scala and provides features not found in other systems, largely because those features are specialized and of little use outside data-processing applications.

What is new in this release:

  • The core API now supports multi-level aggregation trees to help speed up expensive reduce operations.
  • Improved error reporting has been added for certain gotcha operations.
  • Spark's Jetty dependency is now shaded to help avoid conflicts with user programs.
  • Spark now supports SSL encryption for some communication endpoints.
  • Realtime GC metrics and record counts have been added to the UI.
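
A multi-level aggregation tree combines partial results in log-depth rounds instead of pulling every partition's result into a single final reduce. This is a minimal plain-Python sketch of the idea (an illustration, not Spark's actual implementation):

```python
from functools import reduce

def tree_reduce(values, op, fanout=2):
    """Reduce a sequence in rounds, combining `fanout` partial results
    at a time, mimicking a multi-level aggregation tree."""
    level = list(values)
    while len(level) > 1:
        # Each round merges adjacent groups of `fanout` partial results.
        level = [reduce(op, level[i:i + fanout])
                 for i in range(0, len(level), fanout)]
    return level[0]

# Same result as a flat reduce, but built in ~log_fanout(n) rounds,
# so no single step has to combine all n partial results.
total = tree_reduce(range(16), lambda a, b: a + b)
```

With a fanout of 2 and 16 partitions, the final combine sees only 2 inputs instead of 16, which is the win for expensive reduce operations.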

What is new in version 1.2.1:

  • PySpark's sort operator now supports external spilling for large datasets.
  • PySpark now supports broadcast variables larger than 2GB.
  • Spark adds a job-level progress page in the Spark UI, a stable API for progress reporting, and dynamic updating of output metrics as jobs complete.
  • Spark now has support for reading binary files for images and other binary formats.
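
External spilling means sorting data in memory-sized chunks, writing each sorted run to disk, and merge-reading the runs back. A toy plain-Python sketch of the pattern (illustrative only, not PySpark's code):

```python
import heapq
import os
import pickle
import tempfile

def _spill(sorted_chunk):
    """Write one sorted run to a temporary file and return its path."""
    fd, path = tempfile.mkstemp()
    with os.fdopen(fd, "wb") as f:
        pickle.dump(sorted_chunk, f)
    return path

def _read_run(path):
    """Stream one sorted run back from disk."""
    with open(path, "rb") as f:
        yield from pickle.load(f)

def external_sort(items, chunk_size=1000):
    """Sort data larger than memory: sort chunk_size pieces in memory,
    spill each sorted run to disk, then k-way merge the runs."""
    runs, chunk = [], []
    for x in items:
        chunk.append(x)
        if len(chunk) >= chunk_size:
            runs.append(_spill(sorted(chunk)))
            chunk = []
    if chunk:
        runs.append(_spill(sorted(chunk)))
    merged = list(heapq.merge(*[_read_run(p) for p in runs]))
    for path in runs:
        os.remove(path)
    return merged
```

Only `chunk_size` items are ever sorted in memory at once; `heapq.merge` streams the on-disk runs back in order.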

What is new in version 1.0.0:

  • This release expands Spark's standard libraries, introducing a new SQL package (Spark SQL) that lets users integrate SQL queries into existing Spark workflows.
  • MLlib, Spark's machine learning library, is expanded with sparse vector support and several new algorithms.
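
Sparse vector support stores only the nonzero entries as parallel index and value arrays. The class below is a hedged plain-Python analogue of that layout (not MLlib's actual API):

```python
class SparseVector:
    """Sparse vector stored as parallel (index, value) arrays.
    Illustrative analogue of the sparse-vector layout; not MLlib code."""

    def __init__(self, size, indices, values):
        self.size = size              # logical dimensionality
        self.indices = list(indices)  # positions of nonzero entries
        self.values = list(values)    # the nonzero entries themselves

    def dot(self, other):
        # Only overlapping nonzero indices contribute to the product.
        lookup = dict(zip(other.indices, other.values))
        return sum(v * lookup.get(i, 0.0)
                   for i, v in zip(self.indices, self.values))

a = SparseVector(1000, [0, 42, 999], [1.0, 2.0, 3.0])
b = SparseVector(1000, [42, 500], [4.0, 5.0])
a.dot(b)  # only index 42 overlaps: 2.0 * 4.0 = 8.0
```

For a 1000-dimensional vector with three nonzeros, this stores six numbers instead of a thousand, which is why sparse support matters for feature-heavy machine learning workloads.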

What is new in version 0.9.1:

  • Fixed hash collision bug in external spilling
  • Fixed conflict with Spark's log4j for users relying on other logging backends
  • Fixed GraphX missing from Spark assembly JAR in Maven builds
  • Fixed silent failures due to map output status exceeding Akka frame size
  • Removed Spark's unnecessary direct dependency on ASM
  • Removed metrics-ganglia from default build due to LGPL license conflict
  • Fixed bug in distribution tarball not containing spark assembly jar

What is new in version 0.8.0:

  • Development has moved to the Apache Software Foundation as an incubator project.

What is new in version 0.7.3:

  • Python performance: Spark now spawns Python worker VMs faster when the JVM has a large heap size, speeding up the Python API.
  • Mesos fixes: JARs added to your job will now be on the classpath when deserializing task results in Mesos.
  • Error reporting: Better error reporting for non-serializable exceptions and overly large task results.
  • Examples: Added an example of stateful stream processing with updateStateByKey.
  • Build: Spark Streaming no longer depends on the Twitter4J repo, which should allow it to build in China.
  • Bug fixes in foldByKey, streaming count, statistics methods, documentation, and web UI.
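
Stateful stream processing with updateStateByKey folds each batch's new values for a key into a running per-key state. The per-key update loop can be sketched in plain Python like this (hypothetical helper names, not Spark Streaming's API):

```python
def update_state(batches, update_fn):
    """Apply an updateStateByKey-style function over successive batches
    of (key, value) pairs, carrying per-key state between batches."""
    state = {}
    for batch in batches:
        # Group this batch's values by key, as the streaming engine would.
        grouped = {}
        for key, value in batch:
            grouped.setdefault(key, []).append(value)
        # update_fn sees the new values plus the previous state (or None).
        for key, new_values in grouped.items():
            state[key] = update_fn(new_values, state.get(key))
    return state

def running_count(new_values, prev):
    """The classic example: keep a running count per key."""
    return (prev or 0) + len(new_values)

counts = update_state(
    [[("a", 1), ("b", 1)], [("a", 1), ("a", 1)]],
    running_count,
)
# counts == {"a": 3, "b": 1}
```

The state survives across batches, which is what distinguishes this from a per-batch reduce.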

What is new in version 0.7.2:

  • Scala version updated to 2.9.3.
  • Several improvements to Bagel, including performance fixes and a configurable storage level.
  • New API methods: subtractByKey, foldByKey, mapWith, filterWith, foreachPartition, and others.
  • A new metrics reporting interface, SparkListener, to collect information about each computation stage: task lengths, bytes shuffled, etc.
  • Several new examples using the Java API, including K-means and computing pi.
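
Of the new API methods, foldByKey combines the values for each key with an associative function, starting each key's accumulator at a zero value. A plain-Python sketch of its semantics (illustrative, not Spark's distributed implementation):

```python
def fold_by_key(pairs, zero, op):
    """Fold values per key, starting each key's accumulator at `zero`,
    mirroring the semantics of Spark's foldByKey."""
    acc = {}
    for key, value in pairs:
        acc[key] = op(acc.get(key, zero), value)
    return acc

fold_by_key([("a", 1), ("b", 2), ("a", 3)], 0, lambda x, y: x + y)
# {"a": 4, "b": 2}
```

In Spark the `zero` may be applied once per partition, so it must be an identity for `op` (0 for addition, 1 for multiplication) to give consistent results.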

What is new in version 0.7.0:

  • Spark 0.7 adds a Python API called PySpark.
  • Spark jobs now launch a web dashboard for monitoring the memory usage of each distributed dataset (RDD) in the program.
  • Spark can now be built using Maven in addition to SBT.

What is new in version 0.6.1:

  • Fixed overly aggressive message timeouts that could cause workers to disconnect from the cluster.
  • Fixed a bug in the standalone deploy mode that did not expose hostnames to scheduler, affecting HDFS locality.
  • Improved connection reuse in shuffle, which can greatly speed up small shuffles.
  • Fixed some potential deadlocks in the block manager.
  • Fixed a bug getting IDs of failed hosts from Mesos.
  • Several EC2 script improvements, like better handling of spot instances.
  • Made the local IP address that Spark binds to customizable.
  • Support for Hadoop 2 distributions.
  • Support for locating Scala on Debian distributions.

What is new in version 0.6.0:

  • Simpler deployment.
  • Spark's documentation has been expanded with a new quick start guide, additional deployment instructions, configuration guide, tuning guide, and improved Scaladoc API documentation.
  • A new communication manager using asynchronous Java NIO lets shuffle operations run faster, especially when sending large amounts of data or when jobs have many tasks.
  • A new storage manager supports per-dataset storage level settings (e.g. whether to keep the dataset in memory, deserialized, on disk, etc., or even replicated across nodes).
  • Enhanced debugging.
