Apache Spark
Apache Spark™ is a fast and general engine for large-scale data processing. Speed Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk.
Apache Hadoop is a open source software framework that supports data-intensive distributed applications licensed under the Apache v2 license.
Apache Hadoop is a open source software framework that supports dataintensive distributed applications licensed under the Apache v2 license. It enables applications to work with thousands of computational independent computers and petabytes of data.
web-development distributed-computing developer-tools big-data distributed-file-system
Apache Spark™ is a fast and general engine for large-scale data processing. Speed Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk.
Free Open Source Mac OS X Windows Linux
Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.
Free Open Source Mac OS X Windows Linux BSD
Disco is a lightweight, open-source framework for distributed computing based on the MapReduce paradigm and written in Python.
Free Open Source Mac OS X Windows Linux
Amazon Kinesis services make it easy to work with real-time streaming data in the AWS cloud.
Commercial Web
HPCC Systems offers an open source cluster computing platform used to solve Big Data problems. Its unique architecture and simple yet powerful data programming language...
Free Open Source Linux