The Best 9 Apache Spark Alternatives

  • Apache Hadoop

    Apache Hadoop is an open-source software framework that supports data-intensive distributed applications. It is licensed under the Apache License 2.0.

    Free Open Source Mac OS X Windows Linux

  • Apache Flink

    Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.

    Free Open Source Mac OS X Windows Linux BSD

  • Apache Storm

    Apache Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime...

    Free Open Source Mac OS X Windows Linux BSD

  • Heron

    Heron is a realtime, distributed, fault-tolerant stream processing engine from Twitter (http://heronstreaming.io).

    Free Open Source Linux

  • Disco MapReduce

    Disco is a lightweight, open-source framework for distributed computing based on the MapReduce paradigm and written in Python.

    Free Open Source Mac OS X Windows Linux

  • Amazon Kinesis

    Amazon Kinesis services make it easy to work with real-time streaming data in the AWS cloud.

    Commercial Web

  • Gearpump

    Apache Gearpump is a real-time big data streaming engine. The name Gearpump is a reference to the engineering term “gear pump,” which is a super simple pump that consists...

    Free Open Source Linux

  • Upsolver

    Upsolver is a data preparation platform that, according to the vendor, lets you prepare and deliver data at massive scale in a matter of minutes.

    Commercial Web