Apache Hadoop
Apache Hadoop is an open-source software framework, licensed under the Apache v2 license, that supports data-intensive distributed applications.
Apache Spark™ is a fast and general engine for large-scale data processing. Speed: run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk.
Spark has an advanced DAG execution engine that supports acyclic data flow and in-memory computing.
Apache Hadoop is an open-source software framework, licensed under the Apache v2 license, that supports data-intensive distributed applications.
Free, Open Source, Mac OS X, Windows, Linux
Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.
Free, Open Source, Mac OS X, Windows, Linux, BSD
Apache Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime...
Free, Open Source, Mac OS X, Windows, Linux, BSD
Disco is a lightweight, open-source framework for distributed computing based on the MapReduce paradigm and written in Python.
Free, Open Source, Mac OS X, Windows, Linux
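For readers unfamiliar with the MapReduce paradigm mentioned above, here is a minimal single-process sketch of a word count in plain Python. It does not use Disco's or Hadoop's API; the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration. A real framework would run the map and reduce steps in parallel across a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine the values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence (illustrative, not distributed)."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data big cluster", "data stream"]))
```

The three steps are kept as separate functions because that separation is what lets a framework distribute them: each map call sees only one input record, and each reduce call sees only one key.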
Amazon Kinesis services make it easy to work with real-time streaming data in the AWS cloud.
Commercial, Web