Apache Spark
Apache Spark is an open-source unified analytics engine for large-scale data processing. It runs distributed computations across clusters of machines.
Key Features
- Distributed computing across a cluster
- APIs for Python, R, Scala, and Java
- Batch and stream processing
- Machine learning (MLlib)
- Open source
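
To make "distributed computing" concrete, here is a minimal sketch of the split, process, and merge pattern that engines like Spark apply at cluster scale. It is plain Python, not Spark's API: the function names (split_into_partitions, count_words, distributed_word_count) and the sample lines are hypothetical, and threads in one process stand in for cluster workers.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(records, num_partitions):
    """Deal records round-robin into num_partitions lists."""
    return [records[i::num_partitions] for i in range(num_partitions)]


def count_words(partition):
    """Map step: count words within a single partition."""
    counts = Counter()
    for line in partition:
        counts.update(line.lower().split())
    return counts


def distributed_word_count(records, num_partitions=3):
    """Count words per partition in parallel, then merge the partial counts."""
    partitions = split_into_partitions(records, num_partitions)
    # Threads stand in for cluster workers here; nothing leaves this process.
    with ThreadPoolExecutor(max_workers=num_partitions) as pool:
        partials = list(pool.map(count_words, partitions))
    # Reduce step: combine the per-partition results into one total.
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


# Hypothetical sample input, one "record" per line of text.
lines = [
    "spark runs batch jobs",
    "spark runs streaming jobs",
    "mllib runs on spark",
]
word_counts = distributed_word_count(lines)
print(word_counts.most_common(2))
```

Because each partition is counted independently and the partial results are merged afterwards, the total does not depend on how many partitions are used. That independence is what lets the same job be spread over more machines without changing the answer.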