Apache Spark
Projects with this topic
Execute Hadoop and Spark applications on the BigData@Polito cluster with a single command
Stack Exchange releases "data dumps" of all its publicly available content roughly every three months via archive.org.
This project is both an example and a framework for building ETL pipelines over this data with Apache Spark and Java.