Explore projects
Joonas T. Holmi / wit_io
BSD 3-Clause "New" or "Revised" License
WITio: A MATLAB data evaluation toolbox to script broader insights into big data from WITec microscopes
rychly-edu / theses / dist-forensic-digital-data-repo
Apache License 2.0
Distributed storage for digital forensic data, with a data/metadata repository, an API for queries and incoming/outgoing data, indexing, a plug-in system for not-yet-supported data types, etc.
Miguel Andreu / hadoop-premier-league
GNU General Public License v3.0 only
This project was an exercise for the Master in Big Data Engineering and Data Science at "Universidad Autónoma de Madrid". See the readme.md for more information.
DP3 is an algorithm for parallel frequent itemset mining on distributed and shared-memory systems.
Stack Exchange releases "data dumps" of all its publicly available content roughly every three months via archive.org.
This project is an example of, and a framework for, building ETL pipelines for this data with Apache Spark and Java.
Amit Kamat / Map-Reduce-Ukraine
MIT License
This project aggregates trending data from Ukraine-based Twitter accounts. The raw aggregated data is cleansed before analysis using big-data methods. The purpose of this project is to familiarize myself with how Hadoop works, covering both HDFS and the Map-Reduce infrastructure.
Giacomo Marciani / flink-app
MIT License
Scaffolding for data stream processing applications, leveraging Apache Flink.
Giacomo Marciani / mapreduce-app
MIT License
Scaffolding for Map/Reduce applications, leveraging Apache Hadoop.