Explore projects
-
Joonas T. Holmi / wit_io
BSD 3-Clause "New" or "Revised" LicenseWITio: A MATLAB data evaluation toolbox to script broader insights into big data from WITec microscopes
Updated -
New machine learning algorithms based on the minimum nescience principle
Updated -
-
Canada LINCS / Access / Stratocumulus
MIT LicenseA zoomable network browser for cultural linked data
Updated -
Nous avons entrepris un projet d'apprentissage automatique pour prédire des maladies en analysant les données symptomatiques et médicales. Notre modèle sophistiqué, basé sur des techniques d'apprentissage automatique avancées, évalue les symptômes pour fournir des prédictions précises. Avec une interface API développée avec Django et un déploiement sur Microsoft Azure via Terraform, notre solution est conviviale et évolutive.Découvrez notre projet ici :https://apipharma-app-service.azurewebsites.net/
Updated -
Workshop de Big Data a cargo de Jimmy Farfán docente del curso online "Desarrollo de Aplicaciones de Big Data en Hadoop". Si requieren más información o cualquier duda pueden ubicarnos en facebook como Data Hack Formation.
Updated -
Práctica del módulo Big Data Processing (Spark y Scala) del V Bootcamp BD & ML de Keepcoding
Updated -
Jean-Baptiste Feret / bigRaster
GNU General Public License v3.0 onlyThe package bigRaster allows handling large rasters when they can be processed by chunk. This includes computing spectral indices, applying regression models, stacking individual rasters into larger rasters...
Updated -
clirai / pyralysis
GNU General Public License v3.0 or laterPYthon Radio Astronomy anaLYSis and Image Synthesis
Updated -
Giacomo Marciani / mapreduce-app
MIT LicenseScaffolding for Map/Reduce applications, leveraging Apache Hadoop.
Updated -
Neuroscience Lab / BNDF
Apache License 2.0Structured Big data framework based on Apache Spark for storing and manipulating large scale multi channel neurophysiological recording data
Updated -
Workshop dictado por Jesús Méndez (https://pe.linkedin.com/in/jmendezgal) y Antonio Cachuán (https://linkedin.com/in/antoniocachuan/) los temas de Apache Druid, Certificarte en GCP y nuestro Data Engineering Program
Updated -
Giacomo Marciani / flink-app
MIT LicenseScaffolding for data stream processing applications, leveraging Apache Flink.
Updated -
Amit Kamat / Map-Reduce-Ukraine
MIT LicenseThis project aggregates trending data from Ukraine based Twitter accounts. The raw aggregated data is cleansed before analysis using some Big-data methods. The purpose of this project is to familiarize myself with the workings of Hadoop for HDFS and Map-Reduce infrastructure.
Updated -
Prince Bansal / pyspark-azure-hdinsight-sample
MIT LicenseDeploying PySpark Jobs on Azure HDInsight Spark Cluster (CI/CD)
Updated -
rychly-edu / theses / dist-forensic-digital-data-repo
Apache License 2.0Distributed storage for digital forensic data with data/metadata repository, API for queries and incoming/outgoing data, indexing, plug-in system for yet unsupported data-types, etc.
Updated -
Daniel Snider / crawler
GNU Affero General Public License v3.0A Python app for scanning large data sets of URLs for a given signature and storing the results to an ElasticSearch index. Useful applications for CERTs and security researchers, maybe others.
Updated -
DP3 is an algorithm for distributed and shared-memory parallel Frequent Itemsets Mining.
Updated -
NLTK for sentiment analysis given a Twitter streaming for a word. Configuration scripts for MongoDB and twitter streaming.
Updated -