Projects with this topic
-
Fundamental theory and practice in Data Science (DS).
🧮 data analysis AI ML DL machine lear... deep learning data science data-enginee... artificial i... data-science data preproc... Python C C++ NumPy pandas mathematics Algorithm algorithms Data Enginee... big data scipy scikit-learn xgboost lightgbm catboost TensorFlow keras PyTorch matplotlib seaborn plotly nltk opencv dask linear-algebra calculus probability statistics Discrete Mat... RUpdated -
An automated data pipeline for migrating and synchronizing patient records from HOSxP (MySQL) to the Buddy Care platform, featuring SQL optimization and data integrity validation for healthcare services.
Updated -
Solución end-to-end para la migración y análisis de datos utilizando Python, FastAPI, Kafka y PostgreSQL. Implementa un pipeline de datos asíncrono y una API RESTful para analíticas, todo completamente containerizado con Docker Compose para un despliegue fácil y reproducible.
Updated -
End-to-end design of a Hadoop-based ecosystem for healthcare data at scale (50 TB, IoT streams, medical imaging). Proposed a 10-node cluster architecture integrating HDFS, Spark, Hive, NiFi, Kafka, and Docker with HIPAA-compliant security (Kerberos, TLS, Apache Ranger). Delivered a proof-of-concept Docker deployment and professional proposal document.
Updated -
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
See the documentation at: https://airflow-dbt-python.readthedocs.io/
Updated