π Hi, Iβm Cristian Vasu
π₯ Spotlight Projects
-
π₯ HealthTrend Innovations β Big Data Architecture (Capstone)
Designed a Hadoop-based healthcare data ecosystem (50TB). Integrated HDFS, Spark, Hive, NiFi, Kafka, Docker, with HIPAA-compliant security. Delivered Docker PoC + proposal. -
β‘ Spark for Batch + Streaming: Market Analysis & Kafka Pipeline
Combined batch analytics (S&P 500 with PySpark) and real-time pipelines (Spark + Kafka). -
π€ Scalable Machine Learning with SparkML (Census Income Classification)
End-to-end ML pipeline on the Adult Census dataset with preprocessing, cross-validation, Logistic Regression + Random Forest. -
π Database Performance Benchmarking with YCSB (Cassandra vs PostgreSQL)
Benchmarked NoSQL vs SQL systems under different workloads using YCSB, analyzing throughput and latency trade-offs. -
π¦οΈ NOAA Weather Data Analysis with MapReduce
MapReduce analysis of 1920β1940 NOAA station data for operability and descriptive statistics.
π« Connect
-
πΌ LinkedIn -
π§ cristian@cristianvasu.com
Personal projects
View all- Loading