Projects with this topic
Cristian Vasu Data Portfolio / Scalable Machine Learning with SparkML - Census Income Classification
Built a complete machine learning pipeline in SparkML using the Adult Census dataset (~48k rows, 14 features). Implemented data preprocessing, feature encoding, cross-validation, and model training with Logistic Regression and Random Forest. Evaluated models with metrics such as AUC and F1-score. Reflected on scalability trade-offs and optimizations in distributed ML.
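For readers unfamiliar with the two metrics named above, here is a minimal sketch of how binary F1-score and AUC can be computed from labels and scores. It is plain Python for illustration, not the project's SparkML code; the function names `f1_score` and `roc_auc` and the toy data are assumptions made up for this example. The F1 shown is the positive-class variant, which may differ from what the project's Spark evaluators report.

```python
def f1_score(y_true, y_pred):
    """Binary F1 for the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        # No true positives: precision and recall are both zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair. This O(n^2) form is only meant
    for small illustrative inputs.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = high income, 0 = low income.
labels = [1, 0, 1, 0, 1, 0]
scores = [0.9, 0.4, 0.35, 0.2, 0.8, 0.7]          # model probabilities
preds = [1 if s >= 0.5 else 0 for s in scores]    # threshold at 0.5

auc = roc_auc(labels, scores)
f1 = f1_score(labels, preds)
```

AUC depends only on how the scores rank positives against negatives, while F1 depends on the chosen decision threshold, which is one reason to report both.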