Projects with this topic
-
The Open Energy Tracker is an open data platform for monitoring and visualizing energy policy targets.
Updated -
Hybrid cloud-edge ML system for predictive rain control with automated retraining, monitoring, and Raspberry Pi hardware actuation.
Updated -
CVerify is a production-ready AI recruitment platform that analyses CVs through a 3-phase ML pipeline, recommends matching jobs from 20 000+ real job postings, and provides a full recruiter/candidate/admin web interface with an integrated AI chatbot.
Updated -
A data mining project analyzing hate crime patterns in the United States from 2017 to 2025, using clustering, predictive modeling, and association rule mining.
Updated -
Suite for manage large volume forensic audiovisual evidence. Dlist implemented for Files scan.
Updated -
Benchmarking open source hydrological model Shyft: https://gitlab.com/shyft-os/shyft. This repo contains helper scripts for simulation setup, results analysis and preparation of figures for publication: O.Silantyeva and S.Huang “Benchmarking Shyft hydrologic model performance of streamflow simulations in mainland Norway"
Updated -
-
Updated
-
A comprehensive system for downloading, extracting, and systematically analyzing firmware from LVFS (Linux Vendor Firmware Service). Integrates multiple specialized tools for binary analysis, archive extraction, CPU architecture recognition, and Machine Learning-based classification.
Updated -
-
Dimensionality Reduction and Classification Methodology
Load the MNIST dataset of handwritten digits Perform the dimensionality reduction methods that were introduced in the lecture: (a) PCA (b) IsoMap (c) Local Linear Embedding (LLE) (d) t-distributed stochastic neighbor embedding (t-SNE) (e) Uniform maniforld approximation and projection (UMAP) Reduce the dimensionality to d = 2 dimensions and plot the results Use the low dimensional representations to perform unsupervised clustering with k-means. Use k =10 classes for the clustering algorithm Classify all images according to the clustering. Choose the label based on how many the majority of datapoints within a clusterUpdated -
A production-oriented Machine Learning pipeline that predicts whether an active user session will result in a purchase.
Model: XGBoost Classifier optimized for class imbalance.
Performance: ROC AUC 0.936 | F1-score 0.71 (at 0.30 threshold).
Key Features: Reproducible environment (uv), modular CLI for training/inference, leakage-free preprocessing, and SHAP interpretability analysis.
Data: UCI Online Shoppers Purchasing Intention Dataset.
Tech Stack: Python, XGBoost, Scikit-learn, Pandas, SHAP.
UpdatedUpdated -
Objectives of the Study The primary objectives of this study are: • To explore the characteristics of students who drop out • To identify academic, social, and economic factors associated with dropout • Comparing dropout and non-dropout students based on key variables • To provide insights that can help institutions reduce dropout rates
Updated -
MultiNativQA is Multilingual Native question-answering (QA) dataset consisting of 64k QA pairs in seven extremely low to high resource languages, covering 18 different topics from nine different regions. Paper: https://arxiv.org/pdf/2407.09823. Project: https://nativqa.gitlab.io
Updated -
official implementation for the manuscript 'AI-based association analysis for medical imaging using latent-space geometric confounder correction'
Updated -
The automatic classifiers for peace group is an open-source community of researchers, organisations and individuals that are interested in aiding social media analysis through the automatic classification of social media discourse. This repository contains the experiments, documentation, tutorials and source code that the group has developed.
Updated -
-
A comprehensive machine learning pipeline for classifying astronomy images into 6 categories of celestial objects, featuring advanced data preprocessing, exploratory data analysis, and deep learning classification models.
https://huggingface.co/spaces/Saqib772/Astronomy_image_classfication
Kaggle Notebook: https://www.kaggle.com/code/saqibiqbal2/astronomy-image-classification
Updated -
Repository displaying the results of measurements of various Institute of Solar-Terrestrial Physics (ISTP) SB RAS instruments for the May 2024 geomagnetic storm
Updated -