Projects with this topic
A distributed network architecture designed for the management and preservation of, and access to, the audiovisual heritage of El Sistema. EDITION IN DEVELOPMENT
This package provides a modular framework for the semi-automated processing of entomological specimen labels. It uses artificial intelligence to perform label detection, classification, rotation correction, OCR, and clustering, laying the groundwork for comprehensive information extraction. It is designed to work in conjunction with the python-mfnb package for downstream clustering tasks.
XCut - solving normalized cut on graphs using expander decomposition
Library and tools for similarity measurement, classification, and clustering of digital content, and for segmentation of images from digitized documents
A regular expression generator for arbitrary sets of strings. Returns the patterns with exact or generalised character sets, depending on the choice of the user, and facilitates clustering over patterns to create superpatterns.
Cluster analysis of conformer ensembles
Search for patterns in river water sample data
A program that utilizes cluster computing and parallel programming to simulate trading strategies on the Nordic stock market.
My attempts to solve homework from the Moscow Institute of Physics and Technology course
Collection of completed data-mining coursework (university course) in Python
A simple spectral clustering example made with Python.
The end... and maybe the beginning of the LinuxPMI kernel clustering extensions. Based on openMosix.
Clustering and rescheduling algorithms for the Social Internet of Energy. Related scientific manuscripts: https://ieeexplore.ieee.org/abstract/document/8784194 https://www.mdpi.com/1424-8220/20/13/3704
MSMCluster is a plugin for MegaMol that clusters and visualizes the clustering of molecular surface map images.
Implementation of fuzzy k-means (with extragrades) clustering in Rust
"Cloud container data analytics, statistical modeling, and machine learning on distributed databases". "A free open-source alternative to SPSS, SAS, MATLAB, PowerBI, Tableau and Alteryx". Runs on Linux, Windows, macOS, and in the cloud via containers.
DPCfam Workstation version. Runs on Linux-based systems; developed and tested on Ubuntu 18. DPCfamW uses the moodycamel::ConcurrentQueue library ( https://github.com/cameron314/concurrentqueue ), which is freely available provided it is cited (Simplified BSD license). This version replicates the pipeline used to analyze UniRef50 (v. 2017_07) in "Unsupervised protein family classification by Density Peak clustering", Russo ET, 2020, PhD Thesis ( http://hdl.handle.net/20.500.11767/116345 ), but with smaller datasets. The largest dataset we analysed is TESTproteins_cd50.fasta, which is provided in this package. Due to memory bounds, we do not guarantee that the analysis of larger datasets is achievable with this version.
Implementation of Altieri, F., Pietracaprina, A., Pucci, G., & Vandin, F. (2021). Scalable distributed approximation of internal measures for clustering evaluation. In Proceedings of the 2021 SIAM International Conference on Data Mining (SDM) (pp. 648-656). Society for Industrial and Applied Mathematics.
Capstone 1: Caterpillar Tube Pricing Prediction & Categorization. Capstone 2: Pipeline Multi-Leak Classification.