V
vllm

Projects with this topic

View stuff project

Ivan S. Titov / stuff

useful Gentoo overlay Curated ebuilds, AI, tools & science

cuda gentoo crystallography gentoo-overlay texlive deadbeef electron-mic... python2 SAXS rocm micromagnetism xafs hyperspy mantid 4d-stem gwyddion pf-sources xdna vllm amd-ryzen-ai

0

Updated Jul 29, 2026

0 0 0

Updated Jul 29, 2026
View Subliminal Learning Inspection project

Pig AI / Subliminal Learning Inspection

Experiments with Subliminal Learning

llm machinelearning Python vllm

0

Updated Jun 02, 2026

0 0 0 0

Updated Jun 02, 2026
View KVortex project

Ayi NEDJIMI / KVortex

VRAM to RAM Offloader for AI and vLLM - High-Performance C++23 KV Cache Engine with Multi-Stream GPU Transfers

https://ayinedjimi-consultants.fr

AI cpp23 cuda GPU-computing high-perform... kv-cache llm-inference machine-lear... vllm vram-offload cpp deep-learning gpu inference nvidia vRAM

0

Updated May 22, 2026

0 0 0 0

Updated May 22, 2026
View flashquant project

Ayi NEDJIMI / flashquant

Extreme KV Cache Compression for LLM Inference — C++17/CUDA implementation of TurboQuant (arXiv 2504.19874). 7.5x compression, <2% quality loss.

https://ayinedjimi-consultants.fr

compression cpp cuda flash-attention gpu inference kv-cache llm machine-lear... PyTorch quantization transformer turboquant vllm

0

Updated May 22, 2026

0 0 0 0

Updated May 22, 2026
View cli project

spendlayer / cli

Track AI spending across API providers, MCP tools, subscriptions, and self-hosted GPUs from your terminal. One CLI to see what you're actually paying — Anthropic, OpenAI, OpenRouter, vLLM on your own hardware — with real TCO math. Compare your self-hosted $/MTok vs cloud pricing. Local-first, no SaaS required.

AI cli cost-tracking Go golang dataset vllm self-hosted gpu openml anthropic open-source devtools terminal bitcoin infrastructure open-source ...

0

Updated Mar 03, 2026

0 0 0 0

Updated Mar 03, 2026

Projects with this topic

Ivan S. Titov / stuff

Pig AI / Subliminal Learning Inspection

Ayi NEDJIMI / KVortex

Ayi NEDJIMI / flashquant

spendlayer / cli