Projects with this topic
Empirical validation of C4 geometric defense against 16 Agents of Chaos. 550 adversarial prompts. 4 defense systems. 96.7% block rate. LLM validation on GPT-4o-mini + Mistral 7B. MIT.
Raw benchmark results and statistical analysis.
Evaluation harness for measuring how well AI models perform on SysML v2 modeling tasks.
Scaling and complexity benchmarks for Univec.
Benchmark suite for measuring Univec performance.
An Extensible Benchmark Framework for Real-Time Applications
Documentation: https://rt-bench.gitlab.io/rt-bench/
SDLC query evaluation framework for orbit/knowledge-graph — measuring whether GKG graph query tools help AI agents answer developer questions
Archived
A benchmark library with statistical analysis and plotting capabilities in C++. https://cppstatbench.musicscience37.com/
System utility designed to stress and monitor various hardware components
Introduction to Cassandra, and benchmark of its main mechanisms. Developed for the final exam of the master's course "Architetture Dati".
Sorting algorithms benchmarks
Moved to: https://github.com/joular/cpupowerbench
CPUPowerBench is an automated benchmark to accurately generate a power model for single-board computers.
Archived
A tool that helps you create benchmarks and visualize them: https://benchmaker.vercel.app/
Measure performance of various languages in calculating cosine similarity of vectors (README will be updated...)
Research study on the effect of Docker image layers on performance.
Mongoose is a storage performance testing tool.
An HTTP benchmark that compares the performance of Racket's Web servlets with Racket SCGI + nginx, Flask, Sinatra, Plug, etc. Results and discussion.
Archived