Projects with this topic
Robot Framework test harness for LLM evaluation, with deterministic grading, containerized execution, multi-model comparison, safety testing, test history, and native CI/CD integration.
A Superset Docker setup that can be configured entirely through environment variables and secret files. Comes with a Compose file for local development and testing.