Open-source LLM evaluation framework similar to Pytest but specialized for unit testing LLM outputs, with comprehensive RAG evaluation metrics and CI/CD integration.
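As a rough illustration of what Pytest-style unit testing of LLM outputs can look like, the sketch below uses plain Python only. It does not use this framework's API: `fake_llm`, `keyword_recall`, and the 0.5 threshold are hypothetical stand-ins for a real model call and a real evaluation metric.

```python
import re


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real model call; returns a canned answer."""
    return "Paris is the capital of France."


def keyword_recall(output: str, expected_keywords: list[str]) -> float:
    """Fraction of expected keywords found in the output (case-insensitive)."""
    if not expected_keywords:
        return 1.0
    # Split the output into lowercase alphanumeric tokens for whole-word matching.
    tokens = set(re.findall(r"[a-z0-9]+", output.lower()))
    hits = sum(1 for kw in expected_keywords if kw.lower() in tokens)
    return hits / len(expected_keywords)


def test_capital_question() -> None:
    """Pytest-style test: fail when the metric score drops below a threshold."""
    output = fake_llm("What is the capital of France?")
    score = keyword_recall(output, ["Paris", "France"])
    assert score >= 0.5  # hypothetical threshold, chosen only for illustration
```

Running `pytest` on a file containing this function would collect and run it like any other test, which is what makes this style straightforward to wire into a CI/CD pipeline.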