Evaluate models, settings, and metrics

Experiments and continuous monitoring

November 08, 2024

LLMs are inherently unpredictable, which can make feature development challenging. With Velvet Evaluations, you can feel confident that your LLM-powered features will work the way you expect them to. Test your request logs against models, settings, and metrics.

Test for accuracy, latency, and cost.
Experiment with relevancy, accuracy, and quality.
Validate data, RAG, and agentic workflows.
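
To make the idea concrete, here is a minimal sketch of what replaying logged requests and scoring them for accuracy, latency, and cost could look like. It is not Velvet's API: `LoggedRequest`, `Completion`, `evaluate`, `stub_model`, and the flat `usd_per_1k_tokens` price are all hypothetical names and assumptions chosen for illustration, and accuracy is simplified to a case-insensitive exact match.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class LoggedRequest:
    """A hypothetical logged request with a reference answer to score against."""
    prompt: str
    expected: str


@dataclass
class Completion:
    """What a (hypothetical) model call returns for one prompt."""
    text: str
    latency_ms: float
    prompt_tokens: int
    completion_tokens: int


@dataclass
class EvalResult:
    accuracy: float
    mean_latency_ms: float
    total_cost_usd: float


def evaluate(
    logs: Sequence[LoggedRequest],
    call_model: Callable[[str], Completion],
    usd_per_1k_tokens: float,
) -> EvalResult:
    """Replay logged prompts through one model configuration and summarize it."""
    if not logs:
        raise ValueError("need at least one logged request")

    completions = [call_model(request.prompt) for request in logs]

    # Accuracy here is a case-insensitive exact match against the reference.
    correct = sum(
        completion.text.strip().lower() == request.expected.strip().lower()
        for request, completion in zip(logs, completions)
    )

    # Assumed pricing model: one flat rate for prompt and completion tokens.
    total_tokens = sum(c.prompt_tokens + c.completion_tokens for c in completions)

    return EvalResult(
        accuracy=correct / len(logs),
        mean_latency_ms=sum(c.latency_ms for c in completions) / len(completions),
        total_cost_usd=total_tokens / 1000 * usd_per_1k_tokens,
    )


def stub_model(prompt: str) -> Completion:
    """Stand-in for a real model call so the sketch runs offline."""
    canned = {"2+2?": "4", "Capital of France?": "Lyon"}
    return Completion(
        text=canned[prompt], latency_ms=120.0, prompt_tokens=10, completion_tokens=5
    )


logs = [
    LoggedRequest(prompt="2+2?", expected="4"),
    LoggedRequest(prompt="Capital of France?", expected="Paris"),
]
result = evaluate(logs, stub_model, usd_per_1k_tokens=0.5)
print(result)
```

Swapping `stub_model` for a different model or setting and comparing the resulting `EvalResult` values is the basic shape of an experiment. A real evaluation would likely use richer metrics than exact match.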

Watch the video to learn how it works, then log in to your workspace to try it out!

Run Evaluations with Velvet | Read the docs →