• Velvet
  • Posts
  • Evaluate models, settings, and metrics

Evaluate models, settings, and metrics

Experiments and continuous monitoring

LLMs are inherently unpredictable, which can make feature-development challenging. With Velvet Evaluations, you can feel confident that your LLM-powered features will work the way you expect them to. Test your request logs against models, settings, and metrics.

  • Test for accuracy, latency, and cost.

  • Experiment with relevancy, accuracy, and quality.

  • Validate data, RAG, and agentic workflows.

Watch a video to learn how it works. And log in to your workspace to try it out!

Run Evaluations with Velvet | Read the docs