Velvet is an AI gateway that works with any model. Use our evaluation framework to test models, settings, and metrics on your request logs.
LLMs are inherently unpredictable, which can make feature development challenging. Velvet Evaluations lets you test your request logs against different models, settings, and metrics, so you can verify that your LLM-powered features behave the way you expect.
Watch a video introduction below, or see our docs for in-depth tutorials on running evaluations in your application.
Use cases:
Continuously test AI features in production, and set alerts so you can take action.
Test models, settings, and metrics against historical request logs.
Use our data copilot to query your AI request logs with SQL.