Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
A new tool from Microsoft aims to bridge the gap between application development and prompt engineering. Overtaxed AI developers take note. One of the problems with building generative AI into your ...