Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Abstract: A novel integration scheme is proposed for the accurate numerical evaluation of test (reaction) integrals needed for solving complex direct or inverse electromagnetic problems using surface ...
Inspiration: This extension was inspired by Daniel Micah's spock-test-runner but focuses exclusively on VS Code's Test API integration rather than CodeLens functionality.
I'm creating a first version of this extension with the help of my friend, Copilot. I still need to validate it a bit more and actually use it in my own development workflow, to see how practical it ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results