Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
February 21, 2026 • NPR investigative reporter Tom Dreisbach talks about how and why he led an ambitious team effort to preserve a comprehensive record of the events of January 6th, 2021.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results