logo News Newest Ask Show Jobs Built with Nuxt.js

Confidence estimation is a better metric than agreement for LLM judges

(arxiv.org)

3 points | by rapiddev 1 hour ago

0 comments