Confidence estimation is a better metric than agreement for LLM judges
(arxiv.org)
3 points | by rapiddev | 1 hour ago
0 comments