Collections

Discover the best community collections!

Collections including paper arxiv:2310.08491
LLM as a Judge
Curated resources that support the use of LLMs to serve as automatic evaluators of other LLM outputs.
Tailor-made LLM evaluations: custom evaluations for your LLM
Collection of articles and resources focusing on automatic evaluation for LLM's and their role as unbiased judges in assessing other LLMs' outputs
Papers: LLM as a Judge
Collection by Apr 10
LLM-evaluation
Evaluation of LLM agents paper
🧑‍⚖️ LLM-as-a-judge
Collection by Apr 24
The Feedback Collection
Dataset and Model for "Prometheus: Inducing Fine-grained Evaluation Capability in Language Models"