Offline evaluation

by kaiwang13 - opened

How can I run an offline evaluation of this benchmark locally?

kaiwang13 changed discussion status to closed

Did you find a way to do offline evaluation?

lm-evaluation-harness is applied in this leaderboard. You can conduct offline evaluation with it.


In the it says to use, but I didn't find it.
Can you please elaborate a bit on using lm-evaluation-harness to do offline validation?
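
For anyone landing here with the same question, here is a minimal sketch of how a local run with lm-evaluation-harness might be launched. It only builds and prints the command, so nothing is downloaded or executed. The flag names assume a recent (v0.4 or later) release of the harness that installs the `lm_eval` command; older releases were invoked differently, so check the README of the version you have. The model path `./my-local-model`, the task `hellaswag`, and the few-shot count are placeholders, not the leaderboard's actual configuration, and `build_lm_eval_command` is a hypothetical helper name.

```python
import shlex


def build_lm_eval_command(model_path, tasks, num_fewshot=0,
                          batch_size=8, output_path="results"):
    """Build the argument list for an lm-evaluation-harness run.

    Assumes the `lm_eval` entry point of lm-evaluation-harness v0.4+.
    All values passed in are placeholders chosen for illustration.
    """
    return [
        "lm_eval",
        "--model", "hf",                              # Hugging Face transformers backend
        "--model_args", f"pretrained={model_path}",   # local directory or Hub model id
        "--tasks", ",".join(tasks),                   # comma-separated task names
        "--num_fewshot", str(num_fewshot),
        "--batch_size", str(batch_size),
        "--output_path", output_path,                 # where result files are written
    ]


# Placeholder model path, task, and few-shot count.
cmd = build_lm_eval_command("./my-local-model", ["hellaswag"], num_fewshot=10)

# Print a shell-safe version of the command to copy into a terminal.
print(shlex.join(cmd))
```

The printed line can be pasted into a shell, or the list can be handed to `subprocess.run(cmd, check=True)` once the harness is installed. To compare against leaderboard numbers, the tasks and few-shot settings would need to match whatever the leaderboard documents.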
