Adding Evaluation Results
#6 opened 10 months ago by leaderboard-pr-bot
The model produces nonsense
9
#4 opened 12 months ago by Pkoosha
The model seems not to have a general ability
6
#3 opened 12 months ago by yuansiwe
Evaluation of long sequence of conversation
5
#1 opened 12 months ago by cooee-ashutosh