FinBen / leaderboard.csv
Jimin Huang
feature: add auto evaluation tab
55328ae
raw
history blame contribute delete
No virus
415 Bytes
ChatGPT,0.78,0.78,,0.77,0.77,0.58,0.60,0.53,-0.025,0.50,0.005,0.55,0.01
GPT-4,0.76,0.78,,0.86,0.83,0.63,0.76,0.54,0.03,0.52,0.02,0.57,0.01
BloombergGPT,,0.51,0.75,0.82,0.61,,0.43,,,,,,
FinMA-7B,0.86,0.86,0.84,0.98,0.75,0.06,0.25,0.48,0.04,0.50,0.00,0.56,-0.02
FinMA-30B,0.87,0.88,0.87,0.97,0.62,0.11,0.40,0.47,0.04,0.49,0.00,0.43,-0.05
FinMA-7B-full,0.88,0.88,0.83,0.97,0.67,0.06,0.32,0.51,0.06,0.52,0.03,0.52,0.04