Alina Lozovskaya
alozowski
AI & ML interests
NLP in all aspects
Organizations
alozowski's activity
Voting System: You can vote for your own model.
1
#851 opened 3 days ago
by
nlpguy
Is there an issue with adding bos in the new evaluation?
1
#852 opened 3 days ago
by
lingyun1
Model deleted from Pending
6
#850 opened 3 days ago
by
dnhkng
dolphin-2.9.2-qwen2-72b failed, check logs
4
#824 opened 17 days ago
by
CombinHorizon
70B models FAILED
7
#830 opened 11 days ago
by
MaziyarPanahi
Latest results from eval runs are not updated on Leaderboard/Content repo
1
#847 opened 4 days ago
by
pankajmathur
[BUG] Gemma2-9b-it evaluation
2
#849 opened 4 days ago
by
DeepMount00
model evaluation results not updated on the leaderboard
1
#846 opened 4 days ago
by
Azure99
Wrong results or am I understanding something wrong?
8
#839 opened 9 days ago
by
nicobuko
State of Open LLM Leaderboard v2 evals and Reproducibility Issues.
8
#829 opened 12 days ago
by
pankajmathur
Submitted models aren't showing up
4
#835 opened 10 days ago
by
Stark2008
'Running' from the first day of the new leaderboard to pending and not showing anymore
1
#845 opened 6 days ago
by
DavidGF
The problem about the overall score of BBH and GPQA datasets
1
#842 opened 7 days ago
by
Amigozyq
submission-system-update
4
#844 opened 6 days ago
by
alozowski
Model not on pending for evaluation
3
#841 opened 7 days ago
by
acbdkk
Gemma-2-9B-it scores
2
#843 opened 6 days ago
by
saishf
WizardLM-8x22B Evaluation failed
25
#823 opened 17 days ago
by
llama-anon
submit-system-update
12
#838 opened 9 days ago
by
alozowski
bump-up-gradio_leaderboard
6
#836 opened 9 days ago
by
alozowski
v2 voting
23
#831 opened 11 days ago
by
lucyknada
Feature Request for Leaderboard: date added to hub
8
#425 opened 8 months ago
by
madmaxbr5
Model Eval Failed: Tess-v2.5.2-Qwen2-72B
3
#826 opened 13 days ago
by
migtissera
The results of BBH are inconsistent with the official results of Qwen2
1
#827 opened 13 days ago
by
peels7877
Raw results to normalized results
1
#825 opened 13 days ago
by
Ilyasch2
I am getting this Base model "mistralai/Mistral-7B-Instruct-v0.2" was not found or misconfigured on the hub!
4
#819 opened 19 days ago
by
rootxhacker
RecurrentGemma - add the rest of the models!
4
#800 opened 25 days ago
by
devingulliver
Average column values
5
#821 opened 18 days ago
by
Stark2008
It seems that PHI-3 is the best...
2
#816 opened 19 days ago
by
ZeroWw
Archive of the last leaderboard
5
#807 opened 23 days ago
by
MarxistLeninist
Some models are tagged with incorrect model types
3
#806 opened 23 days ago
by
scinerd68
Model glm-4-9b-chat 128K and 1M are missing
2
#817 opened 19 days ago
by
ZeroWw
Failed evaluation for Miqu-70B
3
#812 opened 23 days ago
by
llama-anon
cannot load results
2
#811 opened 23 days ago
by
ysharma1126
Leaderboard data
3
#813 opened 21 days ago
by
Stark2008
fix-merged-column
1
#810 opened 23 days ago
by
alozowski
submission-fix
19
#803 opened 24 days ago
by
alozowski
No good way to identify number of activated parameters causes Mixtral evaluation failures
32
#680 opened 3 months ago
by
0-hero
70B models failed
3
#756 opened about 2 months ago
by
MaziyarPanahi
Model Submission Finished but Not Listed in Results
7
#747 opened 2 months ago
by
Stefan171
Eval Failed
2
#146 opened about 2 months ago
by
ajibawa-2023
New activity in
open-llm-leaderboard-old/details_Ramikan-BR__tinyllama_PY-CODER-4bit-lora_4k-v12
about 2 months ago
Create README.md
#1 opened about 2 months ago
by
Ramikan-BR
Failed eval
6
#125 opened 3 months ago
by
KnutJaegersberg
Leaderboard stuck?
1
#754 opened about 2 months ago
by
DreamGenX
Update LB to latest transformers
1
#751 opened 2 months ago
by
MaziyarPanahi
bump-transformers-to-4.41.1
2
#753 opened about 2 months ago
by
alozowski
DBRX-Instruct evaluation failed, likely due to model size (132B params)
3
#121 opened 3 months ago
by
abhi-db
apply-ruff
7
#748 opened 2 months ago
by
alozowski
Feature Request: Multilingual Evaluations 🌐
1
#745 opened 2 months ago
by
eliot-christon
Models that used Nectar dataset
13
#749 opened 2 months ago
by
Stark2008