Update README.md
README.md
CHANGED
@@ -88,7 +88,7 @@ with torch.inference_mode():
 
 ### Results
 
-By following the **LLM-as-Juries** evaluation method, the following results were obtained using three judge models (GPT-4o, Gemini1.5 Pro
+By following the **LLM-as-Juries** evaluation method, the following results were obtained using three judge models (GPT-4o, Gemini1.5 Pro and Claude 3.5-Sonnet). These models were evaluated based on a well-defined scoring rubric specifically designed for the VQA context, with clear criteria for each score (0 to 5) to ensure the highest possible precision in meeting expectations.
 
 ![constellation](https://i.postimg.cc/kMRmcBpQ/constellation-0.png)
 
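The added README line describes three judge models each assigning a rubric score from 0 to 5. As a rough illustration only, the sketch below shows one way such per-judge scores could be combined into a single jury score. The README does not say how the scores are aggregated, so the plain average used here is an assumption, and the judge labels and the `jury_score` helper are hypothetical names chosen for this example.

```python
# Hypothetical sketch: combine per-judge rubric scores into one jury score.
# Assumption: scores are integers on the 0-5 rubric and are averaged with
# equal weight. The README does not confirm this aggregation rule.

RUBRIC_MIN = 0
RUBRIC_MAX = 5


def jury_score(scores: dict[str, int]) -> float:
    """Return the unweighted mean of the judges' rubric scores."""
    if not scores:
        raise ValueError("at least one judge score is required")
    for judge, score in scores.items():
        # Reject anything outside the 0-5 rubric range.
        if not RUBRIC_MIN <= score <= RUBRIC_MAX:
            raise ValueError(f"{judge}: score {score} is outside the rubric range")
    return sum(scores.values()) / len(scores)


# Illustrative labels for the three judges; not real API model identifiers.
example = {"gpt-4o": 4, "gemini-1.5-pro": 5, "claude-3.5-sonnet": 3}
print(jury_score(example))
```

An equal-weight mean is the simplest choice; a median or majority vote would be equally plausible readings of a jury setup and may be what the authors actually used.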