aari1995 committed on
Commit
ec93e6d
1 Parent(s): 73211b0

Update README.md

Files changed (1)
  1. README.md +11 -0
README.md CHANGED
@@ -106,6 +106,17 @@ Evaluation to come.
  **A:** In terms of flexibility, definitely. In terms of data, yes as well, as it is more up to date. On benchmarks they differ: V3 is better for longer texts, while V2 works very well for shorter texts. Keep in mind that many benchmarks do not cover cultural knowledge well.
  If you are fine with the model not knowing about developments after early 2020, I'd suggest you use [German_Semantic_V3b](https://huggingface.co/aari1995/German_Semantic_V3).
 
+ **Q: What is the difference between V3 and V3b?**
+
+ **A:** V3 scores slightly lower on benchmarks, while V3b has a knowledge cutoff in 2020, so which model to use depends on your use case.
+
+ If you want peak performance and are not too concerned about recent developments, take [V3b](https://huggingface.co/aari1995/German_Semantic_V3b).
+
+ If you are fine with sacrificing a few points on benchmarks and want the model to know what happened from 2020 on (elections, COVID, other cultural events, etc.), I'd suggest you use this one.
+
+ Also, V3 uses cls_pooling while V3b uses mean_pooling.
+
+
  **Q: How does the model perform vs. multilingual models?**
 
  **A:** There are really great multilingual models that will be very useful for many use cases. This model shines with its cultural knowledge and knowledge about German people and behaviour.
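
Note on the pooling difference mentioned in the added V3 vs. V3b answer: the sketch below illustrates, on toy data, what CLS pooling and mean pooling generally mean. It is not taken from either model's code. The function names, the toy vectors, and the attention mask are all assumptions made for illustration.

```python
from typing import List

Vector = List[float]


def cls_pooling(token_embeddings: List[Vector]) -> Vector:
    """CLS pooling: use the first token's embedding as the sentence vector."""
    return list(token_embeddings[0])


def mean_pooling(token_embeddings: List[Vector], attention_mask: List[int]) -> Vector:
    """Mean pooling: average the embeddings of all non-padding tokens."""
    kept = [vec for vec, keep in zip(token_embeddings, attention_mask) if keep]
    if not kept:
        raise ValueError("attention_mask excludes every token")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


# Toy sentence (hypothetical values): 3 token positions, 2 dimensions.
# The last position is padding and is masked out.
token_embeddings = [[1.0, 2.0], [3.0, 4.0], [9.0, 9.0]]
attention_mask = [1, 1, 0]

cls_vec = cls_pooling(token_embeddings)
mean_vec = mean_pooling(token_embeddings, attention_mask)

print(cls_vec)   # [1.0, 2.0]
print(mean_vec)  # [2.0, 3.0]
```

The two strategies produce different sentence vectors from the same token embeddings, so a model should be used with the pooling it was trained with. This matters mainly if you pool token embeddings yourself rather than relying on the pooling settings shipped with the model.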