pankajmathur
/

orca_mini_v2_7b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Pankaj Mathur commited on Jul 3, 2023

Commit

589c009

•

1 Parent(s): 15cdc0d

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -20,6 +20,10 @@ Please note this model has *better code generation capabilities* compare to our
 # Evaluation
 |||||||
 |:------:|:-------------:|:---------:|:--------:|:-------:|:--------:|
 |**Task**|**num_fewshot**|**Version**|**Metric**|**Value**|**Stderr**|
@@ -30,6 +34,7 @@ Please note this model has *better code generation capabilities* compare to our
 |*truthfulqa_mc*|0|1|mc1|0.2938|0.0159|
 |*truthfulqa_mc*|0|1|mc2|0.4399|0.0153|
 # Dataset
 We used [remove_refusals.py](https://huggingface.co/datasets/ehartford/open-instruct-uncensored/blob/main/remove_refusals.py) script from https://huggingface.co/ehartford.

 # Evaluation
+I evaluated orca_mini_v2_7b on a wide range of tasks using [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) from EleutherAI.
+Here are the results, please note num_fewshots for each task.
 |||||||
 |:------:|:-------------:|:---------:|:--------:|:-------:|:--------:|
 |**Task**|**num_fewshot**|**Version**|**Metric**|**Value**|**Stderr**|
 |*truthfulqa_mc*|0|1|mc1|0.2938|0.0159|
 |*truthfulqa_mc*|0|1|mc2|0.4399|0.0153|
 # Dataset
 We used [remove_refusals.py](https://huggingface.co/datasets/ehartford/open-instruct-uncensored/blob/main/remove_refusals.py) script from https://huggingface.co/ehartford.