flobbit commited on
Commit
fdccaec
1 Parent(s): 820d1aa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -4
README.md CHANGED
@@ -28,14 +28,16 @@ pipeline_tag: image-classification
28
  The model is used to classify images into one of the 51 North American swallowtail or cattleheart butterfly species. `resnet50` was used for training.
29
 
30
  ## Intended uses & limitations
31
- The model was trained on 8577 insect images spread over 51 species. The model is likely biased toward some species being more likely found in certain habitats.
32
 
33
  ## Training and evaluation data
34
 
35
  The images used in training were obtained from GBIF:
36
  GBIF.org (22 June 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.bqg8bw
37
 
38
- Only the first 400 images of each species (if available) were downloaded.
39
- The image set was partially cleaned for quality to remove caterpillars, poor images or butterflies that were too far away for proper ID. After "cleaning", 200 additional images were downloaded for Battus philenor and Battus polydamas (as those species had a very high percentage of caterpillar shots).
40
- The dataset is primarily "in the wild" shots rather than all staged poses, and includes images for which even an expert would not be able to see identifying characteristics (hence the lower overall accuracy). The image set had a minimum of 30 pics in a class for the less uncommon species (which is not enough for accurate training but they were included for completeness). 33 species had over 200 images (after cleaning).
 
 
41
 
 
28
  The model is used to classify images into one of the 51 North American swallowtail or cattleheart butterfly species. `resnet50` was used for training.
29
 
30
  ## Intended uses & limitations
31
+ The model was trained on 8577 insect images spread over 51 species. The model is likely biased toward some species being more commonly found in certain habitats.
32
 
33
  ## Training and evaluation data
34
 
35
  The images used in training were obtained from GBIF:
36
  GBIF.org (22 June 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.bqg8bw
37
 
38
+ Only the first 400 images of each species (if available) were downloaded. The image set was partially cleaned for quality to remove caterpillars, poor images or butterflies that were too far away for proper ID. After "cleaning", 200 additional images were downloaded for Battus philenor and Battus polydamas (as those species had a very high percentage of caterpillar shots).
39
+
40
+ The dataset is primarily "in the wild" shots rather than all staged poses, and includes images for which even an expert would not be able to see identifying characteristics (hence the lower overall accuracy).
41
+
42
+ The image set had a minimum of 30 pics in a class for the less uncommon species (which is not enough for accurate training but they were included for completeness). 33 species had over 200 images (after cleaning).
43