RunDiffusion commited on
Commit
6cae953
1 Parent(s): 331eacc
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -88,13 +88,13 @@ license_link: https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICE
88
  Wonderman is in the public domain, so it can be freely shared, except where restricted by Flux's non-commercial license.
89
 
90
  Flux thinks that "Wonderman" is "Superman"
91
- ![Flux thinks that "Wonderman" is "Superman"](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Huggingface-assets/superman-flux.jpg)
92
 
93
 
94
  ## Data Used for Training
95
- You can view the training data here in [this repo here](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/tree/main/Raw%20Low%20Quality%20Data).
96
  The training data was low resolution, cropped, oddly shaped, pixelated, and overall the worst possible data we've come across. That didn't stop us! AI to the rescue!
97
- ![Low Quality Training Data](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Huggingface-assets/multiple-samples-training-data.png)
98
 
99
  To fix the data we had to:
100
  - Inpaint problem areas like backgrounds, signatures, and text
@@ -104,14 +104,14 @@ To fix the data we had to:
104
 
105
  We were able to get the dataset to 13 with these techniques.
106
  Full dataset [is here](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/tree/main/Cleaned%20and%20Captioned%20Data)
107
- ![Cleaned Wonderman Dataset](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Huggingface-assets/multiple-samples-of-cleaned-data.png)
108
 
109
  ### Captioning the Data
110
  We are not entirely familiar with Flux's preferred captioning style. We understand that this model responds will to full descriptive sentences so we went with that. Below are some examples of the images with their captions. We chose LLaMA v3 inspired by this paper: https://arxiv.org/html/2406.08478v1
111
  The system prompt used was basic and could likely benefit from further refinement.
112
 
113
  A vintage comic book cover of Wonderman. On the cover, there are three main characters: Wonderman in a green costume with a large 'W' on his chest, a woman in a yellow and black outfit, and a smaller figure in a brown costume. Wonderman and the woman appear to be in a dynamic pose, suggesting action or combat. Wonderman is holding a thin, sharp object, possibly a weapon. The woman has a confident expression and is looking towards the viewer. The background is a mix of green and yellow, with some abstract designs.
114
- ![Vintage Wonderman](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Cleaned%20and%20Captioned%20Data/00008.png)
115
 
116
  Wonderman, a male superhero character. He is wearing a green and red costume with a large 'W' emblem on the chest. Wonderman has a muscular physique, brown hair, and is wearing a black mask covering his eyes. He stands confidently with his hands by his sides. photo
117
  ![Standing Wonderman](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Cleaned%20and%20Captioned%20Data/00002.png)
 
88
  Wonderman is in the public domain, so it can be freely shared, except where restricted by Flux's non-commercial license.
89
 
90
  Flux thinks that "Wonderman" is "Superman"
91
+ ![Flux thinks that "Wonderman" is "Superman"](Huggingface-assets/superman-flux.jpg)
92
 
93
 
94
  ## Data Used for Training
95
+ You can view the [RAW low quality data here: ](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/tree/main/Raw%20Low%20Quality%20Data).
96
  The training data was low resolution, cropped, oddly shaped, pixelated, and overall the worst possible data we've come across. That didn't stop us! AI to the rescue!
97
+ ![Low Quality Training Data](Huggingface-assets/multiple-samples-training-data.png)
98
 
99
  To fix the data we had to:
100
  - Inpaint problem areas like backgrounds, signatures, and text
 
104
 
105
  We were able to get the dataset to 13 with these techniques.
106
  Full dataset [is here](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/tree/main/Cleaned%20and%20Captioned%20Data)
107
+ ![Cleaned Wonderman Dataset](Huggingface-assets/multiple-samples-of-cleaned-data.png)
108
 
109
  ### Captioning the Data
110
  We are not entirely familiar with Flux's preferred captioning style. We understand that this model responds will to full descriptive sentences so we went with that. Below are some examples of the images with their captions. We chose LLaMA v3 inspired by this paper: https://arxiv.org/html/2406.08478v1
111
  The system prompt used was basic and could likely benefit from further refinement.
112
 
113
  A vintage comic book cover of Wonderman. On the cover, there are three main characters: Wonderman in a green costume with a large 'W' on his chest, a woman in a yellow and black outfit, and a smaller figure in a brown costume. Wonderman and the woman appear to be in a dynamic pose, suggesting action or combat. Wonderman is holding a thin, sharp object, possibly a weapon. The woman has a confident expression and is looking towards the viewer. The background is a mix of green and yellow, with some abstract designs.
114
+ ![Vintage Wonderman](Cleaned and Captioned Data/00008.png)
115
 
116
  Wonderman, a male superhero character. He is wearing a green and red costume with a large 'W' emblem on the chest. Wonderman has a muscular physique, brown hair, and is wearing a black mask covering his eyes. He stands confidently with his hands by his sides. photo
117
  ![Standing Wonderman](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Cleaned%20and%20Captioned%20Data/00002.png)