RunDiffusion
/

Wonderman-Flux-POC

@@ -88,13 +88,13 @@ license_link: https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICE
 Wonderman is in the public domain, so it can be freely shared, except where restricted by Flux's non-commercial license.
 Flux thinks that "Wonderman" is "Superman"
-![Flux thinks that "Wonderman" is "Superman"](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Huggingface-assets/superman-flux.jpg)
 ## Data Used for Training
-You can view the training data here in [this repo here](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/tree/main/Raw%20Low%20Quality%20Data).
 The training data was low resolution, cropped, oddly shaped, pixelated, and overall the worst possible data we've come across. That didn't stop us! AI to the rescue!
-![Low Quality Training Data](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Huggingface-assets/multiple-samples-training-data.png)
 To fix the data we had to:
 - Inpaint problem areas like backgrounds, signatures, and text
@@ -104,14 +104,14 @@ To fix the data we had to:
 We were able to get the dataset to 13 with these techniques.
 Full dataset [is here](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/tree/main/Cleaned%20and%20Captioned%20Data)
-![Cleaned Wonderman Dataset](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Huggingface-assets/multiple-samples-of-cleaned-data.png)
 ### Captioning the Data
 We are not entirely familiar with Flux's preferred captioning style. We understand that this model responds will to full descriptive sentences so we went with that. Below are some examples of the images with their captions. We chose LLaMA v3 inspired by this paper: https://arxiv.org/html/2406.08478v1
 The system prompt used was basic and could likely benefit from further refinement.
 A vintage comic book cover of Wonderman. On the cover, there are three main characters: Wonderman in a green costume with a large 'W' on his chest, a woman in a yellow and black outfit, and a smaller figure in a brown costume. Wonderman and the woman appear to be in a dynamic pose, suggesting action or combat. Wonderman is holding a thin, sharp object, possibly a weapon. The woman has a confident expression and is looking towards the viewer. The background is a mix of green and yellow, with some abstract designs.
-![Vintage Wonderman](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Cleaned%20and%20Captioned%20Data/00008.png)
 Wonderman, a male superhero character. He is wearing a green and red costume with a large 'W' emblem on the chest. Wonderman has a muscular physique, brown hair, and is wearing a black mask covering his eyes. He stands confidently with his hands by his sides. photo
 ![Standing Wonderman](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Cleaned%20and%20Captioned%20Data/00002.png)

 Wonderman is in the public domain, so it can be freely shared, except where restricted by Flux's non-commercial license.
 Flux thinks that "Wonderman" is "Superman"
+![Flux thinks that "Wonderman" is "Superman"](Huggingface-assets/superman-flux.jpg)
 ## Data Used for Training
+You can view the [RAW low quality data here: ](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/tree/main/Raw%20Low%20Quality%20Data).
 The training data was low resolution, cropped, oddly shaped, pixelated, and overall the worst possible data we've come across. That didn't stop us! AI to the rescue!
+![Low Quality Training Data](Huggingface-assets/multiple-samples-training-data.png)
 To fix the data we had to:
 - Inpaint problem areas like backgrounds, signatures, and text
 We were able to get the dataset to 13 with these techniques.
 Full dataset [is here](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/tree/main/Cleaned%20and%20Captioned%20Data)
+![Cleaned Wonderman Dataset](Huggingface-assets/multiple-samples-of-cleaned-data.png)
 ### Captioning the Data
 We are not entirely familiar with Flux's preferred captioning style. We understand that this model responds will to full descriptive sentences so we went with that. Below are some examples of the images with their captions. We chose LLaMA v3 inspired by this paper: https://arxiv.org/html/2406.08478v1
 The system prompt used was basic and could likely benefit from further refinement.
 A vintage comic book cover of Wonderman. On the cover, there are three main characters: Wonderman in a green costume with a large 'W' on his chest, a woman in a yellow and black outfit, and a smaller figure in a brown costume. Wonderman and the woman appear to be in a dynamic pose, suggesting action or combat. Wonderman is holding a thin, sharp object, possibly a weapon. The woman has a confident expression and is looking towards the viewer. The background is a mix of green and yellow, with some abstract designs.
+![Vintage Wonderman](Cleaned and Captioned Data/00008.png)
 Wonderman, a male superhero character. He is wearing a green and red costume with a large 'W' emblem on the chest. Wonderman has a muscular physique, brown hair, and is wearing a black mask covering his eyes. He stands confidently with his hands by his sides. photo
 ![Standing Wonderman](https://huggingface.co/RunDiffusion/Wonderman-Flux-POC/resolve/main/Cleaned%20and%20Captioned%20Data/00002.png)