README.md · unvailai/3DKX_1.0b at b00ee7cae9305ee814eede5b54853798b5e0f292

metadata

model:
  base_learning_rate: 0.0001
  target: ldm.models.diffusion.ddpm.LatentDiffusion
  params:
    linear_start: 0.00085
    linear_end: 0.012
    num_timesteps_cond: 1
    log_every_t: 200
    timesteps: 1000
    first_stage_key: jpg
    cond_stage_key: txt
    image_size: 64
    channels: 4
    cond_stage_trainable: false
    conditioning_key: crossattn
    monitor: val/loss_simple_ema
    scale_factor: 0.18215
    use_ema: false
    scheduler_config:
      target: ldm.lr_scheduler.LambdaLinearScheduler
      params:
        warm_up_steps:
          - 10000
        cycle_lengths:
          - 10000000000000
        f_start:
          - 0.000001
        f_max:
          - 1
        f_min:
          - 1
    unet_config:
      target: ldm.modules.diffusionmodules.openaimodel.UNetModel
      params:
        image_size: 32
        in_channels: 4
        out_channels: 4
        model_channels: 320
        attention_resolutions:
          - 4
          - 2
          - 1
        num_res_blocks: 2
        channel_mult:
          - 1
          - 2
          - 4
          - 4
        num_heads: 8
        use_spatial_transformer: true
        transformer_depth: 1
        context_dim: 768
        use_checkpoint: true
        legacy: false
    first_stage_config:
      target: ldm.models.autoencoder.AutoencoderKL
      params:
        embed_dim: 4
        monitor: val/rec_loss
        ddconfig:
          double_z: true
          z_channels: 4
          resolution: 256
          in_channels: 3
          out_ch: 3
          ch: 128
          ch_mult:
            - 1
            - 2
            - 4
            - 4
          num_res_blocks: 2
          attn_resolutions: []
          dropout: 0
        lossconfig:
          target: torch.nn.Identity
    cond_stage_config:
      target: ldm.modules.encoders.modules.FrozenCLIPEmbedder
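
As a reading aid, here is a minimal Python sketch of two quantities the config above implies: the pixel resolution that corresponds to the 64x64 latent, and the learning-rate multiplier produced by the warm-up settings. It is not code from this repository. The helper names are hypothetical. It assumes the autoencoder halves the resolution once between consecutive `ch_mult` levels, and that the scheduler ramps linearly from `f_start` to `f_max` and then holds a constant value (which holds here only because `f_min` equals `f_max`). How the multiplier combines with `base_learning_rate` depends on the training script, so the sketch stops at the multiplier.

```python
# Values copied from the config above.
LATENT_SIZE = 64             # model.params.image_size
VAE_CH_MULT = [1, 2, 4, 4]   # first_stage_config.params.ddconfig.ch_mult
WARM_UP_STEPS = 10000        # scheduler_config.params.warm_up_steps
F_START = 0.000001           # scheduler_config.params.f_start
F_MAX = 1.0                  # scheduler_config.params.f_max (equal to f_min)


def vae_downsample_factor(ch_mult):
    """Assumption: one 2x downsampling between consecutive resolution levels."""
    return 2 ** (len(ch_mult) - 1)


def pixel_resolution(latent_size, ch_mult):
    """Pixel height/width that maps to a latent of the given size."""
    return latent_size * vae_downsample_factor(ch_mult)


def lr_multiplier(step):
    """Linear warm-up from F_START to F_MAX, then constant.

    Simplified: the constant tail is only valid because f_min == f_max here.
    """
    if step < WARM_UP_STEPS:
        return F_START + (F_MAX - F_START) * step / WARM_UP_STEPS
    return F_MAX


if __name__ == "__main__":
    print(pixel_resolution(LATENT_SIZE, VAE_CH_MULT))  # 512
    for step in (0, 5000, 10000):
        print(step, lr_multiplier(step))
```

Under these assumptions the 4-channel 64x64 latent corresponds to 512x512 pixel images, and the multiplier reaches 1 after 10,000 steps.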

Model name: H&A 3DKX

Model version: 1.0b

Description:

An SFW model with limited NSFW capabilities (suggestive content only) that is highly versatile for 3D renders. The model splits into two well-balanced styles, selected by how the prompt begins: start the prompt with "3d cartoon of" to give 3D characters a more cartoony face, or with "a 3d render of" for the classic 3D render style.
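
The two style prefixes can be applied with a small helper. This is only an illustrative sketch: `STYLE_PREFIXES` and `build_prompt` are hypothetical names, not part of the model or of any library, and the resulting string would be passed to whatever Stable Diffusion front end you use.

```python
# Prefix strings quoted from the description; the style keys are made up here.
STYLE_PREFIXES = {
    "cartoon": "3d cartoon of",
    "render": "a 3d render of",
}


def build_prompt(subject: str, style: str = "render") -> str:
    """Prepend the style prefix so the prompt starts with the trigger phrase."""
    if style not in STYLE_PREFIXES:
        raise ValueError(f"unknown style {style!r}; use one of {sorted(STYLE_PREFIXES)}")
    return f"{STYLE_PREFIXES[style]} {subject.strip()}"


if __name__ == "__main__":
    print(build_prompt("a knight in a misty forest", style="cartoon"))
    print(build_prompt("a knight in a misty forest", style="render"))
```

The prefix is placed at the very start of the prompt because the description says the prompt should begin with it.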

Dataset:

Between 140 and 180 pictures of 3D renders of all kinds.

Has a high success rate at:

SFW portraits, full-body poses, close-ups, etc.
versatile outputs; it isn't limited to performing well on portraits
landscapes: cyberpunk, steampunk, natural, sci-fi, etc.
2B from NieR: Automata (don't ask us why)
different body types and ethnicities
NSFW portraits, full-body poses, close-ups, etc.

What it, in theory, shouldn't excel at:

anything outside the scope of portraits, people, landscapes, game artworks, 3D sculptures, 3D fantasy, 3D film stills, etc
celebrities
highly specific animated cartoon characters
multiple subjects
highly specific video-game characters
pornography, genitalia, and highly explicit material