FLUX.1-schnell-GGUF

Original Model

black-forest-labs/FLUX.1-schnell

Run with sd-api-server

  • sd-api-server version: 0.1.1

  • Run as LlamaEdge service

    wasmedge --dir .:. sd-api-server.wasm \
      --model-name flux1-schnell \
      --diffusion-model flux1-schnell-Q4_0.gguf \
      --vae ae-f16.gguf \
      --clip-l clip_l-f16.gguf \
      --t5xxl t5xxl-Q4_0.gguf
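
The command above expects the wasm binary and four model files to sit in the current directory, since `--dir .:.` maps it into the WasmEdge sandbox. A minimal pre-flight sketch follows. The helper name `check_model_files` is hypothetical and not part of sd-api-server; it only tests that each named file exists.

```shell
#!/bin/sh
# Hypothetical helper (not part of sd-api-server): report each missing file
# on stderr and return non-zero if any file is absent.
check_model_files() {
  missing=0
  for f in "$@"; do
    if [ ! -f "$f" ]; then
      echo "missing: $f" >&2
      missing=1
    fi
  done
  return "$missing"
}

# Example use with the files named in the command above, run from the
# directory that contains them:
# check_model_files sd-api-server.wasm flux1-schnell-Q4_0.gguf ae-f16.gguf \
#   clip_l-f16.gguf t5xxl-Q4_0.gguf && echo "all files present"
```

If a different quantization is swapped in, for example `flux1-schnell-Q8_0.gguf`, the file name passed to the check should change along with the `--diffusion-model` argument.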
    

Quantized GGUF Models

| Name | Quant method | Bits | Size | Use case |
| ---- | ---- | ---- | ---- | ---- |
| ae-f16.gguf | f16 | 16 | 168 MB | |
| ae.safetensors | f32 | 32 | 335 MB | |
| clip_l-Q8_0.gguf | Q8_0 | 8 | 131 MB | |
| clip_l-f16.gguf | f16 | 16 | 246 MB | |
| clip_l.safetensors | f16 | 16 | 246 MB | |
| flux1-schnell-Q4_0.gguf | Q4_0 | 4 | 6.69 GB | |
| flux1-schnell-Q4_1.gguf | Q4_1 | 4 | 7.43 GB | |
| flux1-schnell-Q5_0.gguf | Q5_0 | 5 | 8.18 GB | |
| flux1-schnell-Q5_1.gguf | Q5_1 | 5 | 8.92 GB | |
| flux1-schnell-Q8_0.gguf | Q8_0 | 8 | 12.6 GB | |
| flux1-schnell-f16.gguf | f16 | 16 | 23.8 GB | |
| t5xxl-Q2_K.gguf | Q2_K | 2 | 1.61 GB | |
| t5xxl-Q3_K.gguf | Q3_K | 3 | 2.10 GB | |
| t5xxl-Q4_0.gguf | Q4_0 | 4 | 2.75 GB | |
| t5xxl-Q4_K.gguf | Q4_K | 4 | 2.75 GB | |
| t5xxl-Q5_0.gguf | Q5_0 | 5 | 3.36 GB | |
| t5xxl-Q5_1.gguf | Q5_1 | 5 | 3.67 GB | |
| t5xxl-Q8_0.gguf | Q8_0 | 8 | 5.20 GB | |
| t5xxl-f16.gguf | f16 | 16 | 9.79 GB | |
| t5xxl_fp16.safetensors | f16 | 16 | 9.79 GB | |
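
Because a run needs one file from each of the four groups (diffusion model, VAE, CLIP-L, T5-XXL), it can help to total the sizes of a chosen combination before downloading. The sketch below sums the four files used in the example command, with sizes taken from the table. The function name `total_gb` is hypothetical, and the conversion assumes 1 GB = 1000 MB, so 168 MB and 246 MB are written as 0.168 and 0.246.

```shell
#!/bin/sh
# Hypothetical helper: sum a list of sizes given in GB and print the total
# rounded to two decimal places.
total_gb() {
  printf '%s\n' "$@" | awk '{ s += $1 } END { printf "%.2f\n", s }'
}

# flux1-schnell-Q4_0.gguf + ae-f16.gguf + clip_l-f16.gguf + t5xxl-Q4_0.gguf
total_gb 6.69 0.168 0.246 2.75   # prints 9.85
```

This is a download-size estimate only; it says nothing about memory use at inference time.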

Quantized with stable-diffusion.cpp master-64d231f.
