diff --git a/README.md b/README.md
index e992f2e..e6a167d 100644
--- a/README.md
+++ b/README.md
@@ -2,84 +2,58 @@
 
 Run Stable Diffusion on your machine with a nice UI without any hassle!
 
-This repository provides the [WebUI](https://github.com/hlky/stable-diffusion-webui) as a docker image for easy setup and deployment.
-
-Now with experimental support for 2 other forks:
-
-- [AUTOMATIC1111](./AUTOMATIC1111/) (Stable, very few bugs!)
-- [lstein](./lstein/)
-
-NOTE: big update coming up!
+This repository provides multiple UIs for you to play around with stable diffusion:
 
 ## Features
 
-- Interactive UI with many features, and more on the way!
-- Support for 6GB GPU cards.
-- GFPGAN for face reconstruction, RealESRGAN for super-sampling.
-- Experimental:
-  - Latent Diffusion Super Resolution
-  - GoBig
-  - GoLatent
-- many more!
+### AUTOMATIC1111
 
-## Setup
+[AUTOMATIC1111's fork](https://github.com/AUTOMATIC1111/stable-diffusion-webui) is imho the most feature rich yet elegant UI:
 
-Make sure you have an **up to date** version of docker installed. Download this repo and run:
+- Text to image, with many samplers and even negative prompts!
+- Image to image, with masking, cropping, in-painting, out-painting, variations.
+- GFPGAN, RealESRGAN, LDSR, CodeFormer.
+- Loopback, prompt weighting, prompt matrix, X/Y plot
+- Live preview of the generated images.
+- Highly optimized 4GB GPU support, or even CPU only!
+- [Full feature list here](https://github.com/AUTOMATIC1111/stable-diffusion-webui-feature-showcase)
 
-```
-docker compose build
-```
+| Text to image | Image to image | Extras |
+| ---------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------- |
+| ![](https://user-images.githubusercontent.com/24505302/189541954-46afd772-d0c8-4005-874c-e2eca40c02f2.jpg) | ![](https://user-images.githubusercontent.com/24505302/189541956-5b528de7-1b5d-479f-a1db-d3f5a53afc59.jpg) | ![](https://user-images.githubusercontent.com/24505302/189541957-cf78b352-a071-486d-8889-f26952779a61.jpg) |
 
-you can let it build in the background while you download the different models
+### hlky
 
-- [Stable Diffusion v1.4 (4GB)](https://www.googleapis.com/storage/v1/b/aai-blog-files/o/sd-v1-4.ckpt?alt=media), rename to `model.ckpt`
-- (Optional) [GFPGANv1.3.pth (333MB)](https://github.com/TencentARC/GFPGAN/releases/download/v1.3.0/GFPGANv1.3.pth).
-- (Optional) [RealESRGAN_x4plus.pth (64MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth) and [RealESRGAN_x4plus_anime_6B.pth (18MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.2.4/RealESRGAN_x4plus_anime_6B.pth).
-- (Optional) [LDSR (2GB)](https://heibox.uni-heidelberg.de/f/578df07c8fc04ffbadf3/?dl=1) and [its configuration](https://heibox.uni-heidelberg.de/f/31a76b13ea27482981b4/?dl=1), rename to `LDSR.ckpt` and `LDSR.yaml` respectively.
-
+[hlky's fork](https://github.com/hlky/stable-diffusion-webui) is one of the most popular UIs, with many features:
 
-Put all of the downloaded files in the `models` folder, it should look something like this:
+- Text to image, with many samplers
+- Image to image, with masking, cropping, in-painting, variations.
+- GFPGAN, RealESRGAN, LDSR, GoBig, GoLatent
+- Loopback, prompt weighting
+- 6GB or even 4GB GPU support!
+- [Full feature list here](https://github.com/sd-webui/stable-diffusion-webui/blob/master/README.md)
 
-```
-models/
-├── model.ckpt
-├── GFPGANv1.3.pth
-├── RealESRGAN_x4plus.pth
-├── RealESRGAN_x4plus_anime_6B.pth
-├── LDSR.ckpt
-└── LDSR.yaml
-```
+Screenshots:
 
-## Run
+| Text to image | Image to image | Image Lab |
+| ---------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------- |
+| ![](https://user-images.githubusercontent.com/24505302/189541298-f902b021-a1eb-4e4b-b2eb-b6a696a8ec80.jpg) | ![](https://user-images.githubusercontent.com/24505302/189541295-7d7f2162-2189-4e0a-abbd-703f4779e1cd.jpg) | ![](https://user-images.githubusercontent.com/24505302/189541294-aa7f7735-a973-4e17-ada0-1fe3acbb1772.jpg) |
 
-After the build is done, you can run the app with:
+### lstein
 
-```
-docker compose up --build
-```
+[lstein's fork](https://github.com/lstein/stable-diffusion) is very mature when it comes to the cli, but less so for the WebUI.
 
-Will start the app on http://localhost:7860/
+## Setup & Usage
 
-Note: the first start will take sometime as some other models will be downloaded, these will be cached in the `cache` folder, so next runs are faster.
+Visit the wiki for [Setup](https://github.com/AbdBarho/stable-diffusion-webui-docker/wiki/Setup) and [Usage](https://github.com/AbdBarho/stable-diffusion-webui-docker/wiki/Usage) instructions, checkout the [FAQ](https://github.com/AbdBarho/stable-diffusion-webui-docker/wiki/FAQ) page if you face any problems, or create a new issue!
 
-### FAQ
-
-You can find fixes to common issues [in the wiki page.](https://github.com/AbdBarho/stable-diffusion-webui-docker/wiki/FAQ)
-
-## Config
-
-in the `docker-compose.yml` you can change the `CLI_ARGS` variable, which contains the arguments that will be passed to the WebUI. By default: `--extra-models-cpu --optimized-turbo` are given, which allow you to use this model on a 6GB GPU. However, some features might not be available in the mode. [You can find the full list of arguments here.](https://github.com/hlky/stable-diffusion-webui/blob/2b1ac8daf7ea82c6c56eabab7e80ec1c33106a98/scripts/webui.py)
-
-You can set the `WEBUI_SHA` to [any SHA from the main repo](https://github.com/hlky/stable-diffusion/commits/main), this will build the container against that commit. Use at your own risk.
-
-# Disclaimer
+## Disclaimer
 
 The authors of this project are not responsible for any content generated using this interface.
 
 This license of this software forbids you from sharing any content that violates any laws, produce any harm to a person, disseminate any personal information that would be meant for harm, spread misinformation and target vulnerable groups. For the full list of restrictions please read [the license](./LICENSE).
 
-# Thanks
+## Thanks
 
 Special thanks to everyone behind these awesome projects, without them, none of this would have been possible: