diff --git a/README.md b/README.md index d83561b..750f99f 100644 --- a/README.md +++ b/README.md @@ -1,85 +1,95 @@ -# Lama-cleaner: Image inpainting tool powered by SOTA AI model +

Lama Cleaner

+

A free and open-source inpainting tool powered by SOTA AI model.

-[![Downloads](https://pepy.tech/badge/lama-cleaner)](https://pepy.tech/project/lama-cleaner) -[![Downloads](https://pepy.tech/badge/lama-cleaner/month)](https://pepy.tech/project/lama-cleaner) -![version](https://img.shields.io/pypi/v/lama-cleaner) -Open in Colab +

+ + total download + + + version + + + Open in Colab + + + Sanster + +

+![img](./assets/dark.png) -https://user-images.githubusercontent.com/3998421/153323093-b664bb68-2928-480b-b59b-7c1ee24a4507.mp4 +## Features -- [x] Support multiple model architectures +- Completely free and open-source +- Fully self-hosted +- Multiple SOTA AI models 1. [LaMa](https://github.com/saic-mdal/lama) 1. [LDM](https://github.com/CompVis/latent-diffusion) -- [x] Support CPU & GPU -- [x] High resolution support -- [x] Run as a desktop APP -- [x] Multi stroke support. Press and hold the `cmd/ctrl` key to enable multi stroke mode. -- [x] Zoom & Pan + 1. [ZITS](https://github.com/DQiaole/ZITS_inpainting) +- Support CPU & GPU +- Various high-resolution image processing [strategy](#high-resolution-strategy) +- Run as a desktop APP -## Install +## Usage + +| Usage | Before | After | +| ---------------------- | --------------------------------------------- | --------------------------------------------------- | +| Remove unwanted things | ![unwant_object2](./assets/unwant_object.jpg) | ![unwant_object2](./assets/unwant_object_clean.jpg) | +| Remove unwanted person | ![unwant_person](./assets/unwant_person.jpg) | ![unwant_person](./assets/unwant_person_clean.jpg) | +| Remove watermark | ![watermark](./assets/watermark.jpg) | ![watermark_clean](./assets/watermark_cleanup.jpg) | +| Fix old photo | ![oldphoto](./assets/old_photo.jpg) | ![oldphoto_clean](./assets/old_photo_clean.jpg) | + +## Quick Start ```bash pip install lama-cleaner -lama-cleaner --device=cpu --port=8080 +# Model will be downloaded automatically +lama-cleaner --model=lama --device=cpu --port=8080 +# Lama Cleaner is now running at http://localhost:8080 ``` -Available commands: +Available arguments: -| Name | Description | Default | -| ---------- | ------------------------------------------------ | -------- | -| --model | lama or ldm. See details in **Model Comparison** | lama | -| --device | cuda or cpu | cuda | -| --gui | Launch lama-cleaner as a desktop application | | -| --gui_size | Set the window size for the application | 1200 900 | -| --input | Path to image you want to load by default | None | -| --port | Port for flask web server | 8080 | -| --debug | Enable debug mode for flask web server | | +| Name | Description | Default | +| ---------- | ---------------------------------------------------------------- | -------- | +| --model | lama/ldm/zits. See details in [Inpaint Model](#inpainting-model) | lama | +| --device | cuda or cpu | cuda | +| --port | Port for backend flask web server | 8080 | +| --gui | Launch lama-cleaner as a desktop application | | +| --gui_size | Set the window size for the application | 1200 900 | +| --input | Path to image you want to load by default | None | +| --debug | Enable debug mode for flask web server | | -## Settings +## Inpainting Model -You can change the configs of inpainting process in the settings interface of the web page. +| Model | Description | Config | +| ----- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | +| LaMa | :+1: Generalizes well on high resolutions(~2k)
| | +| LDM | :+1: Possiblablity to get better and more detail result
:+1: The balance of time and quality can be achieved by adjusting `steps`
:neutral_face: Slower than GAN model
:neutral_face: Need more GPU memory | `Steps`: You can get better result with large steps, but it will be more time-consuming
`Sampler`: ddim or [plms](https://arxiv.org/abs/2202.09778). In general plms can get [better results](https://github.com/Sanster/lama-cleaner/releases/tag/0.13.0) with fewer steps | +| ZITS | :+1: Better holistic structures compared with previous methods
:neutral_face: Wireframe module is **very** slow on CPU | `Wireframe`: Enable edge and line detect | - - -### Inpainting Model - -Select the inpainting model to use, and set the configs corresponding to the model. - -LaMa model has no configs that can be specified at runtime. - -LDM model has two configs to control the quality of final result: -1. Steps: You can get better result with large steps, but it will be more time-consuming -2. Sampler: ddim or [plms](https://arxiv.org/abs/2202.09778). In general plms can get better results with fewer steps - - -### High Resolution Strategy - -There are three strategies for handling high-resolution images. - -- **Original**: Use the original resolution of the picture, suitable for picture size below 2K. -- **Resize**: Resize the longer side of the image to a specific size(keep ratio), then do inpainting on the resized image. -The inpainting result will be pasted back on the original image to make sure other part of image not loss quality. -- **Crop**: Crop masking area from the original image to do inpainting, and paste the result back. -Mainly for performance and memory reasons on high resolution image. This strategy may give better results for ldm model. - - -## Model Comparison - -| Model | Pron | Corn | -|-------|-----------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------| -| LaMa | - Perform will on high resolution image(~2k)
- Faster than diffusion model | | -| LDM | - It's possible to get better and more detail result, see example below
- The balance of time and quality can be achieved by steps | - Slower than GAN model
- Need more GPU memory
- Not good for high resolution images | +### LaMa vs LDM | Original Image | LaMa | LDM | | ----------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------ | ----------------------------------------------------------------------------------------------------------------------------------------------------- | | ![photo-1583445095369-9c651e7e5d34](https://user-images.githubusercontent.com/3998421/156923525-d6afdec3-7b98-403f-ad20-88ebc6eb8d6d.jpg) | ![photo-1583445095369-9c651e7e5d34_cleanup_lama](https://user-images.githubusercontent.com/3998421/156923620-a40cc066-fd4a-4d85-a29f-6458711d1247.png) | ![photo-1583445095369-9c651e7e5d34_cleanup_ldm](https://user-images.githubusercontent.com/3998421/156923652-0d06c8c8-33ad-4a42-a717-9c99f3268933.png) | -Blogs about diffusion models: +### LaMa vs ZITS -- https://lilianweng.github.io/posts/2021-07-11-diffusion-models/ -- https://yang-song.github.io/blog/2021/score/ +| Original Image | ZITS | LaMa | +| ---------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------- | +| ![zits_original](https://user-images.githubusercontent.com/3998421/180464918-eb13ebfb-8718-461c-9e8b-7f6d8bb7a84f.png) | ![zits_compare_zits](https://user-images.githubusercontent.com/3998421/180464914-4db722c9-047f-48fe-9bb4-916ba09eb5c6.png) | ![zits_compare_lama](https://user-images.githubusercontent.com/3998421/180464903-ffb5f770-4372-4488-ba76-4b4a8c3323f5.png) | + +Image is from [ZITS](https://github.com/DQiaole/ZITS_inpainting) paper. I didn't find a good example to show the advantages of ZITS and let me know if you have a good example. There can also be possible problems with my code, if you find them, please let me know too! + +## High Resolution Strategy + +There are three strategies for handling high-resolution images. + +- **Original**: Use the original resolution of the picture, suitable for picture size below 2K. +- **Resize**: Resize the image to a smaller size, then do inpainting on the resized image. The inpainting area will be croped, upsampled and pasted back on the original image to make sure other part of image not loss quality. +- **Crop**: Crop masking area from the original image to do inpainting, and paste the result back. Mainly for performance and memory reasons on high resolution image. ## Development @@ -118,9 +128,3 @@ docker run --gpus all -p 8080:8080 -e CACHE_DIR=/app/models -v $(pwd)/models:/ap ``` Then open [http://localhost:8080](http://localhost:8080) - -## Like My Work? - - - Sanster - diff --git a/assets/dark.png b/assets/dark.png new file mode 100644 index 0000000..e737f78 Binary files /dev/null and b/assets/dark.png differ diff --git a/assets/old_photo.jpg b/assets/old_photo.jpg new file mode 100644 index 0000000..9539076 Binary files /dev/null and b/assets/old_photo.jpg differ diff --git a/assets/old_photo_clean.jpg b/assets/old_photo_clean.jpg new file mode 100644 index 0000000..f75fb7c Binary files /dev/null and b/assets/old_photo_clean.jpg differ diff --git a/assets/settings.png b/assets/settings.png deleted file mode 100644 index 6ed3258..0000000 Binary files a/assets/settings.png and /dev/null differ diff --git a/assets/unwant_object.jpg b/assets/unwant_object.jpg new file mode 100644 index 0000000..092a033 Binary files /dev/null and b/assets/unwant_object.jpg differ diff --git a/assets/unwant_object_clean.jpg b/assets/unwant_object_clean.jpg new file mode 100644 index 0000000..9e3b9d5 Binary files /dev/null and b/assets/unwant_object_clean.jpg differ diff --git a/assets/unwant_person.jpg b/assets/unwant_person.jpg new file mode 100644 index 0000000..02c5028 Binary files /dev/null and b/assets/unwant_person.jpg differ diff --git a/assets/unwant_person_clean.jpg b/assets/unwant_person_clean.jpg new file mode 100644 index 0000000..f4d0958 Binary files /dev/null and b/assets/unwant_person_clean.jpg differ diff --git a/assets/watermark.jpg b/assets/watermark.jpg new file mode 100644 index 0000000..2eaef9d Binary files /dev/null and b/assets/watermark.jpg differ diff --git a/assets/watermark_cleanup.jpg b/assets/watermark_cleanup.jpg new file mode 100644 index 0000000..907122e Binary files /dev/null and b/assets/watermark_cleanup.jpg differ