目录

ImageCritic

The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment

HuggingFace HuggingFace HuggingFace HuggingFace HuggingFace

🖼️ Visual Results

🔧 Dependencies and Installation

We recommend using Python 3.10 and PyTorch with CUDA support. To set up the environment:

# Create a new conda environment
conda create -n imagecritic python=3.10
conda activate imagecritic

# Install other dependencies
pip install -r requirements.txt

⚡ Quick Inference

Tips

Due to copyright issues, we have embedded the download of the kontext model weights in the inference code below, You can run following inference code directly. If you have already downloaded the corresponding model, you can comment out the related code and directly replace the inference path.

Local Gradio Demo

python app.py

How to use

During testing, if the details that need to be fixed are located in a very low-resolution area, you should expand the bounding box to cover a larger region. Try to include both the target area to be fixed and some of the surrounding context, as illustrated in the example.

Since the method is based on local inpainting, it cannot replace objects when the difference is too large. If you need to replace an entire object, you must manually paint a black mask (using any drawing tool) over the part to be replaced, and then feed it into the model to perform the replacement.

Single case inference

python infer.py

Single Model Download

You can download the base model FLUX.1-Kontext-dev directly from Hugging Face.

Alternatively, you can download it via the following command
(⚠️ Remember to replace your_hf_token in the script with your actual Hugging Face access token):

python ./download_kontext.py

You can download our ImageCritic directly from Hugging Face.

Alternatively, you can download it via following code:

python ./download_imageCritic.py

Or using Git:

git lfs install
git clone https://huggingface.co/ziheng1234/ImageCritic

Dataset Download

You can download our training dataset Critic-10K directly from Hugging Face.

Alternatively, you can download it via Python:

python /raid/users/oyzh/ImageCritic/download_dataset.py

Or using Git:

git lfs install
git clone https://huggingface.co/datasets/ziheng1234/Critic-10K

Online HuggingFace Demo

You can try ImageCritic demo on HuggingFace.

Citation

If ImageCritic is helpful, please help to ⭐ the repo.

If you find this project useful for your research, please consider citing our paper.

📧 Contact

If you have any comments or questions, please open a new issue or contact Ziheng Ouyang

License

Licensed under a Creative Commons Attribution-NonCommercial 4.0 International for Non-commercial use only. Any commercial use should get formal permission first.

关于
33.5 MB
邀请码
    Gitlink(确实开源)
  • 加入我们
  • 官网邮箱:gitlink@ccf.org.cn
  • QQ群
  • QQ群
  • 公众号
  • 公众号

版权所有:中国计算机学会技术支持:开源发展技术委员会
京ICP备13000930号-9 京公网安备 11010802032778号