Low-Light Image Enhancement With Multi-Scale Attention and Frequency-Domain Optimization

The official PyTorch implementation of the paper "Low-Light Image Enhancement With Multi-Scale Attention and Frequency-Domain Optimization" (TCSVT 2024).

Abstract: Low-light image enhancement aims to improve the perceptual quality of images captured under insufficient illumination. Such images are often characterized by low visibility and noise, making the task challenging. Recently, significant progress has been made with deep learning-based approaches. Nonetheless, existing methods struggle to balance global and local illumination enhancement and may fail to suppress noise under complex lighting conditions. To address these issues, we first propose a multi-scale illumination adjustment network to balance global illumination and local contrast. Furthermore, to suppress the noise potentially amplified by the illumination adjustment, we introduce a wavelet-based attention network that efficiently perceives and removes noise in the frequency domain. We additionally incorporate a discrete wavelet transform loss to supervise the training process. Notably, the proposed wavelet-based attention network has been shown to improve the performance of existing low-light image enhancement methods, indicating that it can be flexibly combined with current approaches to yield superior enhancement results. Extensive experiments on benchmark datasets and a downstream object detection task demonstrate that our method achieves state-of-the-art performance and generalization ability.
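
The discrete wavelet transform loss mentioned in the abstract supervises training in the frequency domain. Below is a minimal sketch of such a loss, assuming a single-level Haar transform and an L1 distance between sub-bands; the sign convention, normalization, and equal sub-band weighting are illustrative assumptions, not the paper's exact formulation.

import torch.nn.functional as F

def haar_dwt(x):
    # Single-level 2D Haar DWT via strided slicing; x is (N, C, H, W) with even H and W.
    a = x[..., 0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[..., 0::2, 1::2]  # top-right
    c = x[..., 1::2, 0::2]  # bottom-left
    d = x[..., 1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2   # low-frequency approximation
    lh = (-a - b + c + d) / 2  # vertical detail
    hl = (-a + b - c + d) / 2  # horizontal detail
    hh = (a - b - c + d) / 2   # diagonal detail
    return ll, lh, hl, hh

def dwt_loss(pred, target):
    # L1 distance between corresponding wavelet sub-bands of the
    # enhanced image and the ground truth (equal weights assumed).
    return sum(F.l1_loss(p, t) for p, t in zip(haar_dwt(pred), haar_dwt(target)))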

Architecture

[Figure: overall architecture of SWANet]

Results and Pre-trained models

Related material can be found here

We provide the pre-trained models and visual results.

| Dataset         | PSNR  | SSIM  | LPIPS | Pre-trained Model | Visual Results |
|-----------------|-------|-------|-------|-------------------|----------------|
| LOL             | 25.37 | 0.859 | 0.116 | LOL ckpt          | LOL images     |
| MIT-Adobe FiveK | 25.21 | 0.896 | 0.055 | MIT ckpt          | MIT images     |

The pre-trained models are organized as below:

pretrained_model/
├── lol
│   ├── stage1.pth
│   └── stage2.pth
└── mit
    ├── stage1.pth
    └── stage2.pth

Requirements

python=3.8
pytorch=1.8.0 
cudatoolkit=11.1.1
torchvision=0.9.0

Details can be found in pytorch180.yaml
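
Assuming pytorch180.yaml is a conda environment specification (the pinned cudatoolkit version suggests so), the environment can be created with:

conda env create -f pytorch180.yaml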

Datasets

  • Low-light dataset: LOL
  • MIT-Adobe FiveK dataset: MIT
datasets/
├── LOL
│   ├── eval15
│   │   ├── high
│   │   │   ├── 111.png
│   │   │   └── ...
│   │   └── low
│   │       ├── 111.png
│   │       └── ...
│   └── our485
│       ├── high
│       │   ├── 100.png
│       │   └── ...
│       └── low
│           ├── 100.png
│           └── ...
└── MIT_Adobe_fivek_split
    ├── test
    │   ├── high
    │   └── low
    └── train
        ├── high
        └── low
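
For reference, a minimal PyTorch Dataset matching this paired low/high layout might look as follows; the class name and transform are illustrative and not the repository's actual data pipeline.

import os
from PIL import Image
from torch.utils.data import Dataset
from torchvision import transforms

class PairedLowLightDataset(Dataset):
    # Illustrative loader for a folder containing 'low' and 'high'
    # subdirectories with identically named image pairs.
    def __init__(self, root):
        self.low_dir = os.path.join(root, 'low')
        self.high_dir = os.path.join(root, 'high')
        self.names = sorted(os.listdir(self.low_dir))
        self.to_tensor = transforms.ToTensor()

    def __len__(self):
        return len(self.names)

    def __getitem__(self, idx):
        name = self.names[idx]
        low = Image.open(os.path.join(self.low_dir, name)).convert('RGB')
        high = Image.open(os.path.join(self.high_dir, name)).convert('RGB')
        return self.to_tensor(low), self.to_tensor(high)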

Training

To train new models from scratch:

Overall procedure

  1. First, train the first-stage network independently.
  2. Then use the learned first-stage weights to generate enhanced results via test.py. These enhanced images serve as the inputs for the second stage, which is trained separately.

Train first stage

  1. Set the directories for training, testing, and saving the model in the model/stage1/training.yaml file.
TRAIN_DIR: '/{dataset_path}/datasets/LOL/our485'
TEST_DIR: '/{dataset_path}/datasets/LOL/eval15'
SAVE_DIR: '/{project_path}/checkpoints'
  2. Run the code:
cd model/stage1
python train_MIANet.py

Train second stage

  1. The enhanced images are obtained using the trained first-stage network and used as the input to the second-stage network.

The second stage input is organized as below:

stage1/
└── LOL
    ├── eval15
    │   ├── high # GT
    │   └── low # generated by MIANet
    └── our485
        ├── high # GT
        └── low # generated by MIANet

Generate the input of the second stage (i.e., the output of the first stage):

# --model_type=1 means using MIANet
python test.py --model_type=1 --input_dir=datasets/LOL/eval15/low --output_dir=datasets/stage1/LOL/eval15/low --weights_1=pretrained_model/lol/stage1.pth
python test.py --model_type=1 --input_dir=datasets/LOL/our485/low --output_dir=datasets/stage1/LOL/our485/low --weights_1=pretrained_model/lol/stage1.pth

# move the GT
cp -r datasets/LOL/eval15/high  datasets/stage1/LOL/eval15/high
cp -r datasets/LOL/our485/high  datasets/stage1/LOL/our485/high
  2. Set the directories for training, testing, and saving the model in the model/stage2/training.yaml file.
TRAIN_DIR: '/{dataset_path}/datasets/stage1/LOL/our485'
TEST_DIR: '/{dataset_path}/datasets/stage1/LOL/eval15'
SAVE_DIR: '/{project_path}/checkpoints'
  3. Run the code:
cd model/stage2
python train_WNENet.py

Evaluation

To evaluate trained models:

LOL

python test.py --input_dir=datasets/LOL/eval15/low --output_dir=output/lol/ --high_dir=datasets/LOL/eval15/high --weights_1=pretrained_model/lol/stage1.pth --weights_2=pretrained_model/lol/stage2.pth

MIT

python test.py --input_dir=datasets/MIT_Adobe_fivek_split/test/low --output_dir=output/mit/ --high_dir=datasets/MIT_Adobe_fivek_split/test/high --weights_1=pretrained_model/mit/stage1.pth --weights_2=pretrained_model/mit/stage2.pth
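
To independently verify the reported PSNR/SSIM on the saved outputs, a sketch using scikit-image is shown below; the evaluation protocol assumed here (full RGB images, data range 255) may differ from the paper's.

from skimage import io
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def evaluate_pair(pred_path, gt_path):
    # Compare one enhanced image against its ground-truth counterpart.
    pred = io.imread(pred_path)
    gt = io.imread(gt_path)
    psnr = peak_signal_noise_ratio(gt, pred, data_range=255)
    # channel_axis requires scikit-image >= 0.19 (earlier versions: multichannel=True).
    ssim = structural_similarity(gt, pred, channel_axis=-1, data_range=255)
    return psnr, ssim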

Citation

If SWANet helps your research or work, please consider citing this paper.

@ARTICLE{10244055,
  author={He, Zhiquan and Ran, Wu and Liu, Shulin and Li, Kehua and Lu, Jiawen and Xie, Changyong and Liu, Yong and Lu, Hong},
  journal={IEEE Transactions on Circuits and Systems for Video Technology}, 
  title={Low-Light Image Enhancement With Multi-Scale Attention and Frequency-Domain Optimization}, 
  year={2024},
  volume={34},
  number={4},
  pages={2861-2875}
}

Contact

If you have any questions, please contact [email protected].
