README.md

Manga109 Dataset

Introduction

The Manga109 dataset contains artificial images of manga (Japanese comics) and annotations for four categories (body, face, frame, and text). Many characteristics are different from natural images.

The Manga109-s dataset (87 volumes) is a subset of the full Manga109 dataset (109 volumes). Unlike the full Manga109 dataset, the Manga109-s dataset can be used by commercial organizations. For a wide range of users, we conduct experiments on Manga109-s.

Please see this page to download Manga109-s. Please see our manga109api fork to convert the dataset to COCO format. We use 68train, 4val, and 15test splits. The 15test set was selected to be well-balanced for reliable evaluation.

Results

68train, 15test (Manga109-s v2020.12.18)

Method	Backbone	Lr schd	AP	Download
Faster R-CNN	R-50	1x	65.8	model
Cascade R-CNN	R-50	1x	67.6	model
RetinaNet	R-50	1x	65.3	model
ATSS	R-50	1x	66.5	model
GFL	R-50	1x	67.3	model
DETR	R-50	1x	31.2	model
Deformable DETR	R-50	1x	64.1	model
Sparse R-CNN	R-50	1x	63.1	model
ATSS	Swin-T	1x	66.2	model
ATSS	ConvNeXt-T	1x	67.4	model
ATSS+SEPC	R-50	1x	67.1	model
ATSS+DyHead	R-50	1x	67.9	model
YOLOX-L	CSP v5	1x	70.2	model
UniverseNet	R2-50	1x	68.9	model
UniverseNet 20.08	R2-50	1x	69.9	model

In addition to ATSS+SEPC, UniverseNet uses Res2Net-v1b-50, DCN, and multi-scale training (480-960).
The settings for normalization layers (including whether to use iBN of SEPC) depend on the config files.
Most models were trained and evaluated using fp16 (mixed precision).
Each model was fine-tuned from a corresponding COCO pre-trained model.

Dataset splits

15test: ["Akuhamu", "BakuretsuKungFuGirl", "DollGun", "EvaLady", "HinagikuKenzan", "KyokugenCyclone", "LoveHina_vol01", "MomoyamaHaikagura", "TennenSenshiG", "UchiNoNyan'sDiary", "UnbalanceTokyo", "YamatoNoHane", "YoumaKourin", "YumeNoKayoiji", "YumeiroCooking"]
4val: ["HealingPlanet", "LoveHina_vol14", "SeisinkiVulnus", "That'sIzumiko"]
68train: All the other volumes

Notes

Please check the dataset licenses (Manga109, Manga109-s).
The typical scale of the original images is (1654, 1170). The number of maximum total pixels of (1216, 864) for Manga109 is almost the same as that of (1333, 800) for COCO.

Citations

Users must cite the two papers below for use in academic papers.

@article{mtap_matsui_2017,
    author={Yusuke Matsui and Kota Ito and Yuji Aramaki and Azuma Fujimoto and Toru Ogawa and Toshihiko Yamasaki and Kiyoharu Aizawa},
    title={Sketch-based Manga Retrieval using Manga109 Dataset},
    journal={Multimedia Tools and Applications},
    volume={76},
    number={20},
    pages={21811--21838},
    doi={10.1007/s11042-016-4020-z},
    year={2017}
}

@article{multimedia_aizawa_2020,
    author={Kiyoharu Aizawa and Azuma Fujimoto and Atsushi Otsubo and Toru Ogawa and Yusuke Matsui and Koki Tsubota and Hikaru Ikuta},
    title={Building a Manga Dataset ``Manga109'' with Annotations for Multimedia Applications},
    journal={IEEE MultiMedia},
    volume={27},
    number={2},
    pages={8--18},
    doi={10.1109/mmul.2020.2987895},
    year={2020}
}

Please cite the following paper for the benchmark results. https://arxiv.org/abs/2103.14027

@inproceedings{USB_shinya_BMVC2022,
  title={{USB}: Universal-Scale Object Detection Benchmark},
  author={Shinya, Yosuke},
  booktitle={British Machine Vision Conference (BMVC)},
  year={2022}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

manga109

manga109

README.md

Manga109 Dataset

Introduction

Results

68train, 15test (Manga109-s v2020.12.18)

Dataset splits

Notes

Citations

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
atss_convnext-t_p4_w7_fpn_fp16_4x4_1x_manga109s.py		atss_convnext-t_p4_w7_fpn_fp16_4x4_1x_manga109s.py
atss_r50_fpn_dyhead_fp16_4x4_1x_manga109s.py		atss_r50_fpn_dyhead_fp16_4x4_1x_manga109s.py
atss_r50_fpn_fp16_4x4_1x_manga109s.py		atss_r50_fpn_fp16_4x4_1x_manga109s.py
atss_r50_fpn_sepc_noibn_fp16_4x4_1x_manga109s.py		atss_r50_fpn_sepc_noibn_fp16_4x4_1x_manga109s.py
atss_swint_fpn_fp16_4x4_adamw_1x_manga109s.py		atss_swint_fpn_fp16_4x4_adamw_1x_manga109s.py
cascade_rcnn_r50_fpn_fp16_4x4_1x_manga109s.py		cascade_rcnn_r50_fpn_fp16_4x4_1x_manga109s.py
deformable_detr_r50_4x2x2_1x_manga109s.py		deformable_detr_r50_4x2x2_1x_manga109s.py
detr_r50_4x4_1x_manga109s.py		detr_r50_4x4_1x_manga109s.py
faster_rcnn_r50_fpn_fp16_4x4_1x_manga109s.py		faster_rcnn_r50_fpn_fp16_4x4_1x_manga109s.py
gfl_r50_fpn_fp16_4x4_1x_manga109s.py		gfl_r50_fpn_fp16_4x4_1x_manga109s.py
retinanet_r50_fpn_fp16_4x4_1x_manga109s.py		retinanet_r50_fpn_fp16_4x4_1x_manga109s.py
sparse_rcnn_r50_fpn_fp16_4x4_1x_manga109s.py		sparse_rcnn_r50_fpn_fp16_4x4_1x_manga109s.py
universenet50_2008_fp16_1x4x4_mstrain_480_960_1x_manga109s.py		universenet50_2008_fp16_1x4x4_mstrain_480_960_1x_manga109s.py
universenet50_2008_fp16_4x4_1x_manga109s.py		universenet50_2008_fp16_4x4_1x_manga109s.py
universenet50_2008_fp16_4x4_from_imagenet_mstrain_480_960_1x_manga109s.py		universenet50_2008_fp16_4x4_from_imagenet_mstrain_480_960_1x_manga109s.py
universenet50_2008_fp16_4x4_mstrain_480_960_1x_manga109s.py		universenet50_2008_fp16_4x4_mstrain_480_960_1x_manga109s.py
universenet50_fp16_4x4_mstrain_480_960_1x_manga109s.py		universenet50_fp16_4x4_mstrain_480_960_1x_manga109s.py
yolox_l_fp16_4x4_12e_manga109s.py		yolox_l_fp16_4x4_12e_manga109s.py

Files

manga109

Directory actions

More options

Directory actions

More options

Latest commit

History

manga109

Folders and files

parent directory

README.md

Manga109 Dataset

Introduction

Results

68train, 15test (Manga109-s v2020.12.18)

Dataset splits

Notes

Citations