Note About Public Data Sets for Scientific Research Document Image Classification RVL-CDIP Tobacco3482 # thanks Lucia Noce for the URL Others GTSRB: German Traffic Sign Recognition Benchmark ~40 classes, ~50000 samples