Barcode recognition via the mobile device camera is an actual problem which occurs in various fields. To receive experimental baselines and provide methods comparison researchers require some carefully annotated data samples. For those purposes, some datasets have already been collected and published in the public domain. But they suffer from different disadvantages. In this paper, we present a novel challenging dataset designed for barcode reading quality evaluation. It is populated using the generative approach with well-established image augmentation technique application. Among them, different geometrical transformations, kind of noises, brightness and lighting variance were used. The generative approach also allowed to populate ground-truth automatically and exclude errors introduced during the manual process. The dataset consists of training (41184 images) and validation parts (10296 images). It contains seven most popular symbologies: CODABAR, CODE-39, CODE-93, CODE-128, EAN-13, UPC-A, UPC-E. As an experimental baseline, the reading quality obtained with the open-source Zxing library is provided. The dataset with all the supplementary materials is available at ftp://smartengines.com/barcode.
|