Bhookya Nageswararao Naik, Malmathanraj Ramanathan, Palanisamy Ponnusamy
Journal of Electronic Imaging, Vol. 32, Issue 03, 033039, (June 2023) https://doi.org/10.1117/1.JEI.32.3.033039
TOPICS: Object detection, Diseases and disorders, Data modeling, Education and training, Computer vision technology, Image classification, Performance modeling, Image processing, Neurological disorders, Machine learning
Agriculture is a crucial sector for every country’s economy, as it provides livestock and crops that are essential for farmers and their families’ survival. Crop diseases significantly impact farmers’ livelihood, making early detection and accurate diagnosis essential for informed crop management decisions that can improve yield and crop quality. While the manual diagnosis of plant diseases has limitations, research on automatic plant disease identification and categorization using computer vision algorithms is a popular research topic. We focus on leaf disease recognition and compare classification techniques to object detection techniques. Classification techniques in leaf disease recognition involve assigning an input image of a leaf to one of the pre-defined classes or categories, such as “healthy” or “diseased.” Object detection techniques, on the other hand, involve identifying the presence of an object of interest (e.g., a leaf) in an image and locating its boundaries, in addition to classifying the object. The advantage of object detection techniques over classification techniques in leaf disease recognition is that they provide additional information about the location of the disease on the leaf, which can be useful for tasks such as disease diagnosis and monitoring. In our work, 5100 chilli leaf images of 6 different classes were collected and labeled using the makeSense artificial intelligence (AI) annotating tool; we applied the YOLOv5s object detection model, producing a precision of 0.951, recall of 0.926, mAP@0.5 of 0.959, mAP@0.5:0.95 of 0.826, and accuracy of 95%. We also proposed an enhanced single-stage object detection model that employs the exponential linear unit activation function in the convolution layers and the sigmoid activation function at the output layer, resulting in a precision of 0.998, recall of 0.998, mAP@0.5 of 0.995, mAP@0.5:0.95 of 0.975, and accuracy of 99.83% for the chilli leaf dataset.