Infrared road object detection algorithm based on spatial depth channel attention network and improved YOLOv8
CSTR:
Author:
Affiliation:

1. School of Electrical Engineering, North China University of Science and Technology, Tangshan 063210, China;2. School of Electrical Engineering and Automation, Tianjin University of Technology, Tianjin 300384, China

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Aiming at the problems of low detection accuracy and large model size of existing object detection algorithms applied to complex road scenes, an improved you only look once version 8 (YOLOv8) object detection algorithm for infrared images, F-YOLOv8, is proposed. First, a spatial-to-depth network replaces the traditional backbone network’s strided convolution or pooling layer. At the same time, it combines with the channel attention mechanism so that the neural network focuses on the channels with large weight values to better extract low-resolution image feature information; then an improved feature pyramid network of lightweight bidirectional feature pyramid network (L-BiFPN) is proposed, which can efficiently fuse features of different scales. In addition, a loss function of insertion of union based on the minimum point distance (MPDIoU) is introduced for bounding box regression, which obtains faster convergence speed and more accurate regression results. Experimental results on the FLIR dataset show that the improved algorithm can accurately detect infrared road targets in real time with 3% and 2.2% enhancement in mean average precision at 50% IoU (mAP50) and mean average precision at 50%—95% IoU (mAP50-95), respectively, and 38.1%, 37.3% and 16.9% reduction in the number of model parameters, the model weight, and floating-point operations per second (FLOPs), respectively. To further demonstrate the detection capability of the improved algorithm, it is tested on the public dataset PASCAL VOC, and the results show that F-YOLO has excellent generalized detection performance.

    Reference
    Related
    Cited by
Get Citation

LI Song, SHI Tao, JING Fangke, CUI Jie. Infrared road object detection algorithm based on spatial depth channel attention network and improved YOLOv8[J]. Optoelectronics Letters,2025,(8):491-498

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:May 21,2024
  • Revised:January 24,2025
  • Adopted:
  • Online: July 10,2025
  • Published:
Article QR Code