Aircraft Detection on the MADD Dataset Using YOLOv8


This project focuses on object detection of military aircraft using the MADD dataset, leveraging the YOLOv8 architecture. The goal was to build a lightweight and effective pipeline for real-world application with reliable performance.




Dataset Preparation

The MADD dataset was structured to follow the YOLOv8 training format. A preprocessing script was written to collect images from 85 subfolders and align them with the corresponding label files. A link to the dataset is below :

https://www.kaggle.com/datasets/a2015003713/militaryaircraftdetectiondataset

python:

# folderstructure.py source_images_root = '.../Datasets/Images' source_labels = '.../train' target_images = 'dataset/images/train' target_labels = 'dataset/labels/train' os.makedirs(target_images, exist_ok=True) os.makedirs(target_labels, exist_ok=True) for folder in os.listdir(source_images_root): ... shutil.copy2(src_img, dst_img) ... if os.path.exists(src_lbl): shutil.copy2(src_lbl, dst_lbl)

This ensured a clean and consistent training folder layout compatible with Ultralytics YOLOv8.

Model and Training

The detection model was trained using the YOLOv8s architecture. The following script defines a minimal training routine:

python :
# yolov8train.py from ultralytics import YOLO model = YOLO('yolov8s.pt') model.train( data='data.yaml', epochs=5, batch=8, name="aircraftdetector" )

The training process was executed for 5 epochs with a batch size of 8. The model weights and logs are stored under the runs/detect/aircraftdetector directory.

Results

The trained model demonstrated strong detection capability, even in complex aerial scenes with overlapping aircraft and varied scales. Evaluation metrics and inference visuals will be shared in the GitHub repository. 



GitHub Repository : https://github.com/reaperae/madd-aircraft-detection 

Comments

  1. The article provides an insightful technical analysis of modernizing aircraft localization frameworks by utilizing the Multi-Perspective Aircraft Dataset (MAD/MADD) to train deep convolutional architectures. By introducing multi-angle imagery that spans oblique, ground-level, and top-down views, the research addresses the structural limitations of conventional single-perspective imagery under challenging weather and lighting conditions. The primary objective is to prove how optimizing localized loss functions, like adaptive threshold focal loss, can dramatically mitigate class imbalances and increase bounding-box regression accuracy across highly similar aviation targets.

    ReplyDelete
  2. Overcoming severe inter-class similarities and orientation variations within a unified deep learning network represents a significant advance in standard computer vision workflows. For undergraduate engineers exploring specialized localization algorithms within Image Processing Projects For Final Year, implementing multi-view training frameworks yields highly generalizable systems capable of identifying distorted target profiles. Integrating contextual attention mechanisms or dynamic visual centers allows developers to maintain superior mean average precision without demanding prohibitive compute resources.

    Furthermore, the post's focus on validating state-of-the-art detectors across diverse environmental backgrounds highlights the absolute necessity of robust feature aggregation maps. For student research teams designing advanced Object Detection Projects, using this diverse multi-perspective dataset offers a highly stable baseline for training resilient localization networks. The detailed methodology provides a clear pathway for deploying intelligent surveillance and air traffic monitoring applications capable of processing intricate spatial feeds in real time.

    ReplyDelete

Post a Comment