A system for detecting traffic objects  and estimating their distance

DOUAOUI Rayane

A system for detecting traffic objects and estimating their distance

Files

Mater's-Thesis-DOUAOUI Rayane- The final one.pdf (3.35 MB)

Date

2024

Authors

DOUAOUI Rayane

Publisher

université chadli ben djedid eltarf

Abstract

In recent years, advancements in computer vision have driven the automotive industry to develop driver assistance systems capable of significantly reducing road accidents. This thesis introduces a model designed for detecting road traffic objects and accurately estimating their distances. The proposed model identifies and classifies objects such as cars, trucks, and pedestrians located in front of the vehicle. Deep learning algorithms are employed to precisely determine the nature of each object. Subsequently, the model estimates both the distance between the camera and detected objects, as well as between pairs of objects. Integrating these insights enables drivers to maintain better focus on the road and gain a clearer perception of their surroundings. في السنوات الأخيرة، أدت التطورات في رؤية الحاسوب إلى دفع صناعة السيارات لتطوير أنظمة مساعدة السائق القادرة على تقليل حوادث الطرق بشكل كبير. يقدم هذا البحث نموذجًا مصممًا لاكتشاف أجسام المرور على الطرق وتقدير المسافات بينها بدقة. يقوم النموذج المقترح بتحديد وتصنيف الأجسام مثل السيارات والشاحنات والمشاة التي تقع أمام المركبة. يتم استخدام خوارزميات التعلم العميق لتحديد طبيعة كل جسم بدقة. بعد ذلك، يقوم النموذج بتقدير المسافة بين الكاميرا والأجسام المكتشفة، وكذلك بين أزواج من الأجسام. يتيح دمج هذه المعلومات للسائقين الحفاظ على تركيز أفضل على الطريق والحصول على تصور أوضح لمحيطهم. يستخدم النظام شبكة عصبية تلافيفية (CNN) تعتمد على بنية ResNet-50، مع إضافة طبقات تلافيفية إضافية لتعزيز أداء الاكتشاف. تم التدريب على مجموعة بيانات Pascal VOC 2012، وقدتحقيق دقة بنسبة 95.21٪ وخسارة تصنيف منخفضة باستخدام محلل Nadam. قارننا نموذجنا مع نموذج Faster R-CNN، وقد حقق نموذجنا أداءً أفضل. The system utilizes a Convolutional Neural Network (CNN) based on the ResNet-50 architecture, augmented with additional convolutional layers to enhance detection performance. Training was conducted on the Pascal VOC 2012 dataset, achieving an accuracy of 95.21% and minimal classification loss using the Nadam optimizer. We compared our model with the Faster R-CNN model, and ours achieved better performance.

URI

http://depotucbet.univ-eltarf.dz:4000/handle/123456789/2373

Collections

Informatique

Full item page