ISETHDR CV Experiment: Difference between revisions

From Psych 221 Image Systems Engineering
Jump to navigation Jump to search
Lschul (talk | contribs)
Lschul (talk | contribs)
Line 33: Line 33:
Next, we need to set up the camera. We selected four sensors. The ar0123at and mt9v024 are single pixel sensors for automotive applications while the ov2312 is a split pixel sensor also for automotive applications. In contrast, the imx363 is a single pixel sensor for smartphone applications.  
Next, we need to set up the camera. We selected four sensors. The ar0123at and mt9v024 are single pixel sensors for automotive applications while the ov2312 is a split pixel sensor also for automotive applications. In contrast, the imx363 is a single pixel sensor for smartphone applications.  


{|
 
! style="text-align:left;"| Sensor
{| class="wikitable"
! Resolution
! style="text-align:left; padding: 0 12px;"| Sensor
! Dynamic range
! style="padding: 0 12px;"| Pixel type
! style="text-align:center; padding: 0 12px;"| Resolution
! style="text-align:center; padding: 0 12px;"| Dynamic range
! style="padding: 0 12px;"| Application
! style="text-align:center; padding: 0 12px;"| FOV
|-
|-
|ar0132at
| style="padding: 0 12px;"| AR0132AT
|1.2 MP
| style="padding: 0 12px;"| Single
|>115 dB
| style="text-align:center; padding: 0 12px;"| 1.2 MP
| style="text-align:center; padding: 0 12px;"| 115 dB
| style="padding: 0 12px;"| Automotive
| style="text-align:center; padding: 0 12px;"| 76°
|-
|-
|mt9v024
| style="padding: 0 12px;"| MT9V024
|0.4 MP
| style="padding: 0 12px;"| Single
|>100 dB
| style="text-align:center; padding: 0 12px;"| 0.4 MP
| style="text-align:center; padding: 0 12px;"| 100 dB
| style="padding: 0 12px;"| Automotive
| style="text-align:center; padding: 0 12px;"| 69°
|-
|-
|imx363
| style="padding: 0 12px;"| IMX363
|12 MP
| style="padding: 0 12px;"| Single
|n/a
| style="text-align:center; padding: 0 12px;"| 12 MP
| style="text-align:center; padding: 0 12px;"| n/a
| style="padding: 0 12px;"| Smartphone
| style="text-align:center; padding: 0 12px;"| 21°
|-
|-
|ov2312
| style="padding: 0 12px;"| OV2312
|2 MP
| style="padding: 0 12px;"| Split
|n/a
| style="text-align:center; padding: 0 12px;"| 2 MP
| style="text-align:center; padding: 0 12px;"| 68 dB
| style="padding: 0 12px;"| Automotive
| style="text-align:center; padding: 0 12px;"| 81°
|}
|}




Automotive sensors
ar0123at: 1.2 MP, dynamic range >115 dB
mt9v024: 0.36 MP, dynamic range >100 dB
ov2312: 2 MP, dynamic range >68 dB
imx363: 12 MP
Now build the data set
use exposure time to alter dynamic range
times = 0.1, 0.5, 1, 2, 4, 6, 8, 12, 16, 20, 50, 100, 500, 1000 ms
Loop over:
4x scenes
4x lighting scenarios
4x sensors
14x exposures


total images - 896





Revision as of 19:11, 8 December 2025

Introduction

Background

Methods

Image generation

First, we need to assemble a dataset of driving images to run the YOLO algorithm on. We acquired four driving scenes from the ISR HDR Sensor Repository. Each scene includes .exr files that contain radiance data for the sky, street lights, headlights, and other lights.

ISET HDR Scenes 1112201236, 1112184733, 1113094429, and 1114031438

To set up the scenes, we consider four lighting scenarios that commonly occur during driving. The light scenarios are defined by a vector of weights for headlights, streetlights, other lights, sky map, in order. The daytime scenario has strong illumination from the sky only. The nighttime scenario is illuminated almost only by headlights and streetlights. The dusk scenario falls between day and night; it has half of the daytime sky illumination combined with headlights and streetlights. Finally, the blind scenario represents a nighttime scenario with stronger artificial lighting; the headlights and streetlights are 10x greater than in the nighttime scenario.

Illumination vector: [ headlights, streetlights, other lights, sky map ]
  • Day - [ 0, 0, 0, 50 ]
  • Night - [ 0.2, 0.001, 0, 0.0005 ]
  • Dusk - [ 0.2, 0.001, 0, 20 ]
  • Blind - [ 2, 0.1, 0, 0.0005 ]


Next, we need to set up the camera. We selected four sensors. The ar0123at and mt9v024 are single pixel sensors for automotive applications while the ov2312 is a split pixel sensor also for automotive applications. In contrast, the imx363 is a single pixel sensor for smartphone applications.


Sensor Pixel type Resolution Dynamic range Application FOV
AR0132AT Single 1.2 MP 115 dB Automotive 76°
MT9V024 Single 0.4 MP 100 dB Automotive 69°
IMX363 Single 12 MP n/a Smartphone 21°
OV2312 Split 2 MP 68 dB Automotive 81°




YOLO

Results

Conclusions

Appendix