Optical flow estimation from event-based cameras and spiking neural networks

Open archive

Cuadrado, Javier | Rançon, Ulysse | Cottereau, Benoit | Barranco, Francisco | Masquelier, Timothée

Published by CCSD; Frontiers

International audience. Event-based cameras are attracting growing interest within the computer vision community. These sensors operate with asynchronous pixels, each emitting an event, or “spike”, when the luminance change at that pixel since its last event exceeds a given threshold. Thanks to their low power consumption, low latency, and high dynamic range, they are particularly well suited to applications with demanding temporal constraints and safety requirements. Event-based sensors are also a natural fit for Spiking Neural Networks (SNNs), since coupling an asynchronous sensor with neuromorphic hardware can yield real-time systems with minimal power requirements. In this work, we seek to develop one such system, using event sensor data from the DSEC dataset and spiking neural networks to estimate optical flow in driving scenarios. We propose a U-Net-like SNN which, after supervised training, produces dense optical flow estimates. The training objective encourages both a minimal norm of the error vector and a minimal angle between the ground-truth and predicted flow, and the model is trained with back-propagation using a surrogate gradient. In addition, 3D convolutions allow us to capture the dynamic nature of the data by enlarging the temporal receptive fields. Upsampling after each decoding stage ensures that each decoder's output contributes to the final estimate. Thanks to separable convolutions, we have been able to develop a model that is lightweight compared to competing approaches yet still yields reasonably accurate optical flow estimates.
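As an illustration of the training objective described in the abstract (a small norm of the error vector together with a small angle between predicted and ground-truth flow), the following is a minimal sketch in plain Python. The function name `flow_loss`, the `angle_weight` parameter, the use of the angle in radians, and the simple weighted sum are assumptions made for this example; the abstract does not give the exact formulation or weighting used by the authors, and a real training loop would use a differentiable tensor library rather than Python lists.

```python
import math


def flow_loss(pred, target, angle_weight=1.0, eps=1e-8):
    """Hypothetical combined loss over per-pixel (u, v) flow vectors.

    Returns the mean norm of the error vector plus `angle_weight` times
    the mean angle (in radians) between predicted and ground-truth flow.
    This is an assumed formulation, not the authors' exact loss.
    """
    if not pred or len(pred) != len(target):
        raise ValueError("pred and target must be non-empty and equally long")

    norm_sum = 0.0
    angle_sum = 0.0
    for (pu, pv), (tu, tv) in zip(pred, target):
        # Norm of the error vector (endpoint error) at this pixel.
        norm_sum += math.hypot(pu - tu, pv - tv)

        # Angle between the two vectors; it is undefined when either
        # vector is (near) zero, so such pixels add no angular penalty.
        denom = math.hypot(pu, pv) * math.hypot(tu, tv)
        if denom > eps:
            cos = (pu * tu + pv * tv) / denom
            cos = max(-1.0, min(1.0, cos))  # guard against rounding drift
            angle_sum += math.acos(cos)

    n = len(pred)
    return norm_sum / n + angle_weight * angle_sum / n
```

With this sketch, a perfect prediction gives a loss of zero, while a prediction with the correct magnitude but the wrong direction is penalized by both terms. Setting `angle_weight=0.0` reduces it to the mean endpoint error alone.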

