A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

From Computer Vision Freiburg:  Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we propose three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks. Our datasets are the first large-scale datasets to enable training and evaluating scene flow methods. Besides the datasets, we present a convolutional network for real-time disparity estimation that provides state-of-the-art results. By combining a flow and disparity estimation network and training it jointly, we demonstrate the first scene flow estimation with a convolutional network.

his video shows impressions from various parts of our dataset, as well as state-of-the-art realtime disparity estimation results produced by one of our new CNNs... (full paper)

 

 

Featured Product

Palladyne IQ  - Unlocking new frontiers for robotic performance.

Palladyne IQ - Unlocking new frontiers for robotic performance.

Palladyne IQ is a closed-loop autonomy software that uses artificial intelligence (AI) and machine learning (ML) technologies to provide human-like reasoning capabilities for industrial robots and collaborative robots (cobots). By enabling robots to perceive variations or changes in the real-world environment and adapt to them dynamically, Palladyne IQ helps make robots smarter today and ready to handle jobs that have historically been too complex to automate.