Publications


Conference Papers

ObjectFlow: A Descriptor for Classifying Traffic Motion
A. Geiger and B. Kitt (IV 2010)

IV

We present and evaluate a novel scene descriptor for classifying urban traffic by object motion. Atomic 3D flow vectors are extracted and compensated for the vehicle's egomotion, using stereo video sequences. Votes cast by each flow vector are accumulated in a bird's eye view histogram grid. Since we are directly using low-level object flow, no prior object detection or tracking is needed. We demonstrate the effectiveness of the proposed descriptor by comparing it to two simpler baselines on the task of classifying more than 100 challenging video sequences into intersection and non-intersection scenarios. Our experiments reveal good classification performance in busy traffic situations, making our method a valuable complement to traditional approaches based on lane markings.

@INPROCEEDINGS{Geiger10,
 author = {Andreas Geiger and Bernd Kitt},
 title = {ObjectFlow: A Descriptor for Classifying Traffic Motion},
 booktitle = {IEEE Intelligent Vehicles Symposium},
 year = {2010},
 month = {June},
 address = {San Diego, USA}
}


Visual Odometry based on Stereo Image Sequences
B. Kitt, A. Geiger and H. Lategahn (IV 2010)

IV

A common prerequisite for many vision-based driver assistance systems is the knowledge of the vehicle's own movement. In this paper we propose a novel approach for estimating the egomotion of the vehicle from a sequence of stereo images. Our method is directly based on the trifocal geometry between image triples, thus no time expensive recovery of the 3-dimensional scene structure is needed. The only assumption we make is a known camera geometry, where the calibration may also vary over time. We employ an Iterated Sigma Point Kalman Filter in combination with a RANSAC-based outlier rejection scheme which yields robust frame-to-frame motion estimation even in dynamic environments. A high-accuracy inertial navigation system is used to evaluate our results on challenging real-world video sequences. Experiments show that our approach is clearly superior compared to other filtering techniques in terms of both, accuracy and run-time.
C++/MATLAB source code available!

@INPROCEEDINGS{Kitt10,
 author = {Bernd Kitt and Andreas Geiger and Henning Lategahn},
 title = {Visual Odometry based on Stereo Image Sequences with RANSAC-based Outlier Rejection Scheme},
 booktitle = {IEEE Intelligent Vehicles Symposium},
 year = {2010},
 month = {June},
 address = {San Diego, USA}
}


Video-based raindrop detection for improved image registration
M. Roser and A. Geiger (VOEC at ICCV 2009)

ICCV

In this paper we present a novel approach to improved image registration in rainy weather situations. To this end, we perform monocular raindrop detection in single images based on a photometric raindrop model. Our method is capable of detecting raindrops precisely, even in front of complex backgrounds. The effectiveness is demonstrated by a significant increase in image registration accuracy which also allows for successful image restoration. Experiments on video sequences taken from within a moving vehicle prove the applicability to real-world scenarios.

@INPROCEEDINGS{Geiger09c,
 author = {Martin Roser and Andreas Geiger},
 title = {Video-based Raindrop Detection for Improved Image Registration},
 booktitle = {ICCV Workshop on Video-Oriented Object and Event Classification},
 year = {2009},
 month = {September},
 address = {Kyoto, Japan}
}


Rank Priors for Continuous Non-Linear Dimensionality Reduction
A. Geiger, R. Urtasun and T. Darrell (CVPR 2009)

CVPR

Discovering the underlying low-dimensional latent structure in high-dimensional perceptual observations (e.g., images, video) can, in many cases, greately improve performance in recognition and tracking. However, non-linear dimensionality reduction methods are often susceptible to local minima and perform poorly when initialized far from the global optimum, even when the intrinsic dimensionality is known a priori. In this work we introduce a prior over the dimensionality of the latent space that penalizes high dimensional spaces, and simultaneously optimize both the latent space and its intrinsic dimensionality in a continuous fashion. Ad-hoc initialization schemes are unnecessary with our approach; we initialize the latent space to the observation space and automatically infer the latent dimensionality. We report results applying our prior to various probabilistic non-linear dimensionality reduction tasks, and show that our method can outperform graph-based dimensionality reduction techniques as well as previously suggested initialization strategies. We demonstrate the effectiveness of our approach when tracking and classifying human motion.

@INPROCEEDINGS{Geiger09b,
 author = {Andreas Geiger and Raquel Urtasun and Trevor Darrell},
 title = {Rank Priors for Continuous Non-Linear Dimensionality Reduction},
 booktitle = {IEEE Computer Vision and Pattern Recognition (CVPR)},
 year = {2009},
 month = {June},
 address = {Miami, USA}
}


Monocular Road Mosaicing for Urban Environments
A. Geiger (IV 2009)

IV
IV

Marking-based lane recognition requires an unobstructed view onto the road. In practice however, heavy traffic often constrains the visual field, especially in urban scenarios such as urban crossroads. In this paper we present a novel approach to road mosaicing for dynamic environments. Our method is based on a multistage registration procedure and uses blending techniques. We show that under modest assumptions accurate registration is possible from monocular image sequences. We further demonstrate that fusing visual information from previous frames into the current view can greatly extend the camera's field of view.

@INPROCEEDINGS{Geiger09a,
 author = {Andreas Geiger},
 title = {Monocular Road Mosaicing for Urban Environments},
 booktitle = {IEEE Intelligent Vehicles Symposium (IV)},
 year = {2009},
 month = {June},
 address = {Xi'an, China}
}


Topologically-Constrained Latent Variable Models
R. Urtasun, D. J. Fleet, A. Geiger, J. Popovic, T. Darrell and N. D. Lawrence (ICML 2008)

ICML

In dimensionality reduction approaches, the data are typically embedded in a Euclidean latent space. However for some data sets this is inappropriate. For example, in human motion data we expect latent spaces that are cylindrical or a toroidal, that are poorly captured with a Euclidean space. In this paper, we present a range of approaches for embedding data in a non-Euclidean latent space. Our focus is the Gaussian Process latent variable model. In the context of human motion modeling this allows us to (a) learn models with interpretable latent directions enabling, for example, style/content separation, and (b) generalise beyond the data set enabling us to learn transitions between motion styles even though such transitions are not present in the data.

@INPROCEEDINGS{Urtasun08,
 author = {R. Urtasun and D. Fleet and A. Geiger and J. Popovic and T. Darrell and N. Lawrence},
 title = {Topologically-Constrained Latent Variable Models},
 booktitle = {International Conference on Machine Learning (ICML)},
 year = {2008},
 month = {July},
 address = {Helsinki, Finland}
}


An All-In-One Solution to Geometric and Photometric Calibration
J. Pilet, A. Geiger, P. Lagger, V. Lepetit and P. Fua (ISMAR 2006)

ISMAR


More information ...

We propose a fully automated approach to calibrating multiple cameras whose fields of view may not all overlap. Our technique only requires waving an arbitrary textured planar pattern in front of the cameras, which is the only manual intervention that is required. The pattern is then automatically detected in the frames where it is visible and used to simultaneously recover geometric and photometric camera calibration parameters. In other words, even a novice user can use our system to extract all the information required to add virtual 3D objects into the scene and light them convincingly. This makes it ideal for Augmented Reality applications and we distribute the code under a GPL license.

@INPROCEEDINGS{Pilet06,
 author = {J. Pilet and A. Geiger and P. Lagger and V. Lepetit and P. Fua},
 title = {An All-in-One Solution to Geometric and Photometric Calibration},
 booktitle = {IEEE International Symposium on Mixed and Augmented Reality (ISMAR)},
 year = {2006},
 month = {October},
 address = {Santa Barbara, USA}
}


Diploma thesis

Human Body Tracking with Rank Priors for Non-Linear Dimensionality Reduction
A. Geiger (MIT CSAIL 2008)


Student Research Project

Automatic Multiple Camera Calibration
A. Geiger (EPFL CVLab 2006)


(c) Andreas Geiger