2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Tracking People and Their Objects

Tobias Baumgartner, Dennis Mitzel, Bastian Leibe

2013 IEEE Conference on Computer Vision and Pattern Recognition > 3658 - 3665

Current pedestrian tracking approaches ignore important aspects of human behavior. Humans are not moving independently, but they closely interact with their environment, which includes not only other persons, but also different scene objects. Typical everyday scenarios include people moving in groups, pushing child strollers, or pulling luggage. In this paper, we propose a probabilistic approach for...

chapter

Tracking Human Pose by Tracking Symmetric Parts

Varun Ramakrishna, Takeo Kanade, Yaser Sheikh

2013 IEEE Conference on Computer Vision and Pattern Recognition > 3728 - 3735

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The human body is structurally symmetric. Tracking by detection approaches for human pose suffer from \emph{double counting}, where the same image evidence is used to explain two separate but symmetric parts, such as the left and right feet. Double counting, if left unaddressed can critically affect subsequent processes, such as action recognition, affordance estimation, and pose reconstruction. In...

chapter

Optimized Pedestrian Detection for Multiple and Occluded People

Sitapa Rujikietgumjorn, Robert T. Collins

2013 IEEE Conference on Computer Vision and Pattern Recognition > 3690 - 3697

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a quadratic unconstrained binary optimization (QUBO) framework for reasoning about multiple object detections with spatial overlaps. The method maximizes an objective function composed of unary detection confidence scores and pairwise overlap constraints to determine which overlapping detections should be suppressed, and which should be kept. The framework is flexible enough to handle the...

chapter

Detection- and Trajectory-Level Exclusion in Multiple Object Tracking

Anton Milan, Konrad Schindler, Stefan Roth

2013 IEEE Conference on Computer Vision and Pattern Recognition > 3682 - 3689

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

When tracking multiple targets in crowded scenarios, modeling mutual exclusion between distinct targets becomes important at two levels: (1) in data association, each target observation should support at most one trajectory and each trajectory should be assigned at most one observation per frame, (2) in trajectory estimation, two trajectories should remain spatially separated at all times to avoid...

chapter

Improving an Object Detector and Extracting Regions Using Superpixels

Guang Shu, Afshin Dehghan, Mubarak Shah

2013 IEEE Conference on Computer Vision and Pattern Recognition > 3721 - 3727

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose an approach to improve the detection performance of a generic detector when it is applied to a particular video. The performance of offline-trained objects detectors are usually degraded in unconstrained video environments due to variant illuminations, backgrounds and camera viewpoints. Moreover, most object detectors are trained using Haar-like features or gradient features but ignore...

chapter

Seeking the Strongest Rigid Detector

Rodrigo Benenson, Markus Mathias, Tinne Tuytelaars, Luc Van Gool

2013 IEEE Conference on Computer Vision and Pattern Recognition > 3666 - 3673

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The current state of the art solutions for object detection describe each class by a set of models trained on discovered sub-classes (so called "components"), with each model itself composed of collections of interrelated parts (deformable models). These detectors build upon the now classic Histogram of Oriented Gradients+linear SVM combo. Abstract In this paper we revisit some of the core...

chapter

Computationally Efficient Regression on a Dependency Graph for Human Pose Estimation

Kota Hara, Rama Chellappa

2013 IEEE Conference on Computer Vision and Pattern Recognition > 3390 - 3397

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a hierarchical method for human pose estimation from a single still image. In our approach, a dependency graph representing relationships between reference points such as body joints is constructed and the positions of these reference points are sequentially estimated by a successive application of multidimensional output regressions along the dependency paths, starting from the root node...

chapter

Sparse Output Coding for Large-Scale Visual Recognition

Bin Zhao, Eric P. Xing

2013 IEEE Conference on Computer Vision and Pattern Recognition > 3350 - 3357

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Many vision tasks require a multi-class classifier to discriminate multiple categories, on the order of hundreds or thousands. In this paper, we propose sparse output coding, a principled way for large-scale multi-class classification, by turning high-cardinality multi-class categorization into a bit-by-bit decoding problem. Specifically, sparse output coding is composed of two steps: efficient coding...

chapter

Motionlets: Mid-level 3D Parts for Human Motion Recognition

LiMin Wang, Yu Qiao, Xiaoou Tang

2013 IEEE Conference on Computer Vision and Pattern Recognition > 2674 - 2681

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper proposes \emph{motion let}, a mid-level and spatiotemporal part, for human motion recognition. Motion let can be seen as a tight cluster in motion and appearance space, corresponding to the moving process of different body parts. We postulate three key properties of motion let for action recognition: high motion saliency, multiple scale representation, and representative-discriminative...

chapter

Poselet Key-Framing: A Model for Human Activity Recognition

Michalis Raptis, Leonid Sigal

2013 IEEE Conference on Computer Vision and Pattern Recognition > 2650 - 2657

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we develop a new model for recognizing human actions. An action is modeled as a very sparse sequence of temporally local discriminative key frames - collections of partial key-poses of the actor(s), depicting key states in the action sequence. We cast the learning of key frames in a max-margin discriminative framework, where we treat key frames as latent variables. This allows us to...

chapter

Spatiotemporal Deformable Part Models for Action Detection

Yicong Tian, Rahul Sukthankar, Mubarak Shah

2013 IEEE Conference on Computer Vision and Pattern Recognition > 2642 - 2649

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deformable part models have achieved impressive performance for object detection, even on difficult image datasets. This paper explores the generalization of deformable part models from 2D images to 3D spatiotemporal volumes to better study their effectiveness for action detection in video. Actions are treated as spatiotemporal patterns and a deformable part model is generated for each action from...

chapter

A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching

Pradipto Das, Chenliang Xu, Richard F. Doell, Jason J. Corso

2013 IEEE Conference on Computer Vision and Pattern Recognition > 2634 - 2641

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The problem of describing images through natural language has gained importance in the computer vision community. Solutions to image description have either focused on a top-down approach of generating language through combinations of object detections and language models or bottom-up propagation of keyword tags from training images to test images through probabilistic or nearest neighbor techniques...

chapter

Multi-agent Event Detection: Localization and Role Assignment

Suha Kwak, Bohyung Han, Joon Hee Han

2013 IEEE Conference on Computer Vision and Pattern Recognition > 2682 - 2689

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a joint estimation technique of event localization and role assignment when the target video event is described by a scenario. Specifically, to detect multi-agent events from video, our algorithm identifies agents involved in an event and assigns roles to the participating agents. Instead of iterating through all possible agent-role combinations, we formulate the joint optimization problem...

chapter

Complex Event Detection via Multi-source Video Attributes

Zhigang Ma, Yi Yang, Zhongwen Xu, Shuicheng Yan, more

2013 IEEE Conference on Computer Vision and Pattern Recognition > 2627 - 2633

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Complex events essentially include human, scenes, objects and actions that can be summarized by visual attributes, so leveraging relevant attributes properly could be helpful for event detection. Many works have exploited attributes at image level for various applications. However, attributes at image level are possibly insufficient for complex event detection in videos due to their limited capability...

chapter

Cross-View Action Recognition via a Continuous Virtual Path

Zhong Zhang, Chunheng Wang, Baihua Xiao, Wen Zhou, more

2013 IEEE Conference on Computer Vision and Pattern Recognition > 2690 - 2697

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we propose a novel method for cross-view action recognition via a continuous virtual path which connects the source view and the target view. Each point on this virtual path is a virtual view which is obtained by a linear transformation of the action descriptor. All the virtual views are concatenated into an infinite-dimensional feature to characterize continuous changes from the source...

chapter

Least Soft-Threshold Squares Tracking

Dong Wang, Huchuan Lu, Ming-Hsuan Yang

2013 IEEE Conference on Computer Vision and Pattern Recognition > 2371 - 2378

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we propose a generative tracking method based on a novel robust linear regression algorithm. In contrast to existing methods, the proposed Least Soft-thresold Squares (LSS) algorithm models the error term with the Gaussian-Laplacian distribution, which can be solved efficiently. Based on maximum joint likelihood of parameters, we derive a LSS distance to measure the difference between...

chapter

Self-Paced Learning for Long-Term Tracking

James Steven Supancic III, Deva Ramanan

2013 IEEE Conference on Computer Vision and Pattern Recognition > 2379 - 2386

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We address the problem of long-term object tracking, where the object may become occluded or leave-the-view. In this setting, we show that an accurate appearance model is considerably more effective than a strong motion model. We develop simple but effective algorithms that alternate between tracking and learning a good appearance model given a track. We show that it is crucial to learn from the "right"...

chapter

Learning the Change for Automatic Image Cropping

Jianzhou Yan, Stephen Lin, Sing Bing Kang, Xiaoou Tang

2013 IEEE Conference on Computer Vision and Pattern Recognition > 971 - 978

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Image cropping is a common operation used to improve the visual quality of photographs. In this paper, we present an automatic cropping technique that accounts for the two primary considerations of people when they crop: removal of distracting content, and enhancement of overall composition. Our approach utilizes a large training set consisting of photos before and after cropping by expert photographers...

chapter

Globally Consistent Multi-label Assignment on the Ray Space of 4D Light Fields

Sven Wanner, Christoph Straehle, Bastian Goldluecke

2013 IEEE Conference on Computer Vision and Pattern Recognition > 1011 - 1018

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present the first variational framework for multi-label segmentation on the ray space of 4D light fields. For traditional segmentation of single images, features need to be extracted from the 2D projection of a three-dimensional scene. The associated loss of geometry information can cause severe problems, for example if different objects have a very similar visual appearance. In this work, we show...

chapter

Non-parametric Filtering for Geometric Detail Extraction and Material Representation

Zicheng Liao, Jason Rock, Yang Wang, David Forsyth

2013 IEEE Conference on Computer Vision and Pattern Recognition > 963 - 970

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Geometric detail is a universal phenomenon in real world objects. It is an important component in object modeling, but not accounted for in current intrinsic image works. In this work, we explore using a non-parametric method to separate geometric detail from intrinsic image components. We further decompose an image as albedo * (coarse-scale shading + shading detail). Our decomposition offers quantitative...

INFONA - science communication portal

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Tracking People and Their Objects

Tracking Human Pose by Tracking Symmetric Parts

Optimized Pedestrian Detection for Multiple and Occluded People

Detection- and Trajectory-Level Exclusion in Multiple Object Tracking

Improving an Object Detector and Extracting Regions Using Superpixels

Seeking the Strongest Rigid Detector

Computationally Efficient Regression on a Dependency Graph for Human Pose Estimation

Sparse Output Coding for Large-Scale Visual Recognition

Motionlets: Mid-level 3D Parts for Human Motion Recognition

Poselet Key-Framing: A Model for Human Activity Recognition

Spatiotemporal Deformable Part Models for Action Detection

A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching

Multi-agent Event Detection: Localization and Role Assignment

Complex Event Detection via Multi-source Video Attributes

Cross-View Action Recognition via a Continuous Virtual Path

Least Soft-Threshold Squares Tracking

Self-Paced Learning for Long-Term Tracking

Learning the Change for Automatic Image Cropping

Globally Consistent Multi-label Assignment on the Ray Space of 4D Light Fields

Non-parametric Filtering for Geometric Detail Extraction and Material Representation

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)