Search results

chapter

Perceptual Analysis of Perspective Projection for Viewport Rendering in 360° Images

Falah Jabar, Joao Ascenso, Maria Paula Queluz

2017 IEEE International Symposium on Multimedia (ISM) > 53 - 60

2017 IEEE International Symposium on Multimedia (ISM)

Omnidirectional, also referred to as 360º, visual content provides an immersive experience since it allows users to view a visual scene from different directions. The overall content typically covers a full sphere, and omnidirectional videos or images are processed to obtain a projection on a 2D plane of a fraction of the sphere (aka viewport), which is shown to the user. Therefore, users can look...

chapter

Robust and Fast Object Tracking for Challenging 360-degree Videos

Ahmad Delforouzi, Marcin Grzegorzek

2017 IEEE International Symposium on Multimedia (ISM) > 274 - 277

2017 IEEE International Symposium on Multimedia (ISM)

The task of object tracking in rectangular videos has been addressed in recent years by many researchers, where each method tries to propose a solution for a special challenge. Handling a variety of challenging situation of object tracking in 360-degree videos is still an unsolved problem and needs to be more considered. In the real world, the challenging situations include moving camera, high-resolution...

chapter

Fully Automatic Horizon Estimation for Surveillance Cameras

Vojtech Bartl, Adam Herout

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

This paper deals with automatic estimation of the horizon in videos from fixed surveillance cameras. The proposed algorithm is fully automatic in the sense that no user input is needed per-camera and it works with various scenes (indoor, outdoor, traffic, pedestrian, livestock, etc.). The algorithm detects moving objects, tracks them in time, assesses some of their geometric properties related to...

chapter

Viewpoint Invariant RGB-D Human Action Recognition

Jain Liu, Naveed Akhtar, Ajmal Mian

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Viewpoint variation is a major challenge in video- based human action recognition. We exploit the simultaneous RGB and Depth sensing of RGB-D cameras to address this problem. Our technique capitalizes on the complementary spatio-temporal information in RGB and Depth frames of the RGB-D videos to achieve viewpoint invariant action recognition. We extract view invariant features from the dense trajectories...

chapter

Learning Action Recognition Model from Depth and Skeleton Videos

Hossein Rahmani, Mohammed Bennamoun

2017 IEEE International Conference on Computer Vision (ICCV) > 5833 - 5842

2017 IEEE International Conference on Computer Vision (ICCV)

Depth sensors open up possibilities of dealing with the human action recognition problem by providing 3D human skeleton data and depth images of the scene. Analysis of human actions based on 3D skeleton data has become popular recently, due to its robustness and view-invariant representation. However, the skeleton alone is insufficient to distinguish actions which involve human-object interactions...

chapter

Drone-Based Object Counting by Spatially Regularized Regional Proposal Network

Meng-Ru Hsieh, Yen-Liang Lin, Winston H. Hsu

2017 IEEE International Conference on Computer Vision (ICCV) > 4165 - 4173

2017 IEEE International Conference on Computer Vision (ICCV)

Existing counting methods often adopt regression-based approaches and cannot precisely localize the target objects, which hinders the further analysis (e.g., high-level understanding and fine-grained classification). In addition, most of prior work mainly focus on counting objects in static environments with fixed cameras. Motivated by the advent of unmanned flying vehicles (i.e., drones), we are...

chapter

Egocentric Gesture Recognition Using Recurrent 3D Convolutional Neural Networks with Spatiotemporal Transformer Modules

Congqi Cao, Yifan Zhang, Yi Wu, Hanqing Lu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 3783 - 3791

2017 IEEE International Conference on Computer Vision (ICCV)

Gesture is a natural interface in interacting with wearable devices such as VR/AR helmet and glasses. The main challenge of gesture recognition in egocentric vision arises from the global camera motion caused by the spontaneous head movement of the device wearer. In this paper, we address the problem by a novel recurrent 3D convolutional neural network for end-to-end learning. We specially design...

chapter

[POSTER] AirGestAR: Leveraging Deep Learning for Complex Hand Gestural Interaction with Frugal AR Devices

Varun Jain, Ramakrishna Perla, Ramya Hebbalaguppe

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 235 - 239

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct)

Hand gestures provide a natural and an intuitive way of user interaction in AR/VR applications. However, the most popular and commercially available devices such as the Google Cardboard and Wearality1 still employ only primitive modes of interaction such as the magnetic trigger, conductive lever and have limited user-input capability. The truly instinctual gestures work only with inordinately priced...

chapter

[POSTER] A Benchmark Dataset for 6DoF Object Pose Tracking

Po-Chen Wu, Yueh-Ying Lee, Hung-Yu Tseng, Hsuan-I Ho, more

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 186 - 191

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct)

Accurately tracking the six degree-of-freedom pose of an object in real scenes is an important task in computer vision and augmented reality with numerous applications. Although a variety of algorithms for this task have been proposed, it remains difficult to evaluate existing methods in the literature as oftentimes different sequences are used and no large benchmark datasets close to realworld scenarios...

chapter

[POSTER] Reactive Displays for Virtual Reality

G S S Srinivas Rao, Neeraj Thakur, Vinay Namboodiri

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 63 - 68

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct)

The feeling of presence in virtual reality has enabled a large number of applications. These applications typically deal with 360° content. However, a large amount of existing content is available in terms of images and videos i.e 2D content. Unfortunately, these do not react to the viewer's position or motion when viewed through a VR HMD. Thus in this work, we propose reactive displays for VR which...

chapter

A new method for measuring the speed characteristics of high voltage circuit breaker based on machine vision algorithm

Jinqiu Deng, Guogang Zhang, Yingsan Geng, Jianhua Wang, more

2017 4th International Conference on Electric Power Equipment - Switching Technology (ICEPE-ST) > 837 - 842

2017 4th International Conference on Electric Power Equipment - Switching Technology (ICEPE-ST)

The speed characteristic of moving contact is vital to the high voltage circuit breakers. But due to the contacts of high voltage circuit breaker are completely isolated by arc chamber, the common measurement methods have some disadvantages. In this paper, a new method for measuring the speed characteristics based on machine vision algorithm is developed. This noninvasive method just needs to draw...

chapter

Jointly Recognizing Object Fluents and Tasks in Egocentric Videos

Yang Liu, Ping Wei, Song-Chun Zhu

2017 IEEE International Conference on Computer Vision (ICCV) > 2943 - 2951

2017 IEEE International Conference on Computer Vision (ICCV)

This paper addresses the problem of jointly recognizing object fluents and tasks in egocentric videos. Fluents are the changeable attributes of objects. Tasks are goal-oriented human activities which interact with objects and aim to change some attributes of the objects. The process of executing a task is a process to change the object fluents over time. We propose a hierarchical model to represent...

chapter

Deformable block-based motion estimation in omnidirectional image sequences

Francesca De Simone, Pascal Frossard, Neil Birkbeck, Balu Adsumilli

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

This paper presents an extension of block-based motion estimation for omnidirectional videos, based on a translational object motion model that accounts for the spherical geometry of the imaging system. We use this model to design a new algorithm to perform block matching in sequences of panoramic frames that are the result of the equirectangular projection. Experimental results demonstrate that significant...

chapter

Enhancing viewability for first-person videos based on a human perception model

Biao Ma, Amy R. Reibman

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

First-person videos (FPVs) captured by wearable cameras have undesired shakiness because of fast changing views. When existing video stabilization techniques are applied, FPVs are transformed into cinematographic videos, losing the First-person motion information (FPMI) such as the recorder's interests and actions. We propose a system that can enhance viewability of FPVs by stabilizing them while...

chapter

Moving object detection using background subtraction for a moving camera with pronounced parallax

Yifan Zhou, Simon Maskell

2017 Sensor Data Fusion: Trends, Solutions, Applications (SDF) > 1 - 6

2017 Sensor Data Fusion: Trends, Solutions, Applications (SDF)

This paper proposes a moving object detection algorithm which can handle videos taken by a moving camera in the presence of pronounced parallax. The paper considers the idea that objects in a image can be considered to be spatially distributed across multiple planes, the movement of each of which can be estimated using a Visual Odometry (VO) algorithm. For each plane, a Homography matrix between consecutive...

chapter

Recurrent Color Constancy

Yanlin Qian, Ke Chen, Jarno Nikkanen, Joni-Kristian Kamarainen, more

2017 IEEE International Conference on Computer Vision (ICCV) > 5459 - 5467

2017 IEEE International Conference on Computer Vision (ICCV)

We introduce a novel formulation of temporal color constancy which considers multiple frames preceding the frame for which illumination is estimated. We propose an end-to-end trainable recurrent color constancy network – the RCC-Net – which exploits convolutional LSTMs and a simulated sequence to learn compositional representations in space and time. We use a standard single frame color constancy...

chapter

FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras

Shanghang Zhang, Guanhang Wu, Joao P. Costeira, Jose M. F. Moura

2017 IEEE International Conference on Computer Vision (ICCV) > 3687 - 3696

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we develop deep spatio-temporal neural networks to sequentially count vehicles from low quality videos captured by city cameras (citycams). Citycam videos have low resolution, low frame rate, high occlusion and large perspective, making most existing methods lose their efficacy. To overcome limitations of existing methods and incorporate the temporal information of traffic video, we...

chapter

Mutual Enhancement for Detection of Multiple Logos in Sports Videos

Yuan Liao, Xiaoqing Lu, Chengcui Zhang, Yongtao Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4856 - 4865

2017 IEEE International Conference on Computer Vision (ICCV)

Detecting logo frequency and duration in sports videos provides sponsors an effective way to evaluate their advertising efforts. However, general-purposed object detection methods cannot address all the challenges in sports videos. In this paper, we propose a mutual-enhanced approach that can improve the detection of a logo through the information obtained from other simultaneously occurred logos...

chapter

Anticipating Daily Intention Using On-wrist Motion Triggered Sensing

Tz-Ying Wu, Ting-An Chien, Cheng-Sheng Chan, Chan-Wei Hu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 48 - 56

2017 IEEE International Conference on Computer Vision (ICCV)

Anticipating human intention by observing one’s actions has many applications. For instance, picking up a cellphone, then a charger (actions) implies that one wants to charge the cellphone (intention) (Fig. 1). By anticipating the intention, an intelligent system can guide the user to the closest power outlet. We propose an on-wrist motion triggered sensing system for anticipating daily intentions,...

chapter

Automatic Content-Aware Projection for 360° Videos

Yeong Won Kim, Chang-Ryeol Lee, Dae-Yong Cho, Yong Hoon Kwon, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4753 - 4761

2017 IEEE International Conference on Computer Vision (ICCV)

To watch 360° videos on normal 2D displays, we need to project the selected part of the 360° image onto the 2D display plane. In this paper, we propose a fully-automated framework for generating content-aware 2D normal-view perspective videos from 360° videos. Especially, we focus on the projection step preserving important image contents and reducing image distortion. Basically, our projection method...

INFONA - science communication portal

Search results

Perceptual Analysis of Perspective Projection for Viewport Rendering in 360° Images

Robust and Fast Object Tracking for Challenging 360-degree Videos

Fully Automatic Horizon Estimation for Surveillance Cameras

Viewpoint Invariant RGB-D Human Action Recognition

Learning Action Recognition Model from Depth and Skeleton Videos

Drone-Based Object Counting by Spatially Regularized Regional Proposal Network

Egocentric Gesture Recognition Using Recurrent 3D Convolutional Neural Networks with Spatiotemporal Transformer Modules

[POSTER] AirGestAR: Leveraging Deep Learning for Complex Hand Gestural Interaction with Frugal AR Devices

[POSTER] A Benchmark Dataset for 6DoF Object Pose Tracking

[POSTER] Reactive Displays for Virtual Reality

A new method for measuring the speed characteristics of high voltage circuit breaker based on machine vision algorithm

Jointly Recognizing Object Fluents and Tasks in Egocentric Videos

Deformable block-based motion estimation in omnidirectional image sequences

Enhancing viewability for first-person videos based on a human perception model

Moving object detection using background subtraction for a moving camera with pronounced parallax

Recurrent Color Constancy

FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras

Mutual Enhancement for Detection of Multiple Logos in Sports Videos

Anticipating Daily Intention Using On-wrist Motion Triggered Sensing

Automatic Content-Aware Projection for 360° Videos

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options