Search results

chapter

Improvement of visual servoing tasks by underwater image enhancement

Diego Cesar, Sylvain Joyeux, Marco Reis, Andre Conceicao, more

OCEANS 2017 – Anchorage > 1 - 6

OCEANS 2017 - Anchorage

Underwater image formation is degraded by several factors, which causes the ocean to be a challenging environment for image processing. This paper aims to improve the visual servoing capability of an autonomous underwater vehicle by using pre-processing algorithms to improve the image quality. We used artificial fiducial markers to feed the visual controller. Therefore, three different methods for...

chapter

Survey of Visual Feature Extraction Algorithms in a Mars-like Environment

Martin Oelsch, Dominik Van Opdenbosch, Eckehard Steinbach

2017 IEEE International Symposium on Multimedia (ISM) > 322 - 325

2017 IEEE International Symposium on Multimedia (ISM)

This paper presents a performance comparison of several state-of-the-art visual feature extraction algorithms when applied in a poorly-structured environment as found on the planet Mars. So far, no systematic evaluation of feature extraction algorithms in extraterrestrial environments is available. The algorithms in this paper are evaluated using the Devon Island dataset which is said to have one...

chapter

Summarization of News Videos Considering the Consistency of Auditory and Visual Contents

Ichiro Ide, Ye Zhang, Ryunosuke Tanishige, Keisuke Doman, more

2017 IEEE International Symposium on Multimedia (ISM) > 193 - 199

2017 IEEE International Symposium on Multimedia (ISM)

Since news videos are valuable sources of multimedia information on real-world events, there is a demand for viewing them efficiently. However, there is a problem that summarization methods based on auditory contents do not take into account the visual contents. In the case of news videos, due to its presentation style where audio contents and visual contents do not necessarily come from the same...

chapter

Multi-Object Model-Free Tracking with Joint Appearance and Motion Inference

Chongyu Liu, Rui Yao, S. Hamid Rezatofighi, Ian Reid, more

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Multi-object model-free tracking is challenging because the tracker is not aware of the objects' type (not allowed to use object detectors), and needs to distinguish one object from background as well as other similar objects. Most existing methods keep updating their appearance model individually for each target, and their performance is hampered by sudden appearance change and/or occlusion. We propose...

chapter

An unsupervised machine learning algorithm for visual target identification in the context of a robotics competition

Camila Barbosa, Orivaldo Santana, Bruno Silva

2017 Latin American Robotics Symposium (LARS) and 2017 Brazilian Symposium on Robotics (SBR) > 1 - 6

2017 Latin American Robotics Symposium (LARS) and 2017 Brazilian Symposium on Robotics (SBR)

Computer Vision and Machine Learning are the key to develop autonomous robots. While engaged with a IEEE Open Challenge, in which the robots need to recognize a miniature of a cow, we saw a solution in these areas. The main contribution of this paper is the algorithm implemented to identify and follow a known object, the miniature of a cow. We are constructing an application based on Image Processing...

chapter

Weakly-Supervised Learning of Visual Relations

Julia Peyre, Ivan Laptev, Cordelia Schmid, Josef Sivic

2017 IEEE International Conference on Computer Vision (ICCV) > 5189 - 5198

2017 IEEE International Conference on Computer Vision (ICCV)

This paper introduces a novel approach for modeling visual relations between pairs of objects. We call relation a triplet of the form (subject; predicate; object) where the predicate is typically a preposition (eg. ’under’, ’in front of’) or a verb (’hold’, ’ride’) that links a pair of objects (subject; object). Learning such relations is challenging as the objects have different spatial configurations...

chapter

Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection

Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1819 - 1828

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we investigate a weakly-supervised object detection framework. Most existing frameworks focus on using static images to learn object detectors. However, these detectors often fail to generalize to videos because of the existing domain shift. Therefore, we investigate learning these detectors directly from boring videos of daily activities. Instead of using bounding boxes, we explore...

chapter

Areas of Attention for Image Captioning

Marco Pedersoli, Thomas Lucas, Cordelia Schmid, Jakob Verbeek

2017 IEEE International Conference on Computer Vision (ICCV) > 1251 - 1259

2017 IEEE International Conference on Computer Vision (ICCV)

We propose “Areas of Attention”, a novel attentionbased model for automatic image captioning. Our approach models the dependencies between image regions, caption words, and the state of an RNN language model, using three pairwise interactions. In contrast to previous attentionbased approaches that associate image regions only to the RNN state, our method allows a direct association between caption...

chapter

Clustering-based threshold estimation for vortex extraction and visualization

Kavya Padmesh, Simon Ferrari, Yaoping Hu, Robert J. Martinuzzi

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 677 - 682

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Research efforts have been devoted to extraction and visualization of vortices in an unsteady (turbulent) flow. Characterizing the behaviors of the flow, vortices are identifiable as regions using a vortex detector known as the lambda2-criterion. Isosurface visualization renders vortex regions based on a chosen isovalue. However, it is highly challenging to choose one isovalue suitable for visualizing...

chapter

Epipolar based light field key-location detector

Jose Abecasis Teixeira, Catarina Brites, Fernando Pereira, Joao Ascenso

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

Nowadays, visual features play a key role, as they can provide a concise representation of visual data that is efficient for multiple tasks, notably content retrieval and object recognition. In parallel, visual sensors have been improving, targeting richer acquisitions of the light in a visual scene. In this context, the so-called light field cameras, which have recently emerged, are able to go beyond...

chapter

Mid-level Image Representation for Fruit Fly Identification (Diptera: Tephritidae)

Matheus Macedo Leonardo, Sandra Avila, Roberto A. Zucchi, Fabio A. Faria

2017 IEEE 13th International Conference on e-Science (e-Science) > 202 - 209

2017 IEEE 13th International Conference on e-Science (e-Science)

Fruit flies are of huge biological and economic importance for the farming of different countries in the World, especially for Brazil. Brazil is the third largest fruit producer in the world with 44 million tons in 2016. The direct and indirect losses caused by fruit flies can exceed USD 2 billion, putting these pests as one of the biggest problems of the world agriculture. In Brazil, it is estimated...

chapter

A visual method tor the detection of available parking slots

Jian-Yu Chen, Chih-Ming Hsu

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 2980 - 2985

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Visual sensors are widely used in automatic parking systems, so this paper proposes an algorithm for the visual detection of available parking slots. The proposed system consists of two stages: parking slot recognition and slot occupancy classification. The parking slot recognition stage generates parking slots using the corner features of parking slot markings. The slot occupancy classification stage...

chapter

Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge

Ryota Hinami, Tao Mei, Shin'ichi Satoh

2017 IEEE International Conference on Computer Vision (ICCV) > 3639 - 3647

2017 IEEE International Conference on Computer Vision (ICCV)

This paper addresses the problem of joint detection and recounting of abnormal events in videos. Recounting of abnormal events, i.e., explaining why they are judged to be abnormal, is an unexplored but critical task in video surveillance, because it helps human observers quickly judge if they are false alarms or not. To describe the events in the human-understandable form for event recounting, learning...

chapter

Phrase Localization and Visual Relationship Detection with Comprehensive Image-Language Cues

Bryan A. Plummer, Arun Mallya, Christopher M. Cervantes, Julia Hockenmaier, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1946 - 1955

2017 IEEE International Conference on Computer Vision (ICCV)

This paper presents a framework for localization or grounding of phrases in images using a large collection of linguistic and visual cues. We model the appearance, size, and position of entity bounding boxes, adjectives that contain attribute information, and spatial relationships between pairs of entities connected by verbs or prepositions. Special attention is given to relationships between people...

chapter

Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

Debidatta Dwibedi, Ishan Misra, Martial Hebert

2017 IEEE International Conference on Computer Vision (ICCV) > 1310 - 1319

2017 IEEE International Conference on Computer Vision (ICCV)

A major impediment in rapidly deploying object detection models for instance detection is the lack of large annotated datasets. For example, finding a large labeled dataset containing instances in a particular kitchen is unlikely. Each new environment with new instances requires expensive data collection and annotation. In this paper, we propose a simple approach to generate large annotated instance...

chapter

Soft Proposal Networks for Weakly Supervised Object Localization

Yi Zhu, Yanzhao Zhou, Qixiang Ye, Qiang Qiu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1859 - 1868

2017 IEEE International Conference on Computer Vision (ICCV)

Weakly supervised object localization remains challenging, where only image labels instead of bounding boxes are available during training. Object proposal is an effective component in localization, but often computationally expensive and incapable of joint optimization with some of the remaining modules. In this paper, to the best of our knowledge, we for the first time integrate weakly supervised...

chapter

Enhanced target representation for moving objects classification

Tin Tin Yu, Nu War

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 3

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE)

Instead of using HOG feature on cells or blocks, the extraction of HOG features on corner points is proposed for multiple object visual tracking system in which single or multiple moving objects could be classified. Background subtraction and extraction of corner feature are applied to track and classify the moving objects. Firstly, moving objects will be detected in the form of regions from background...

chapter

CoupleNet: Coupling Global Structure with Local Parts for Object Detection

Yousong Zhu, Chaoyang Zhao, Jinqiao Wang, Xu Zhao, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4146 - 4154

2017 IEEE International Conference on Computer Vision (ICCV)

The region-based Convolutional Neural Network (CNN) detectors such as Faster R-CNN or R-FCN have already shown promising results for object detection by combining the region proposal subnetwork and the classification subnetwork together. Although R-FCN has achieved higher detection speed while keeping the detection performance, the global structure information is ignored by the position-sensitive...

chapter

WordSup: Exploiting Word Annotations for Character Based Text Detection

Han Hu, Chengquan Zhang, Yuxuan Luo, Yuzhuo Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4950 - 4959

2017 IEEE International Conference on Computer Vision (ICCV)

Imagery texts are usually organized as a hierarchy of several visual elements, i.e. characters, words, text lines and text blocks. Among these elements, character is the most basic one for various languages such as Western, Chinese, Japanese, mathematical expression and etc. It is natural and convenient to construct a common text detection engine based on character detectors. However, training character...

chapter

Towards real-time motion estimation in high-definition video based on points of interest

Petr Pulc, Martin Holena

2017 Federated Conference on Computer Science and Information Systems (FedCSIS) > 67 - 70

2017 Federated Conference on Computer Science and Information Systems (FedCSIS)

Currently used motion estimation is usually based on a computation of optical flow from individual images or short sequences. As these methods do not require an extraction of the visual description in points of interest, correspondence can be deduced only by the position of such points. In this paper, we propose an alternative motion estimation method solely using a binary visual descriptor. By tuning...

INFONA - science communication portal

Search results

Improvement of visual servoing tasks by underwater image enhancement

Survey of Visual Feature Extraction Algorithms in a Mars-like Environment

Summarization of News Videos Considering the Consistency of Auditory and Visual Contents

Multi-Object Model-Free Tracking with Joint Appearance and Motion Inference

An unsupervised machine learning algorithm for visual target identification in the context of a robotics competition

Weakly-Supervised Learning of Visual Relations

Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection

Areas of Attention for Image Captioning

Clustering-based threshold estimation for vortex extraction and visualization

Epipolar based light field key-location detector

Mid-level Image Representation for Fruit Fly Identification (Diptera: Tephritidae)

A visual method tor the detection of available parking slots

Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge

Phrase Localization and Visual Relationship Detection with Comprehensive Image-Language Cues

Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

Soft Proposal Networks for Weakly Supervised Object Localization

Enhanced target representation for moving objects classification

CoupleNet: Coupling Global Structure with Local Parts for Object Detection

WordSup: Exploiting Word Annotations for Character Based Text Detection

Towards real-time motion estimation in high-definition video based on points of interest

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options