Search results

Items from 1 to 20 out of 583 results

chapter

Deep Image Retrieval Applied on Kotenseki Ancient Japanese Literature

Chairath Sirirattanapol, Yusuke Matsui, Shin'ichi Satoh, Kuninori Matsuda, et al.

2017 IEEE International Symposium on Multimedia (ISM) > 495 - 499

Kotenseki is a collection of classical and ancient Japanese literature. It comprises image books that express Japanese stories by using comic drawings of different characters, such as humans, nature, and animals. To effectively store them for posterity, a search system is important. We propose an efficient CBIR system to assist users in easily accessing the information and have an enjoyable...

chapter

Deep affordance learning for single- and multiple-instance object detection

Jian-Gang Wang, Prabhu Shankar Mahendran, Eam-Khwang Teoh

TENCON 2017 - 2017 IEEE Region 10 Conference > 321 - 326

Affordance learning, in general, aims to identify the purpose, use, and ways to interact with an object, based on information gained from observing the object. Most existing affordance learning approaches assume the target object has been cropped individually from images. However, the object cannot always be easily separated from others due to occlusion or noise. Actually, two or more neighboring...

chapter

Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection

Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 1819 - 1828

In this paper, we investigate a weakly-supervised object detection framework. Most existing frameworks focus on using static images to learn object detectors. However, these detectors often fail to generalize to videos because of the existing domain shift. Therefore, we investigate learning these detectors directly from boring videos of daily activities. Instead of using bounding boxes, we explore...

chapter

[POSTER] Prevention of Visually Induced Motion Sickness Based on Dynamic Real-Time Content-Aware Non-salient Area Blurring

Guangyu Nie, Yue Liu, Yongtian Wang

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 75 - 78

This paper proposes an innovative method for reducing the visually induced motion sickness (MS) that occurs in a 3D immersive virtual environment (VE) by utilizing a flexible dynamic scene smoothing approach based on saliency analysis. A saliency model based on a fully convolutional network (FCN) is first trained to establish the saliency map, then the probability maps representing the salient information...

chapter

Crowdedness measuring system considering view angle of CCTV camera

Yewon Kim, Seongjoon Park, Hwangnam Kim

2017 International Conference on Information and Communication Technology Convergence (ICTC) > 1149 - 1152

In this paper, we propose a system to measure the crowdedness of a specific area using CCTV images. In the proposed system, objects are distinguished by using the Visual Background Extractor (ViBe) algorithm. The background that has not been removed by the ViBe is excluded from the object area using the opening technique. Then, the object area is assigned different weights based on the pixel positions...

chapter

Scene Graph Generation from Objects, Phrases and Region Captions

Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 1270 - 1279

Object detection, scene graph generation and region captioning, which are three scene understanding tasks at different semantic levels, are tied together: scene graphs are generated on top of objects detected in an image with their pairwise relationship predicted, while region captioning gives a language description of the objects, their attributes, relations and other context information. In this...

chapter

Transitive Invariance for Self-Supervised Visual Representation Learning

Xiaolong Wang, Kaiming He, Abhinav Gupta

2017 IEEE International Conference on Computer Vision (ICCV) > 1338 - 1347

Learning visual representations with self-supervised learning has become popular in computer vision. The idea is to design auxiliary tasks where labels are free to obtain. Most of these tasks end up providing data to learn specific kinds of invariance useful for recognition. In this paper, we propose to exploit different self-supervised approaches to learn representations invariant to (i) inter-instance...

chapter

Focusness guided salient object detection

Xiaolin Xiao, Yicong Zhou

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 3462 - 3466

Salient object detection aims to correctly highlight the most salient object(s) in an image. Combining fine-grained contrast prior with rough-grained object consistency, this paper proposes a Focusness Guided Salient object detection (FGS) algorithm. To obtain a clean and precise contrast map, FGS uses the focusness prior to guide the contrast map. Combining different saliency priors, FGS utilizes a unified...

chapter

Dynamic textures based target detection for PTZ camera sequences

M. Sami Zitouni, Harish Bhaskar, Andrzej Sluzek

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1328 - 1332

In this paper, a temporally iterative Gaussian Mixture Model (GMM) of Dynamic Texture (DT) for target detection using a moving PTZ camera is proposed. Camera movement in a PTZ sensor causes motion-based target detection techniques to fail for the periods affected by the scene change. This is because the whole scene is considered a representation of the target motion. When the camera is in motion,...

chapter

Object detection with sliding window in images including multiple similar objects

Jinsu Lee, Junseong Bang, Seong-Il Yang

2017 International Conference on Information and Communication Technology Convergence (ICTC) > 803 - 806

Given an image containing an object of interest, the object can be detected by comparing the feature points in the given image with those in a reference object image. In a case where the given image contains a large number of similar objects, the object of interest is difficult to detect. This is because the feature points in the given image are concentrated in some regions where each region has...

chapter

TALL: Temporal Activity Localization via Language Query

Jiyang Gao, Chen Sun, Zhenheng Yang, Ram Nevatia

2017 IEEE International Conference on Computer Vision (ICCV) > 5277 - 5285

This paper focuses on temporal localization of actions in untrimmed videos. Existing methods typically train classifiers for a pre-defined list of actions and apply them in a sliding window fashion. However, activities in the wild consist of a wide combination of actors, actions and objects; it is difficult to design a proper activity list that meets users’ needs. We propose to localize activities...

chapter

PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN

Hanwang Zhang, Zawlin Kyaw, Jinyang Yu, Shih-Fu Chang

2017 IEEE International Conference on Computer Vision (ICCV) > 4243 - 4251

We aim to tackle a novel vision task called Weakly Supervised Visual Relation Detection (WSVRD) to detect “subject-predicate-object” relations in an image with object relation groundtruths available only at the image level. This is motivated by the fact that it is extremely expensive to label the combinatorial relations between objects at the instance level. Compared to the extensively studied problem,...

chapter

Mutual Enhancement for Detection of Multiple Logos in Sports Videos

Yuan Liao, Xiaoqing Lu, Chengcui Zhang, Yongtao Wang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4856 - 4865

Detecting logo frequency and duration in sports videos provides sponsors with an effective way to evaluate their advertising efforts. However, general-purpose object detection methods cannot address all the challenges in sports videos. In this paper, we propose a mutual-enhanced approach that can improve the detection of a logo through the information obtained from other simultaneously occurring logos...

chapter

Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

Debidatta Dwibedi, Ishan Misra, Martial Hebert

2017 IEEE International Conference on Computer Vision (ICCV) > 1310 - 1319

A major impediment in rapidly deploying object detection models for instance detection is the lack of large annotated datasets. For example, finding a large labeled dataset containing instances in a particular kitchen is unlikely. Each new environment with new instances requires expensive data collection and annotation. In this paper, we propose a simple approach to generate large annotated instance...

chapter

Identification of autonomous landing sign for unmanned aerial vehicle based on faster regions with convolutional neural network

Junjie Chen, Xiren Miao, Hao Jiang, Jing Chen, et al.

2017 Chinese Automation Congress (CAC) > 2109 - 2114

In order to realize autonomous landing of the unmanned aerial vehicle (UAV) in power patrolling, a vision-based method using Faster Regions with Convolutional Neural Network (Faster R-CNN) for UAVs is studied. In this paper, we design a landing sign combining concentric circles and a pentagon, and propose a Faster R-CNN recognition algorithm which can be used to identify the target...

chapter

CoupleNet: Coupling Global Structure with Local Parts for Object Detection

Yousong Zhu, Chaoyang Zhao, Jinqiao Wang, Xu Zhao, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4146 - 4154

The region-based Convolutional Neural Network (CNN) detectors such as Faster R-CNN or R-FCN have already shown promising results for object detection by combining the region proposal subnetwork and the classification subnetwork together. Although R-FCN has achieved higher detection speed while keeping the detection performance, the global structure information is ignored by the position-sensitive...

chapter

Unsupervised Learning of Important Objects from First-Person Videos

Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi

2017 IEEE International Conference on Computer Vision (ICCV) > 1974 - 1982

A first-person camera, placed at a person's head, captures which objects are important to the camera wearer. Most prior methods for this task learn to detect such important objects from the manually labeled first-person data in a supervised fashion. However, important objects are strongly related to the camera wearer's internal state such as his intentions and attention, and thus, only the person...

chapter

Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection

Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 202 - 211

Fully convolutional neural networks (FCNs) have shown outstanding performance in many dense labeling problems. One key pillar of these successes is mining relevant information from features in convolutional layers. However, how to better aggregate multi-level convolutional feature maps for salient object detection is underexplored. In this work, we present Amulet, a generic aggregating multi-level...

chapter

What looks good with my sofa: Multimodal search engine for interior design

Ivona Tautkute, Aleksandra Mozejko, Wojciech Stokowiec, Tomasz Trzcinski, et al.

2017 Federated Conference on Computer Science and Information Systems (FedCSIS) > 1275 - 1282

In this paper, we propose a multi-modal search engine for interior design that combines visual and textual queries. The goal of our engine is to retrieve interior objects, e.g. furniture or wall clocks, that share visual and aesthetic similarities with the query. Our search engine allows the user to take a photo of a room and retrieve with a high recall a list of items identical or visually similar...

chapter

Comparative analysis of eyes detection on face thermal images

M. Naeem Hussien, Mohd-Haris Lye, Mohammad Faizal Ahmad Fauzi, Tan Ching Seong, et al.

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA) > 385 - 389

This paper presents the evaluation of visual features for the proposed two-eye detection method applied to thermal images. The two-eye region is used because of its distinctive pattern and to overcome the blurred and noisy characteristics of thermal images. Comparative performance analysis on three different features, which include Haar, Histogram of Oriented Gradients (HoG) and Local Binary...

Keywords:
VISUALIZATION
OBJECT DETECTION

Keywords

FEATURE EXTRACTION (265)
IMAGE COLOR ANALYSIS (133)
TRAINING (103)
IMAGE SEGMENTATION (102)
CAMERAS (97)
COMPUTER VISION (96)
PIXEL (95)
COMPUTATIONAL MODELING (88)
DATA MINING (72)
OBJECT RECOGNITION (67)
HISTOGRAMS (66)
HUMANS (62)
VIDEO SIGNAL PROCESSING (59)
DETECTORS (57)
TARGET TRACKING (51)
TRACKING (47)
IMAGE COLOUR ANALYSIS (45)
IMAGE MOTION ANALYSIS (45)
IMAGE CLASSIFICATION (43)
IMAGE EDGE DETECTION (43)
ROBUSTNESS (41)
SHAPE (40)
ROBOT VISION (38)
LEARNING (ARTIFICIAL INTELLIGENCE) (36)
SUPPORT VECTOR MACHINES (35)
VISUAL ATTENTION (33)
IMAGE SEQUENCES (32)
ACCURACY (31)
DATABASES (31)
IMAGE PROCESSING (30)
NOISE (29)
ROBOTS (28)
SEMANTICS (28)
IMAGE MATCHING (27)
IMAGE REPRESENTATION (27)
MATHEMATICAL MODEL (27)
OBJECT TRACKING (27)
VISUAL TRACKING (26)
MOBILE ROBOTS (25)
ALGORITHM DESIGN AND ANALYSIS (24)
THREE DIMENSIONAL DISPLAYS (24)
VIDEO SURVEILLANCE (24)
IMAGE RECOGNITION (23)
SEARCH PROBLEMS (23)
SURVEILLANCE (23)
KERNEL (22)
IMAGE RESOLUTION (21)
SALIENCY MAP (21)
CLASSIFICATION ALGORITHMS (20)
ESTIMATION (20)
FACE (20)
STREAMING MEDIA (20)
CLUSTERING ALGORITHMS (19)
PROPOSALS (19)
ADAPTATION MODEL (18)
COLOR (18)
IMAGE FUSION (18)
PATTERN CLUSTERING (18)
VIDEOS (18)
VISUAL PERCEPTION (18)
DISTANCE MEASUREMENT (16)
IMAGE RETRIEVAL (16)
LIGHTING (16)
ROBOT SENSING SYSTEMS (16)
SALIENCY DETECTION (16)
CONFERENCES (15)
GRAPH THEORY (15)
PARTICLE FILTERING (NUMERICAL METHODS) (15)
REAL TIME SYSTEMS (15)
ENTROPY (14)
EDGE DETECTION (13)
ELECTROENCEPHALOGRAPHY (13)
FILTERING THEORY (13)
NEURONS (13)
PROBABILITY (13)
SENSOR FUSION (13)
TRAJECTORY (13)
TRANSFORMS (13)
VISUAL SALIENCY (13)
BOOSTING (12)
PARTICLE FILTER (12)
SALIENT OBJECT DETECTION (12)
BRAIN MODELING (11)
COMPUTERS (11)
CONTEXT (11)
FACE RECOGNITION (11)
GAUSSIAN PROCESSES (11)
HUMAN VISUAL SYSTEM (11)
IMAGE TEXTURE (11)
SENSORS (11)
TARGET DETECTION (11)
CLUTTER (10)
COMPUTER ARCHITECTURE (10)
EQUATIONS (10)
HIDDEN MARKOV MODELS (10)
MOTION DETECTION (10)
MOTION ESTIMATION (10)
MULTIMEDIA COMMUNICATION (10)
