The recognition of places by using visual information in underwater environments is important when performing autonomous robotic exploration of the same area at different periods of time. It helps the robot know its location and make decisions accordingly. However, vision-based recognition of underwater places can be a very challenging task due to the inherent properties of such environments...
The task of visual relationship recognition (VRR) is recognizing multiple objects and their relationships in an image. A fundamental difficulty of this task is class-number scalability, since the number of possible relationships we need to consider grows combinatorially. Another difficulty is avoiding the output of semantically redundant relationships. To overcome these...
A visually impaired person faces daily difficulties in learning to recognize or differentiate objects during everyday activities, such as walking the streets or identifying dollar bills of different denominations; as a result, people with this type of disability cannot easily adapt to society. Visually impaired persons usually depend on someone...
We present an application of a Multiple Instance Learning (MIL) approach to image classification. In particular we focus on a recent MIL method for binary classification where the objective is to discriminate between positive and negative sets of points. Such sets are called bags and the points inside the bags are called instances. In the case of two classes of instances (positive and negative), a...
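The bag/instance structure described above can be made concrete with a small sketch. This illustrates the standard MIL assumption (a bag is positive if at least one of its instances is positive), not necessarily the specific binary-classification method the paper builds on; the names `classify_bag` and `is_positive` are illustrative.

```python
# Minimal sketch of the Multiple Instance Learning (MIL) setting:
# data comes as labeled bags of unlabeled instances, and under the
# standard MIL assumption a bag is positive iff it contains at least
# one positive instance.

def classify_bag(bag, instance_classifier):
    """Label a bag 1 if any of its instances is classified positive, else 0."""
    return int(any(instance_classifier(x) for x in bag))

# Toy instance classifier: a 1-D feature is "positive" above a threshold.
is_positive = lambda x: x > 0.5

positive_bag = [0.1, 0.7, 0.2]   # contains one positive instance
negative_bag = [0.1, 0.3, 0.2]   # all instances negative

print(classify_bag(positive_bag, is_positive))  # 1
print(classify_bag(negative_bag, is_positive))  # 0
```

In image classification, each image plays the role of a bag and its regions or patches play the role of instances, so only image-level labels are needed.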
Many of the existing methods for learning a joint embedding of images and text use only supervised information from paired images and their textual attributes. Taking advantage of the recent success of unsupervised learning in deep neural networks, we propose an end-to-end learning framework that is able to extract more robust multi-modal representations across domains. The proposed method combines representation...
Since the beginning of early civilizations, the social relationships derived from each individual have fundamentally formed the basis of social structure in our daily life. In the computer vision literature, much progress has been made in scene understanding, such as object detection and scene parsing. Recent research focuses on the relationships between objects based on their functionality and geometric relations...
Object detection, scene graph generation and region captioning, which are three scene understanding tasks at different semantic levels, are tied together: scene graphs are generated on top of objects detected in an image with their pairwise relationship predicted, while region captioning gives a language description of the objects, their attributes, relations and other context information. In this...
Engineering students conceptualize problems in diverse ways depending on how the problems are presented. In this study, we investigate how different representations of problems, such as images and sketches versus traditional word descriptions, allow students to recall information. Some students experience difficulties visualizing a concept when given a word problem while others do not...
The contribution of this paper is to bridge the gap in understanding the mathematical structure and the computational implementation of a convolutional neural network using a minimal model. The proposed minimal convolutional neural network is presented using a layering approach. This approach provides a clear understanding of the main mathematical operations in a convolutional neural network. Hence,...
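The layering idea described above can be sketched in plain Python. This is a generic illustration of the core CNN operations (convolution, nonlinearity, pooling) composed as layers, not the paper's actual minimal model; the kernel and input values are made up for the example.

```python
# Minimal layering of the core CNN operations: a "valid" 2-D convolution
# (really cross-correlation, as in most CNN libraries), a ReLU
# nonlinearity, and non-overlapping max pooling.

def conv2d(img, kernel):
    """Valid cross-correlation of a 2-D input with a 2-D kernel."""
    kh, kw = len(kernel), len(kernel[0])
    h, w = len(img) - kh + 1, len(img[0]) - kw + 1
    return [[sum(img[i + u][j + v] * kernel[u][v]
                 for u in range(kh) for v in range(kw))
             for j in range(w)] for i in range(h)]

def relu(fm):
    """Element-wise rectified linear unit."""
    return [[max(0.0, x) for x in row] for row in fm]

def max_pool(fm, size=2):
    """Non-overlapping max pooling with a size x size window."""
    return [[max(fm[i + u][j + v] for u in range(size) for v in range(size))
             for j in range(0, len(fm[0]) - size + 1, size)]
            for i in range(0, len(fm) - size + 1, size)]

img = [[1, 2, 0, 1],
       [0, 1, 3, 1],
       [2, 0, 1, 0],
       [1, 1, 0, 2]]
edge = [[1, -1]]                       # tiny horizontal-difference kernel
out = max_pool(relu(conv2d(img, edge)))
print(out)  # [[2], [2]]
```

Stacking these three functions is exactly the "layering" view: each layer is a plain function from one feature map to the next, which keeps the mathematics of each step explicit.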
Breast cancer (BC) is a deadly disease, killing millions of people every year. Developing an automated malignant BC detection system applied to patient imagery can help deal with this problem more efficiently, making diagnosis more scalable and less prone to error. No less importantly, such research can be extended to other types of cancer, making an even greater impact in helping to save lives...
Deep learning has brought a series of breakthroughs in image processing. Specifically, there are significant improvements in the application of deep learning techniques to food image classification. However, very little work has studied the classification of food ingredients. Therefore, this paper proposes a new framework, called DeepFood, which not only extracts rich and effective features...
An important goal of computer vision is to build systems that learn visual representations over time that can be applied to many tasks. In this paper, we investigate a vision-language embedding as a core representation and show that it leads to better cross-task transfer than standard multitask learning. In particular, the task of visual recognition is aligned to the task of visual question answering...
Real-world image recognition systems need to recognize tens of thousands of classes that constitute a plethora of visual concepts. The traditional approach of annotating thousands of images per class for training is infeasible in such a scenario, prompting the use of webly supervised data. This paper explores the training of image-recognition systems on large numbers of images and associated user...
We propose an attentive local feature descriptor suitable for large-scale image retrieval, referred to as DELF (DEep Local Feature). The new feature is based on convolutional neural networks, which are trained only with image-level annotations on a landmark image dataset. To identify semantically useful local features for image retrieval, we also propose an attention mechanism for keypoint selection,...
Pedestrian analysis plays a vital role in intelligent video surveillance and is a key component of security-centric computer vision systems. Although convolutional neural networks are remarkable at learning discriminative features from images, learning comprehensive features of pedestrians for fine-grained tasks remains an open problem. In this study, we propose a new attention-based...
In this work we propose a novel framework named Dual-Net aimed at learning more accurate representations for image recognition. Here, two parallel neural networks are coordinated to learn complementary features, and thus a wider network is constructed. Specifically, we logically divide an end-to-end deep convolutional neural network into two functional parts, i.e., feature extractor and image classifier...
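The coordination of two parallel branches described above can be sketched as follows. The branches, fusion, and classifier here are toy stand-ins chosen only to show the structure (two extractors whose complementary features are concatenated before a single classifier), not the paper's actual architecture; all function names are hypothetical.

```python
# Hedged sketch of the Dual-Net structure: two parallel feature
# extractors produce complementary features that are fused
# (concatenated) before a shared classifier.

def extractor_a(x):
    """Stand-in for the first branch's feature extractor."""
    return [v * 2.0 for v in x]

def extractor_b(x):
    """Stand-in for the second, complementary branch."""
    return [v + 1.0 for v in x]

def classifier(features):
    """Stand-in image classifier over the fused features."""
    return 1 if sum(features) > 0 else 0

def dual_net(x):
    # Concatenating the two feature sets yields the "wider" network.
    fused = extractor_a(x) + extractor_b(x)
    return classifier(fused)

print(dual_net([0.5, -0.2]))  # 1
```

The point of the structure is that the shared classifier sees both branches' features at once, so training can push the two extractors toward complementary rather than redundant representations.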
Recognizing how objects interact with each other is a crucial task in visual recognition. If we define the context of the interaction to be the objects involved, then most current methods can be categorized as either: (i) training a single classifier on the combination of the interaction and its context; or (ii) aiming to recognize the interaction independently of its explicit context. Both methods...
The aim of this paper is to investigate the use of oculography signals for the recognition of experts in visual arts. We focused our attention on the number of gaze transitions between characteristic regions of interest (ROIs) in an image. In the experiments we used oculographic data recorded at the Department of Experimental Psychology at the Catholic University of Lublin for 29 images and 34 users. The EM method was...
We consider the problem of fine-grained physical object recognition and introduce PharmaPack, a dataset containing 1000 unique pharma packages enrolled in a controlled environment using consumer mobile phones, as well as several recognition sets representing various scenarios. For performance evaluation, we extract two types of recently proposed local feature descriptors and aggregate them using popular...
This paper presents a novel method using Accelerated-KAZE (AKAZE) and Gist features for context-based semantic classification and recognition of indoor scenes by a vision-based mobile robot. Our method represents spatial relations among categories by mapping neighborhood units onto category maps using counter propagation networks (CPNs), while maintaining the sequential information of labels generated from...