The recognition of places using visual information in underwater environments is important when performing autonomous robotic exploration of the same area at different periods of time. It helps the robot to know its location and make decisions accordingly. However, vision-based recognition of underwater places can be a very challenging task due to the inherent properties of these environments...
Since news videos are valuable sources of multimedia information on real-world events, there is a demand for viewing them efficiently. However, summarization methods based on auditory content do not take the visual content into account. In the case of news videos, due to their presentation style, audio content and visual content do not necessarily come from the same...
Blockly is an open-source library that makes it easy to add block-based visual programming to an app. It is designed to be flexible and supports a large set of features for different applications. It has been used for programming animated characters on a screen, creating story scripts, controlling robots, and even generating legal documents. But Blockly is not itself a language; developers who use...
Recognizing arbitrary objects in the wild has been a challenging problem due to the limitations of existing classification models and datasets. In this paper, we propose a new task that aims at parsing scenes with a large and open vocabulary, and we explore several evaluation metrics for this problem. Our approach is a joint embedding framework over image pixels and word concepts, where word concepts...
Among studies on information recommendation tailored to individuals, many consider personal preference, but few consider the user's degree of specialization. Therefore, in this study, we aim to propose an information recommendation method suited to an individual's degree of specialization. In this paper, as a first stage, we clarify the difference between the viewpoints of experts...
The semantic gap, i.e. the difference between low-level image features and their high-level semantics, has attracted great interest over the last two decades. This paper deals with this problem and proposes a hybrid approach to learn image semantic concepts for modeling visual features in the discriminative learning stage. It combines the advantages of human-in-the-loop and discriminative...
With the tremendous advances made by Convolutional Neural Networks (ConvNets) on object recognition, we can now easily obtain adequately reliable machine-labeled annotations from predictions by off-the-shelf ConvNets. In this work, we present an abstraction-memory-based framework for few-shot learning, building upon machine-labeled image annotations. Our method takes large-scale machine-annotated...
This paper focuses on a novel and challenging vision task, dense video captioning, which aims to automatically describe a video clip with multiple informative and diverse caption sentences. The proposed method is trained without explicit annotation of fine-grained sentence to video region-sequence correspondence, but is only based on weak video-level sentence annotations. It differs from existing...
The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses...
Attribute-based recognition models, due to their impressive performance and their ability to generalize well to novel categories, have been widely adopted for many computer vision applications. However, usually both the attribute vocabulary and the class-attribute associations have to be provided manually by domain experts or a large number of annotators. This is very costly and not necessarily optimal...
We study the problem of answering questions about images in the harder setting where the test questions and corresponding images contain novel objects that were not queried about in the training data. Such a setting is inevitable in the real world: owing to the heavy-tailed distribution of visual categories, some objects would not be annotated in the training set. We show...
Recognizing humans' emotions may be crucial in certain applications involving, e.g., human-computer interaction, monitoring of the elderly, or understanding the affective state of learners during a course. To this end, and depending on the application and the environment, one may use physiological parameters (e.g., heart rate, brain activity), which are typically obtrusive, or analyze other modalities...
The sense of touch is probably the most complex human sense, because it involves a very large number of sensory receptors spread over the whole body, and takes at the same time full advantage of the human nervous system complexity and power. Although this complexity enables us to perceive the world around us and interact with it, it is also a great source of variability when it comes to controlling...
Our work builds upon Visual Teach & Repeat 2 (VT&R2): a vision-in-the-loop autonomous navigation system that enables the rapid construction of route networks, safely built through operator-controlled driving. Added routes can be followed autonomously using visual localization. To enable long-term operation that is robust to appearance change, its Multi-Experience Localization (MEL) leverages...
Image caption generation has become a rising topic in computer vision and artificial intelligence. To address the problem of stiff descriptions, we intend to extract richer features using a convolutional neural network (CNN). We consequently propose a neural and probabilistic framework that combines a CNN with a special form of recurrent neural network (RNN) to produce an end-to-end image...
We present an illumination-robust visual localization algorithm for Astrobee, a free-flying robot designed to autonomously navigate on the International Space Station (ISS). Astrobee localizes with a monocular camera and a pre-built sparse map composed of natural visual features. Astrobee must perform tasks not only during the day, but also at night when the ISS lights are dimmed. However, the localization...
The shape and color of visual identity are the most important factors in the visualization of trademarks, which exert a far-reaching influence on the establishment of corporate image and brand image. Under the impact of globalization, brands have become a major factor in keeping a foothold in the consumer market, and sales are no longer limited to certain regions. Today's ideological trend of design...
Motivated by the great performance of recurrent neural networks applied to machine translation, researchers have begun to pay attention to image description with related deep learning methods. A recurrent neural network cannot remember long-term information, but Long Short-Term Memory (LSTM) handles this well. However, the LSTM applied to image description to predict sentences in previous literature [1] can...
This paper presents experimental results on loop closure detection in mobile robots through spectral description of image sets and data dimensionality reduction. Both the spectral description and the low-dimensional representation depend heavily on the concept of the dominant eigenvector. Integration between Matlab and the ROS interface was exploited to perform our experiments. In addition, two environments were...
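The dominant eigenvector that the abstract above relies on is conventionally approximated by power iteration. The following is a generic, self-contained sketch of that standard technique, not the paper's actual implementation:

```python
# Power iteration: approximate the dominant eigenvector of a square
# matrix, the quantity on which spectral image descriptions and
# low-dimensional representations are built. Generic sketch only.

def power_iteration(A, iters=200):
    """Return the unit-norm dominant eigenvector of A and its eigenvalue."""
    n = len(A)
    v = [1.0 / n ** 0.5] * n  # arbitrary non-zero start vector
    for _ in range(iters):
        # w = A v, then renormalize
        w = [sum(A[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    # Rayleigh quotient v^T A v gives the corresponding eigenvalue
    Av = [sum(A[i][j] * v[j] for j in range(n)) for i in range(n)]
    lam = sum(v[i] * Av[i] for i in range(n))
    return v, lam
```

Convergence is geometric in the ratio of the two largest eigenvalue magnitudes, which is why a few hundred iterations typically suffice for well-separated spectra.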
In the task of product image search, the database consists of clean versions of product images, while the query photos are often captured with mobile phone cameras under uncontrolled conditions. Conventional methods usually adopt a SIFT-based bag-of-words (BoW) representation of the whole query image, which suffers from interference from background noise. To address the problem, we extract multiple...