Search results

chapter

Progressive Large Scale-Invariant Image Matching in Scale Space

Lei Zhou, Siyu Zhu, Tianwei Shen, Jinglu Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2381 - 2390

2017 IEEE International Conference on Computer Vision (ICCV)

The power of modern image matching approaches is still fundamentally limited by the abrupt scale changes in images. In this paper, we propose a scale-invariant image matching approach to tackling the very large scale variation of views. Drawing inspiration from the scale space theory, we start with encoding the image’s scale space into a compact multi-scale representation. Then, rather than trying...

chapter

Group Re-identification via Unsupervised Transfer of Sparse Features Encoding

Giuseppe Lisanti, Niki Martinel, Alberto Del Bimbo, Gian Luca Foresti

2017 IEEE International Conference on Computer Vision (ICCV) > 2468 - 2477

2017 IEEE International Conference on Computer Vision (ICCV)

Person re-identification is best known as the problem of associating a single person that is observed from one or more disjoint cameras. The existing literature has mainly addressed such an issue, neglecting the fact that people usually move in groups, like in crowded scenarios. We believe that the additional information carried by neighboring individuals provides a relevant visual context that can...

chapter

SuBiC: A Supervised, Structured Binary Code for Image Search

Himalaya Jain, Joaquin Zepeda, Patrick Perez, Remi Gribonval

2017 IEEE International Conference on Computer Vision (ICCV) > 833 - 842

2017 IEEE International Conference on Computer Vision (ICCV)

For large-scale visual search, highly compressed yet meaningful representations of images are essential. Structured vector quantizers based on product quantization and its variants are usually employed to achieve such compression while minimizing the loss of accuracy. Yet, unlike binary hashing schemes, these unsupervised methods have not yet benefited from the supervision, end-to-end learning and...

chapter

Aspects of image compression using neural networks for visual servoing in robot control

V. Nicolau, G. Petrea, M. Andrei

2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE) > 1 - 5

2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE)

Artificial intelligence is widely used in image processing. Neural networks (NN) were successful used for solving complicated issues due to their capacity of generalization and learning from examples. In this paper some aspects of image compression using artificial neural networks are discussed. The network is used in the feedback loop of the visual servoing system, which aims to control a wheeled...

chapter

Simple and energy efficient discrete cosine transform based image compression for simple pulse-based modulation

Muhammad Agus Zainuddin, Sritrusta Sukaridhoto

2017 International Electronics Symposium on Engineering Technology and Applications (IES-ETA) > 81 - 86

2017 International Electronics Symposium on Engineering Technology and Applications (IES-ETA)

A node in wireless sensor networks has limited battery capacity. In this case, energy efficiency is crucial to prolong the sensor devices lifetime. In this paper, we propose a simple and energy efficient image compression (SEIC), based on Discrete Cosine Transform (DCT) transform, in addition to our previous proposed method based on Discrete Wavelet Transform (SEIC-DWT). SEIC (DCT or DWT) consists...

chapter

Lossless pixel-gradient embedded compression algorithm for the memory bandwidth saving of the larger-sized WRGB OLED applications

Wen-Yu Chiou, Yu-Hsuan Lee

2017 24th International Workshop on Active-Matrix Flatpanel Displays and Devices (AM-FPD) > 143 - 146

2017 24th International Workshop on Active-Matrix Flatpanel Displays and Devices (AM-FPD)

The WRGB-OLED with larger-sized display resolution can bring us more colorful and better visual experiences. However, it also makes OLED display system suffer from a serious bottleneck on memory bandwidth. In this paper, the lossless pixel-gradient EC algorithm is proposed to overcome this bottleneck. It consists of two core techniques: Finer-Gradient-Based Prediction (FGBP) and Gradient-Based Golomb-Rice...

chapter

SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning

Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6298 - 6306

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Visual attention has been successfully applied in structural prediction tasks such as visual captioning and question answering. Existing visual attention models are generally spatial, i.e., the attention is modeled as spatial probabilities that re-weight the last conv-layer feature map of a CNN encoding an input image. However, we argue that such spatial attention does not necessarily conform to the...

chapter

Traffic scene recognition based on deep CNN and VLAD spatial pyramids

Fang-Yu Wu, Shi-Yang Yan, Jeremy S. Smith, Bai-Ling Zhang

2017 International Conference on Machine Learning and Cybernetics (ICMLC) > 1 > 156 - 161

2017 International Conference on Machine Learning and Cybernetics (ICMLC)

Traffic scene recognition is an important and challenging issue in Intelligent Transportation Systems (ITS). Recently, Convolutional Neural Network (CNN) models have achieved great success in many applications, including scene classification. The remarkable representational learning capability of CNN remains to be further explored for solving real-world problems. Vector of Locally Aggregated Descriptors...

chapter

Toward the realization of six degrees-of-freedom with compressed light fields

Arianne T. Hinds, Didier Doyen, Pablo Carballeira

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1171 - 1176

2017 IEEE International Conference on Multimedia and Expo (ICME)

360° video, supporting the ability to present views consistent with the rotation of the viewer's head along three axes (roll, pitch, yaw) is the current approach for creation of immersive video experiences. Nevertheless, a more fully natural, photorealistic experience — with support of visual cues that facilitate coherent psycho-visual sensory fusion without the side-effect of cyber-sickness — is...

chapter

An efficient deep convolutional neural networks model for compressed image deblocking

Ke Li, Bahetiyaer Bare, Bo Yan

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1320 - 1325

2017 IEEE International Conference on Multimedia and Expo (ICME)

Convolutional neural networks (CNNs) have been widely used in image processing community. Image deblocking is a post-processing strategy, which aims to reduce the visually annoying blocking artifacts that are caused by block-based transform coding at low bit rates. In recent years, CNNs based methods have been proposed to solve this classic image processing problem. In this paper, we present an efficient...

chapter

Halftoning-based Block Truncation Coding feature on scrambled images

Heri Prasetyo, Bambang Harjito

2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW) > 169 - 170

2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW)

This paper investigates the usability of Halftoning-based Block Truncation Coding (HBTC) feature for image retrieval. It assumes that all images in database are stored in scrambled/encrypted format. Firstly, an image feature descriptor is derived from the scrambled/encrypted image. This image feature is subsequently converted into the binary representation to achieve fast similarity measurement. The...

chapter

Poster abstract: MicroBrain: Compressing deep neural networks for energy-efficient visual inference service

Shiming Ge, Zhao Luo, Qiting Ye, Xiao-Yu Zhang

2017 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) > 1000 - 1001

IEEE INFOCOM 2017 -IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)

The deployments of deep neural network models on mobile or embedded devices have been hindered due to their large number of weights. In this work, we develop a deep neural network (DNN) model compression service termed MicroBrain to reduce the resource usage for energy-efficient visual inference. By automatically analyzing the trained DNN models, we propose a high-performance DNN model compression...

chapter

Impact of interactivity on the assessment of quality of experience for light field content

Irene Viola, Martin Rerabek, Touradj Ebrahimi

2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX) > 1 - 6

2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX)

The recent advances in light field imaging are changing the way in which visual content is captured, processed and consumed. Storage and delivery systems for light field images rely on efficient compression algorithms. Such algorithms must additionally take into account the feature-rich rendering for light field content. Therefore, a proper evaluation of visual quality is essential to design and improve...

chapter

Separating inference from feature learning in deep unsupervised visual saliency estimation

Bruno Tailie, Michael Garcia Ortiz

2017 International Joint Conference on Neural Networks (IJCNN) > 1195 - 1201

2017 International Joint Conference on Neural Networks (IJCNN)

Robotic agents, when not equipped with traditional means to capture information about their surroundings, must autonomously learn to extract this information from a very complex environment. In the context of developmental robotics, we use unsupervised representation learning, and more specifically deep autoencoders, in order to capture visual representations. These generic visual representations...

chapter

Visual language model for keyword spotting on historical mongolian document images

Hongxi Wei, Guanglai Gao

2017 29th Chinese Control And Decision Conference (CCDC) > 1737 - 1742

2017 29th Chinese Control And Decision Conference (CCDC)

The Bag-of-Visual-Words (BoVW) approach has been attracted some attention in the field of keyword spotting. However, the BoVW approach discards the spatial relations of the visual words. Therefore, a visual language model is integrated into the BoVW framework in this study so as to add the spatial information. To accomplish the process of keyword spotting, two well-known retrieval schemes, including...

chapter

Compression of topological models and localization using the global appearance of visual information

Luis Paya, Walterio Mayol, Sergio Cebollada, Oscar Reinoso

2017 IEEE International Conference on Robotics and Automation (ICRA) > 5630 - 5637

2017 IEEE International Conference on Robotics and Automation (ICRA)

In this work, a clustering approach to obtain compact topological models of an environment is developed and evaluated. The usefulness of these models is tested by studying their utility to solve the robot localization problem subsequently. Omnidirectional visual information and global appearance descriptors are used both to create and compress the models and to estimate the position of the robot....

chapter

CAS-CNN: A deep convolutional neural network for image compression artifact suppression

Lukas Cavigelli, Pascal Hager, Luca Benini

2017 International Joint Conference on Neural Networks (IJCNN) > 752 - 759

2017 International Joint Conference on Neural Networks (IJCNN)

Lossy image compression algorithms are pervasively used to reduce the size of images transmitted over the web and recorded on data storage media. However, we pay for their high compression rate with visual artifacts degrading the user experience. Deep convolutional neural networks have become a widespread tool to address high-level computer vision tasks very successfully. Recently, they have found...

chapter

Bit allocation with visual attention and visual distortion sensitivity

Mesut Pak, Ulug Bayazit

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

2017 25th Signal Processing and Communications Applications Conference (SIU)

In this work, a novel bit allocation method based on visual attention and distortion sensitivity is developed for JPEG2000. Although, visual attention map for an image can be measured by using well-known saliency map methods, true visual attention map can be obtained by conducting experiments to determine fixation points and their durations. A perception model might turn these duration of fixations...

chapter

Assessment of lossy images with visual detection and saccadic eye movements

Shun-nan Yang, Ju Liu, Juliana Knopf, Hannah Colett, more

2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX) > 1 - 3

2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX)

This study investigated a novel method of evaluating visually lossy images based on saccadic eye movements. In each trial, participants with normal vision were asked to indicate any visible changes in the image while their gaze positions were being monitored. The original image was replaced with compressed or blurred versions of the same image 150ms after the onset of each eye fixation, and the parameters...

chapter

On the performance of objective metrics for omnidirectional visual content

Evgeniy Upenik, Martin Rerabek, Touradj Ebrahimi

2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX) > 1 - 6

2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX)

Omnidirectional image and video have gained popularity thanks to availability of capture and display devices for this type of content. Recent studies have assessed performance of objective metrics in predicting visual quality of omnidirectional content. These metrics, however, have not been rigorously validated by comparing their prediction results with ground-truth subjective scores. In this paper,...

INFONA - science communication portal

Search results

Progressive Large Scale-Invariant Image Matching in Scale Space

Group Re-identification via Unsupervised Transfer of Sparse Features Encoding

SuBiC: A Supervised, Structured Binary Code for Image Search

Aspects of image compression using neural networks for visual servoing in robot control

Simple and energy efficient discrete cosine transform based image compression for simple pulse-based modulation

Lossless pixel-gradient embedded compression algorithm for the memory bandwidth saving of the larger-sized WRGB OLED applications

SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning

Traffic scene recognition based on deep CNN and VLAD spatial pyramids

Toward the realization of six degrees-of-freedom with compressed light fields

An efficient deep convolutional neural networks model for compressed image deblocking

Halftoning-based Block Truncation Coding feature on scrambled images

Poster abstract: MicroBrain: Compressing deep neural networks for energy-efficient visual inference service

Impact of interactivity on the assessment of quality of experience for light field content

Separating inference from feature learning in deep unsupervised visual saliency estimation

Visual language model for keyword spotting on historical mongolian document images

Compression of topological models and localization using the global appearance of visual information

CAS-CNN: A deep convolutional neural network for image compression artifact suppression

Bit allocation with visual attention and visual distortion sensitivity

Assessment of lossy images with visual detection and saccadic eye movements

On the performance of objective metrics for omnidirectional visual content

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options