Search results

chapter

Kvazaar: HEVC/H.265 4K30p Intra Encoder

Arttu Yla-Outinen, Ari Lemmetti, Marko Viitanen, Jarno Vanne, more

2017 IEEE International Symposium on Multimedia (ISM) > 362 - 363

2017 IEEE International Symposium on Multimedia (ISM)

This paper demonstrates the usage of Kvazaar open-source HEVC intra encoder in 4K real-time video encoding. In this setup, a raw 4K video is shot by an action camera, captured by an HDMI capture card, encoded in real-time by Kvazaar ultrafast preset on a 22-core Intel Xeon processor, sent to a laptop, and decoded by OpenHEVC decoder for playback. The encoding process is visualized on the fly by Kvazaar...

chapter

Estimating objectness using a compound eye camera

Hwiyeon Yoo, Donghoon Lee, Geonho Cha, Songhwai Oh

2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI) > 131 - 136

2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)

In this paper, we introduce a new hardware platform that mimics a compound eye of an insect and propose an algorithm to detect objects using it. The compound eye camera has a wide viewing angle and simulates a number of single eyes on its hemisphere. Each single eye is an elementary unit to acquire visual inputs. Visual information from single eyes is hierarchically merged to estimate objectness....

chapter

Group Re-identification via Unsupervised Transfer of Sparse Features Encoding

Giuseppe Lisanti, Niki Martinel, Alberto Del Bimbo, Gian Luca Foresti

2017 IEEE International Conference on Computer Vision (ICCV) > 2468 - 2477

2017 IEEE International Conference on Computer Vision (ICCV)

Person re-identification is best known as the problem of associating a single person that is observed from one or more disjoint cameras. The existing literature has mainly addressed such an issue, neglecting the fact that people usually move in groups, like in crowded scenarios. We believe that the additional information carried by neighboring individuals provides a relevant visual context that can...

chapter

Video Fill In the Blank Using LR/RL LSTMs with Spatial-Temporal Attentions

Amir Mazaheri, Dong Zhang, Mubarak Shah

2017 IEEE International Conference on Computer Vision (ICCV) > 1416 - 1425

2017 IEEE International Conference on Computer Vision (ICCV)

Given a video and a description sentence with one missing word, “source sentence”, Video-Fill-In-the-Blank (VFIB) problem is to find the missing word automatically. The contextual information of the sentence, as well as visual cues from the video, are important to infer the missing word accurately. Since the source sentence is broken into two fragments: the sentence’s left fragment (before the blank)...

chapter

Genetic CNN

Lingxi Xie, Alan Yuille

2017 IEEE International Conference on Computer Vision (ICCV) > 1388 - 1397

2017 IEEE International Conference on Computer Vision (ICCV)

The deep convolutional neural network (CNN) is the state-of-the-art solution for large-scale visual recognition. Following some basic principles such as increasing network depth and constructing highway connections, researchers have manually designed a lot of fixed network architectures and verified their effectiveness.,,In this paper, we discuss the possibility of learning deep network structures...

chapter

A New Pooling Strategy Based on Local Feature Distribution: A Case Study for Human Action Classification

Raquel Almeida, Zenilton Kleber Goncalves Do Patrocinio, Silvio Jamil F. Guimaraes

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 149 - 154

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

Mid-level representations are used to map sets of local features into one global representation for a given media descriptor. In visual pattern recognition tasks, Bag-of-Words (BoW) is one popular strategy, among many methods available in literature, due mainly by the simplicity in concept and implementation. Despite the overall good results achieved by BoW in many tasks, the method is unstable in...

chapter

Evaluating how static analysis tools can reduce code review effort

Devarshi Singh, Varun Ramachandra Sekar, Kathryn T. Stolee, Brittany Johnson

2017 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC) > 101 - 105

2017 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC)

Peer code reviews are important for giving and receiving peer feedback, but the code review process is time consuming. Static analysis tools can help reduce reviewer effort by catching common mistakes prior to peer code review. Ideally, contributors would use static analysis tools prior to pull request submission so common mistakes could be addressed first, before invoking the reviewer. To explore...

chapter

Sketchnoting: A new approach to developing visual communication ability, improving critical thinking and creative confidence for engineering and design students

Verena Paepcke-Hjeltness, Mani Mina, Aziza Cyamani

2017 IEEE Frontiers in Education Conference (FIE) > 1 - 5

2017 IEEE Frontiers in Education Conference (FIE)

In this study we provide our methodology and implementation strategy of Sketchnoting in Freshman Engineering and Technological Literacy classes. The objective is to improve students' learning, visualization, and communication proficiencies, as well as to foster advancement in knowledge retention, and critical thinking. This study provides the motivation, supporting research background, design, and...

chapter

SuBiC: A Supervised, Structured Binary Code for Image Search

Himalaya Jain, Joaquin Zepeda, Patrick Perez, Remi Gribonval

2017 IEEE International Conference on Computer Vision (ICCV) > 833 - 842

2017 IEEE International Conference on Computer Vision (ICCV)

For large-scale visual search, highly compressed yet meaningful representations of images are essential. Structured vector quantizers based on product quantization and its variants are usually employed to achieve such compression while minimizing the loss of accuracy. Yet, unlike binary hashing schemes, these unsupervised methods have not yet benefited from the supervision, end-to-end learning and...

chapter

Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization

Sijia Cai, Wangmeng Zuo, Lei Zhang

2017 IEEE International Conference on Computer Vision (ICCV) > 511 - 520

2017 IEEE International Conference on Computer Vision (ICCV)

The success of fine-grained visual categorization (FGVC) extremely relies on the modeling of appearance and interactions of various semantic parts. This makes FGVC very challenging because: (i) part annotation and detection require expert guidance and are very expensive; (ii) parts are of different sizes; and (iii) the part interactions are complex and of higher-order. To address these issues, we...

chapter

Tips for creating a block language with blockly

Erik Pasternak, Rachel Fenichel, Andrew N. Marshall

2017 IEEE Blocks and Beyond Workshop (B&B) > 21 - 24

2017 IEEE Blocks and Beyond Workshop (B&B)

Blockly is an open source library that makes it easy to add block based visual programming to an app. It is designed to be flexible and supports a large set of features for different applications. It has been used for programming animated characters on a screen; creating story scripts; controlling robots; and even generating legal documents. But Blockly is not itself a language; developers who use...

chapter

Generalized Orderless Pooling Performs Implicit Salient Matching

Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4970 - 4979

2017 IEEE International Conference on Computer Vision (ICCV)

Most recent CNN architectures use average pooling as a final feature encoding step. In the field of fine-grained recognition, however, recent global representations like bilinear pooling offer improved performance. In this paper, we generalize average and bilinear pooling to “α-pooling”, allowing for learning the pooling strategy during training. In addition, we present a novel way to visualize decisions...

chapter

Flower classification using fusion descriptor and SVM

Wei Liu, Yunbo Rao, Baijiang Fan, Jiali Song, more

2017 International Smart Cities Conference (ISC2) > 1 - 4

2017 International Smart Cities Conference (ISC2)

This paper aims to develop an effective flower classification approach using the technology of feature extraction. With this regard, a fused descriptor based on Pyramid Histogram of Visual Words (PHOW) is used to extract the color, texture and contour information of flower image. Secondly, Dictionary Learning and Locality-constrained Linear Coding (LLC) are operated on PHOW feature and then images...

chapter

Tell me again about the face: Using repeated interviewing techniques to improve feature-based facial composite technologies

Charity Brown, Charlie D. Frowd, Emma Portch

2017 Seventh International Conference on Emerging Security Technologies (EST) > 38 - 43

2017 Seventh International Conference on Emerging Security Technologies (EST)

Facial composite technologies are used to produce visual resemblances of an offender. However, resemblances may be poor, particularly when composites are constructed using traditional ‘feature’ composite systems deployed several days after the crime. In this case a witness may have forgotten important details about an offender's appearance. Engaging in early and repeated retrieval attempts could potentially...

chapter

Creating a Digital Edition of Mongolian Historical Documents

Biligsaikhan Batjargal, Garmaabazar Khaltarkhuu, Akira Maeda

2017 International Conference on Culture and Computing (Culture and Computing) > 151 - 152

2017 International Conference on Culture and Computing (Culture and Computing)

In this paper, we introduce a digital edition of the Altan Tobchi, a Mongolian historical manuscript written in traditional Mongolian script. The Text Encoding Initiative guidelines were adopted to encode the named entities, commentaries, transcriptions, and interpretations of ancient Mongolian words. Named entities such as personal names and place names were extracted from digitized text by employing...

chapter

Lossless pixel-gradient embedded compression algorithm for the memory bandwidth saving of the larger-sized WRGB OLED applications

Wen-Yu Chiou, Yu-Hsuan Lee

2017 24th International Workshop on Active-Matrix Flatpanel Displays and Devices (AM-FPD) > 143 - 146

2017 24th International Workshop on Active-Matrix Flatpanel Displays and Devices (AM-FPD)

The WRGB-OLED with larger-sized display resolution can bring us more colorful and better visual experiences. However, it also makes OLED display system suffer from a serious bottleneck on memory bandwidth. In this paper, the lossless pixel-gradient EC algorithm is proposed to overcome this bottleneck. It consists of two core techniques: Finer-Gradient-Based Prediction (FGBP) and Gradient-Based Golomb-Rice...

chapter

Dictionary learning for spontaneous neural activity modeling

Birini Troullinou, Grigorios Tsagkatakis, Ganna Palagina, Maria Papadopouli, more

2017 25th European Signal Processing Conference (EUSIPCO) > 1579 - 1583

2017 25th European Signal Processing Conference (EUSIPCO)

Modeling the activity of an ensemble of neurons can provide critical insights into the workings of the brain. In this work we examine if learning based signal modeling can contribute to a high quality modeling of neuronal signal data. To that end, we employ the sparse coding and dictionary learning schemes for capturing the behavior of neuronal responses into a small number of representative prototypical...

chapter

Psychomotor cues for depression screening

Zafi Sherhan Shah, Kirill Sidorov, David Marshall

2017 22nd International Conference on Digital Signal Processing (DSP) > 1 - 5

2017 22nd International Conference on Digital Signal Processing (DSP)

Depression is a cognitive impairment, which according to the World Health Organisation is the leading cause of disability worldwide. One key trait of depression is psychomotor retardation, which adversely affects both emotional and physical behaviour of an individual. In this paper we perform experiments on the Audio Visual Emotion recognition Challenge 2016 — Depression Classification sub-Challenge...

chapter

Hierarchical Boundary-Aware Neural Encoder for Video Captioning

Lorenzo Baraldi, Costantino Grana, Rita Cucchiara

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3185 - 3194

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The use of Recurrent Neural Networks for video captioning has recently gained a lot of attention, since they can be used both to encode the input video and to generate the corresponding description. In this paper, we present a recurrent video encoding scheme which can discover and leverage the hierarchical structure of the video. Unlike the classical encoder-decoder approach, in which a video is encoded...

chapter

Deep TEN: Texture Encoding Network

Hang Zhang, Jia Xue, Kristin Dana

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2896 - 2905

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a Deep Texture Encoding Network (Deep-TEN) with a novel Encoding Layer integrated on top of convolutional layers, which ports the entire dictionary learning and encoding pipeline into a single model. Current methods build from distinct components, using standard encoders with separate off-the-shelf features such as SIFT descriptors or pre-trained CNN features for material recognition. Our...

INFONA - science communication portal

Search results

Kvazaar: HEVC/H.265 4K30p Intra Encoder

Estimating objectness using a compound eye camera

Group Re-identification via Unsupervised Transfer of Sparse Features Encoding

Video Fill In the Blank Using LR/RL LSTMs with Spatial-Temporal Attentions

Genetic CNN

A New Pooling Strategy Based on Local Feature Distribution: A Case Study for Human Action Classification

Evaluating how static analysis tools can reduce code review effort

Sketchnoting: A new approach to developing visual communication ability, improving critical thinking and creative confidence for engineering and design students

SuBiC: A Supervised, Structured Binary Code for Image Search

Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization

Tips for creating a block language with blockly

Generalized Orderless Pooling Performs Implicit Salient Matching

Flower classification using fusion descriptor and SVM

Tell me again about the face: Using repeated interviewing techniques to improve feature-based facial composite technologies

Creating a Digital Edition of Mongolian Historical Documents

Lossless pixel-gradient embedded compression algorithm for the memory bandwidth saving of the larger-sized WRGB OLED applications

Dictionary learning for spontaneous neural activity modeling

Psychomotor cues for depression screening

Hierarchical Boundary-Aware Neural Encoder for Video Captioning

Deep TEN: Texture Encoding Network

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options