Search results

Items from 1 to 20 out of 543 results

chapter

A deep neural network approach to fusing vision and heteroscedastic motion estimates for low-SWaP robotic applications

E. Jared Shamwell, William D. Nothwang, Donald Perlis

2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI) > 56 - 63

2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)

Due both to the speed and quality of their sensors and restrictive on-board computational capabilities, current state-of-the-art (SOA) size, weight, and power (SWaP) constrained autonomous robotic systems are limited in their abilities to sample, fuse, and analyze sensory data for state estimation. Aimed at improving SWaP-constrained robotic state estimation, we present Multi-Hypothesis DeepEfference...

chapter

Learning Robust Visual-Semantic Embeddings

Yao-Hung Hubert Tsai, Liang-Kang Huang, Ruslan Salakhutdinov

2017 IEEE International Conference on Computer Vision (ICCV) > 3591 - 3600

2017 IEEE International Conference on Computer Vision (ICCV)

Many of the existing methods for learning joint embedding of images and text use only supervised information from paired images and its textual attributes. Taking advantage of the recent success of unsupervised learning in deep neural networks, we propose an end-to-end learning framework that is able to extract more robust multi-modal representations across domains. The proposed method combines representation...

chapter

[POSTER] Depth Map Interpolation Using Perceptual Loss

Ilya Makarov, Vladimir Aliev, Olga Gerasimova, Pavel Polyakov

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 93 - 94

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct)

In this paper, we discuss a semi-dense depth map interpolation method based on convolutional neural network. We propose a compact neural network architecture with loss function defined as Euclidean distance in the feature space of VGG-16 neural network used for deep visual recognition. The suggested solution shows state-of-art performance on synthetic and real datasets. Together with LSD-SLAM, the...

chapter

Iterative denoising-based mesh-to-grid reconstruction with hyperparametric adaptation

Jan Koloda, Michel Batz, Andre Kaup

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 5

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

This paper presents a new method for the reconstruction of images from samples located at non-integer mesh positions. This is a common scenario for many image processing applications such as multi-image super-resolution, frame-rate up-conversion, or virtual view synthesis in multi-camera systems. The proposed method consists of an iterative procedure that employs adaptive denoising in order to reduce...

chapter

An efficient lossless (2, n) secret image sharing based on Blakley's scheme

Sebastien Beugnon, William Puech, Jean-Pierre Pedeboy

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

Visual Secret Sharing (VSS) is a type of cryptographic method used to secure digital media such as images by splitting it into n shares. Then, with k or more shares, the secret media can be reconstructed. Without the required number of shares, they are totally useless individually. The purpose of secret sharing methods is to reinforce the cryptographic approach from different points of failure as...

chapter

From Square Pieces to Brick Walls: The Next Challenge in Solving Jigsaw Puzzles

Shir Gur, Ohad Ben-Shahar

2017 IEEE International Conference on Computer Vision (ICCV) > 4049 - 4057

2017 IEEE International Conference on Computer Vision (ICCV)

Research into computational jigsaw puzzle solving, an emerging theoretical problem with numerous applications, has focused in recent years on puzzles that constitute square pieces only. In this paper we wish to extend the scientific scope of appearance-based puzzle solving and consider ’’brick wall” jigsaw puzzles – rectangular pieces who may have different sizes, and could be placed next to each...

chapter

Multi-view Dynamic Shape Refinement Using Local Temporal Integration

Vincent Leroy, Jean-Sebastien Franco, Edmond Boyer

2017 IEEE International Conference on Computer Vision (ICCV) > 3113 - 3122

2017 IEEE International Conference on Computer Vision (ICCV)

We consider 4D shape reconstructions in multi-view environments and investigate how to exploit temporal redundancy for precision refinement. In addition to being beneficial to many dynamic multi-view scenarios this also enables larger scenes where such increased precision can compensate for the reduced spatial resolution per image frame. With precision and scalability in mind, we propose a symmetric...

chapter

Visualization of the microcirculation in micro vasculatures by photoacoustic tomography with high frequency spherical array transducer

Ryo Nagaoka, Takuya Tabata, Ryo Takagi, Shin Yoshizawa, more

2017 IEEE International Ultrasonics Symposium (IUS) > 1

2017 IEEE International Ultrasonics Symposium (IUS)

A spatial resolution of photoacoustic tomography (PAT) has been limited by the receive frequency significantly lower than that of photoacoustic microscopy. In the present study, an in vivo microcirculation is visualized by a PAT system by using a newly developed spherical array transducer with a center frequency of 10 MHz. Additionally, we propose a novel reconstruction method suppressing the side...

chapter

Visualizing and improving scattering networks

Fergal Cotter, Nick Kingsbury

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)

Scattering Transforms (or ScatterNets) introduced by Mallat in [1] are a promising start into creating a well-defined feature extractor to use for pattern recognition and image classification tasks. They are of particular interest due to their architectural similarity to Convolutional Neural Networks (CNNs), while requiring no parameter learning and still performing very well (particularly in constrained...

chapter

On the extension of XOR step construction for optimal contrast grey level visual cryptography

K. Praveen, M. Sethumadhavan

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 219 - 222

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Step construction for visual cryptography which is applicable to both OR and XOR operation based reconstruction uses traditional (2, 2)-visual cryptographic scheme (VCS) recursively for generating shares. XOR step construction is the first deterministic general access structure scheme based on XOR reconstruction for binary images. The average pixel expansion of step construction is very less when...

chapter

Accurate 3D reconstruction for less overlapped laparoscopic sequential images

Jangseok Oh, Kwangtaek Kim

2017 2nd International Conference on Bio-engineering for Smart Technologies (BioSMART) > 1 - 4

2017 2nd International Conference on Bio-engineering for Smart Technologies (BioSMART)

In this paper, we propose a new method that can reconstruct the more accurate 3D surface point clouds from the standard laparoscopic (single camera) imaging system even with less sequential (less overlapped) and low quality images, which is a promising way to achieve a faster surgical guidance. The strength of our method is to find more accurate feature points that lead to precise 3D reconstruction...

chapter

Autoencoder-augmented neuroevolution for visual doom playing

Samuel Alvernaz, Julian Togelius

2017 IEEE Conference on Computational Intelligence and Games (CIG) > 1 - 8

2017 IEEE Conference on Computational Intelligence and Games (CIG)

Neuroevolution has proven effective at many re-inforcement learning tasks, including tasks with incomplete information and delayed rewards, but does not seem to scale well to high-dimensional controller representations, which are needed for tasks where the input is raw pixel data. We propose a novel method where we train an autoencoder to create a comparatively low-dimensional representation of the...

chapter

A novel edge-preserving mesh-based method for image scaling

Seyedali Mostafavian, Michael D. Adams

2017 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM) > 1 - 6

2017 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM)

In this paper, we present a novel image scaling method that employs a mesh model that explicitly represents discontinuities in the image. Our method effectively addresses the problem of preserving the sharpness of edges, which has always been a challenge, during image enlargement. We use a constrained Delaunay triangulation to generate the model and an approximating function that is continuous everywhere...

chapter

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6175 - 6184

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Accurate visual localization is a key technology for autonomous navigation. 3D structure-based methods employ 3D models of the scene to estimate the full 6DOF pose of a camera very accurately. However, constructing (and extending) large-scale 3D models is still a significant challenge. In contrast, 2D image retrieval-based methods only require a database of geo-tagged images, which is trivial to construct...

chapter

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold

Youngjoon Yoo, Sangdoo Yun, Hyung Jin Chang, Yiannis Demiris, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2943 - 2952

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper proposes a new high dimensional regression method by merging Gaussian process regression into a variational autoencoder framework. In contrast to other regression methods, the proposed method focuses on the case where output responses are on a complex high dimensional manifold, such as images. Our contributions are summarized as follows: (i) A new regression method estimating high dimensional...

chapter

Semantic Autoencoder for Zero-Shot Learning

Elyor Kodirov, Tao Xiang, Shaogang Gong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4447 - 4456

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Existing zero-shot learning (ZSL) models typically learn a projection function from a feature space to a semantic embedding space (e.g. attribute space). However, such a projection function is only concerned with predicting the training seen class semantic representation (e.g. attribute prediction) or classification. When applied to test data, which in the context of ZSL contains different (unseen)...

chapter

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks

Xiao Yang, Ersin Yumer, Paul Asente, Mike Kraley, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4342 - 4351

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an end-to-end, multimodal, fully convolutional network for extracting semantic structures from document images. We consider document semantic structure extraction as a pixel-wise segmentation task, and propose a unified model that classifies pixels based not only on their visual appearance, as in the traditional page segmentation task, but also on the content of underlying text. Moreover,...

chapter

Learning Shape Abstractions by Assembling Volumetric Primitives

Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1466 - 1474

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a learning framework for abstracting complex shapes by learning to assemble objects using 3D volumetric primitives. In addition to generating simple and geometrically interpretable explanations of 3D objects, our framework also allows us to automatically discover and exploit consistent structure in the data. We demonstrate that using our method allows predicting shape representations which...

chapter

Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context

Qingan Yan, Long Yang, Ling Zhang, Chunxia Xiao

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 152 - 160

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A perennial problem in structure from motion (SfM) is visual ambiguity posed by repetitive structures. Recent disambiguating algorithms infer ambiguities mainly via explicit background context, thus face limitations in highly ambiguous scenes which are visually indistinguishable. Instead of analyzing local visual information, we propose a novel algorithm for SfM disambiguation that explores the global...

chapter

Adversarial Discriminative Domain Adaptation

Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2962 - 2971

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Adversarial learning methods are a promising approach to training robust deep networks, and can generate complex samples across diverse domains. They can also improve recognition despite the presence of domain shift or dataset bias: recent adversarial approaches to unsupervised domain adaptation reduce the difference between the training and test domain distributions and thus improve generalization...

Keywords:
VISUALIZATION
IMAGE RECONSTRUCTION

Publication date

Set your own date range

Content availability

Available (534)
None (9)

Keywords

CAMERAS (130)
PIXEL (83)
THREE DIMENSIONAL DISPLAYS (81)
IMAGE CODING (70)
CRYPTOGRAPHY (66)
IMAGE COLOR ANALYSIS (61)
THREE-DIMENSIONAL DISPLAYS (61)
FEATURE EXTRACTION (59)
SOLID MODELING (52)
COMPUTATIONAL MODELING (48)
SHAPE (44)
IMAGE EDGE DETECTION (41)
IMAGE RESOLUTION (41)
TRAINING (40)
IMAGE SEGMENTATION (39)
3D RECONSTRUCTION (34)
MEDICAL IMAGE PROCESSING (30)
PSNR (29)
OPTIMIZATION (27)
ENCODING (26)
DATA VISUALISATION (25)
RENDERING (COMPUTER GRAPHICS) (25)
INTERPOLATION (24)
COMPUTER VISION (23)
GEOMETRY (23)
IMAGE QUALITY (23)
MATHEMATICAL MODEL (23)
VISUAL CRYPTOGRAPHY (23)
DATA MINING (22)
ESTIMATION (22)
STACKING (22)
DECODING (21)
VECTORS (21)
VISUAL HULL (21)
DICTIONARIES (20)
IMAGE PROCESSING (19)
ROBUSTNESS (19)
COLOR (18)
CORRELATION (18)
IMAGE COLOUR ANALYSIS (18)
CALIBRATION (17)
HUMANS (17)
IMAGE REPRESENTATION (17)
ALGORITHM DESIGN AND ANALYSIS (16)
BIOMEDICAL IMAGING (16)
SOLID MODELLING (16)
STEREO IMAGE PROCESSING (16)
SURFACE RECONSTRUCTION (16)
COMPUTATIONAL GEOMETRY (15)
IMAGE MATCHING (15)
IMAGE SEQUENCES (15)
SECRET SHARING (15)
TARGET TRACKING (15)
IMAGE RESTORATION (14)
SIMULTANEOUS LOCALIZATION AND MAPPING (14)
VIRTUAL REALITY (14)
COMPUTED TOMOGRAPHY (13)
VIDEO CODING (13)
ACCURACY (12)
COMPUTERISED TOMOGRAPHY (12)
COMPUTERS (12)
DISTANCE MEASUREMENT (12)
EQUATIONS (12)
KERNEL (12)
MEASUREMENT (12)
PRINCIPAL COMPONENT ANALYSIS (12)
RECONSTRUCTION (12)
VIDEO SIGNAL PROCESSING (12)
DATA COMPRESSION (11)
DATA MODELS (11)
IMAGE TEXTURE (11)
NOISE (11)
REAL-TIME SYSTEMS (11)
STREAMING MEDIA (11)
VISUAL SECRET SHARING (11)
WAVELET TRANSFORMS (11)
COMPLEXITY THEORY (10)
DATA VISUALIZATION (10)
ENCRYPTION (10)
FACE (10)
GRAY-SCALE (10)
IMAGING (10)
MOTION ESTIMATION (10)
REAL TIME SYSTEMS (10)
ROBOT SENSING SYSTEMS (10)
SECURITY (10)
SURGERY (10)
TRANSFORM CODING (10)
TRANSFORMS (10)
VISUAL QUALITY (10)
AUTHENTICATION (9)
CONFERENCES (9)
DATABASES (9)
FILTERING (9)
IMAGE CLASSIFICATION (9)
INDEXES (9)
LIGHTING (9)
NAVIGATION (9)
more

INFONA - science communication portal

Search results

A deep neural network approach to fusing vision and heteroscedastic motion estimates for low-SWaP robotic applications

Learning Robust Visual-Semantic Embeddings

[POSTER] Depth Map Interpolation Using Perceptual Loss

Iterative denoising-based mesh-to-grid reconstruction with hyperparametric adaptation

An efficient lossless (2, n) secret image sharing based on Blakley's scheme

From Square Pieces to Brick Walls: The Next Challenge in Solving Jigsaw Puzzles

Multi-view Dynamic Shape Refinement Using Local Temporal Integration

Visualization of the microcirculation in micro vasculatures by photoacoustic tomography with high frequency spherical array transducer

Visualizing and improving scattering networks

On the extension of XOR step construction for optimal contrast grey level visual cryptography

Accurate 3D reconstruction for less overlapped laparoscopic sequential images

Autoencoder-augmented neuroevolution for visual doom playing

A novel edge-preserving mesh-based method for image scaling

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold

Semantic Autoencoder for Zero-Shot Learning

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks

Learning Shape Abstractions by Assembling Volumetric Primitives

Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context

Adversarial Discriminative Domain Adaptation

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options