In this paper, the lip feature that has the highest correlation with audio features is investigated. Audio features are selected as Mel Frequency Cepstral Coefficients (MFCC) of the audio signal. Three different lip features are considered for the visual lip information, where these features are 2D DCT coefficients of the intensity based image and the optical flow vectors within the lip region, and...
Both Discrete Cosine Transform (DCT) and Singular Value Decomposition (SVD) have been used as mathematical tools for embedding data into an image. In this paper, we present a new robust hybrid watermarking scheme based on DCT and SVD. After applying the DCT to the cover image, we map the DCT coefficients in a zig-zag order into four quadrants, and apply the SVD to each quadrant. These four quadrants...
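The zig-zag mapping step mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not specify how the scanned coefficients are assigned to quadrants, so the code assumes a JPEG-style zig-zag scan whose output is cut into four equal consecutive runs, each reshaped row by row into one quadrant. The function names `zigzag_indices` and `zigzag_quadrants` are hypothetical, and the DCT and SVD steps are left out to keep the example dependency-free.

```python
def zigzag_indices(n):
    """Return the (row, col) pairs of an n x n block in JPEG-style zig-zag order."""
    order = []
    for s in range(2 * n - 1):  # s = row + col identifies one anti-diagonal
        rows = range(max(0, s - n + 1), min(s, n - 1) + 1)
        # Even anti-diagonals are walked bottom-left to top-right, odd ones the reverse.
        if s % 2 == 0:
            rows = reversed(rows)
        order.extend((r, s - r) for r in rows)
    return order


def zigzag_quadrants(coeffs):
    """Split a square coefficient matrix into four quadrants along the zig-zag scan.

    Assumption (not taken from the paper): the scan is cut into four equal
    consecutive runs, so the first quadrant holds the lowest-frequency
    coefficients and the last quadrant the highest-frequency ones.
    """
    n = len(coeffs)
    if n % 2 or any(len(row) != n for row in coeffs):
        raise ValueError("expected a square matrix with even side length")
    scan = [coeffs[r][c] for r, c in zigzag_indices(n)]
    half = n // 2
    size = half * half
    quadrants = []
    for q in range(4):
        chunk = scan[q * size:(q + 1) * size]
        # Reshape the run of coefficients row by row into a half x half block.
        quadrants.append([chunk[i * half:(i + 1) * half] for i in range(half)])
    return quadrants


# Toy 4 x 4 "coefficient" matrix whose entries equal their row-major index.
demo = [[4 * r + c for c in range(4)] for r in range(4)]
demo_quadrants = zigzag_quadrants(demo)
```

In a full pipeline, each returned quadrant would then be passed to an SVD routine such as `numpy.linalg.svd`; how the singular values are modified to embed the watermark is not recoverable from the truncated abstract.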
In this paper, we address the problem of standard-compliant frame rate up-conversion (SC-FRUC) at the decoder by analyzing and processing the received motion vectors for low bit rate applications. In the proposed SC-FRUC scheme, the skipped frames are generated at the decoder by using received motion vectors only. We introduce a smooth motion vector interpolation method to enhance visual quality...
A time-domain constrained subspace-based estimator for extracting a visual evoked potential (VEP) from highly noisy brain activity is proposed. Generally, the desired VEP is corrupted by background electroencephalogram (EEG) behaving as colored noise, making the overall signal-to-noise ratio as low as −10 dB. The estimator is designed to minimize signal distortion, while keeping residual noise below...
This paper presents a novel Virtual Reality tool for visualising image neighbourhoods. It is currently implemented using RGB colour space, but the technique is not limited to Cartesian spaces. The Visualisation Tool is based on Virtual Reality Modeling Language (VRML). Visualisation using virtual reality techniques is a useful tool to help evaluate the effectiveness of colour filters. It can also...
In this paper, a fuzzy approach is used for efficiently representing the visual content. Initially, the image is partitioned into several segments (objects) and for each segment appropriate features, such as the segment color and texture, are extracted. In the following, all features are classified in a fuzzy framework, resulting in a content interpretation closer to the human perception. Furthermore,...
This paper describes a nonlinear Image-Based Visual Servo controller for the automatic landing of a fixed-wing aircraft in the presence of a wind gust. Only 2D features extracted from the image of the runway are used to follow a typical landing path (alignment, glide and flare maneuvers). The visual measurements for the alignment are the two lines delimiting the runway using the so-called...
This paper proposes a general framework for the analysis and synthesis of multicriteria visual-based control schemes. Both position-based and image-based controllers can be studied, whether static or dynamic. A kinematic visual-based positioning standard problem is considered. Its embedding into a Structured Norm-bounded Linear Differential Inclusion allows sufficient conditions to be stated for some...
Due to the rapid evolution of multimedia technology, multimedia data have been growing at a phenomenal rate. With the enormous amount of multimedia data, the richness of its information has raised the demand for sophisticated multimedia knowledge discovery systems. Multimedia documents require distinct types of processing and knowledge discovery methods, due to the distinct characteristics of...
In this paper we take a look at extensions of the Bag of Words model developed within the last few years, namely the aggregation of vector residuals known as VLAD encodings and Fisher kernels, and assess their performance for the classification task using multiple views. We also take a look at the triangular embedding strategy for classification in the compression domain. Our work focuses on using...
Introduction: ST segment values are used to assess patients with suspected acute myocardial infarction. Because ST segment changes can be missed in busy clinical environments, researchers have proposed the ST Map as an alternative for depicting ST segment data. Because this approach has shown potential for use within routine clinical practice, we implemented a web-based visualization system that could...
In this paper we propose a novel spatially stratified sampling technique for evaluating the likelihood function in particle filters. In particular, we show that in the case where the measurement function uses spatial correspondence, we can greatly reduce computational cost by exploiting spatial structure to avoid redundant computations. We present results which quantitatively show that the technique...
We propose a new framework for image recognition by selectively pooling local visual descriptors, and show its superior discriminative power on fine-grained image classification tasks. The representation is based on selecting the most confident local descriptors for nonlinear function learning using a linear approximation in an embedded higher dimensional space. The advantage of our Selective Pooling...
Key-frame-based video summarization, which enables a user to access any video in a friendly and meaningful way, has emerged as an important area of research for the multimedia community. Various pattern clustering techniques are applied for the extraction of key frames from a video to form a storyboard. In this work, we improve the existing Delaunay-graph-based video summarization framework with i) semantic...
Recognition of the social styles of people is an interesting but relatively unexplored task. Recognizing "style" appears to be quite a different problem from categorization; it is like recognizing a letter's font as opposed to recognizing the letter itself. Similar-looking things must be mapped to different categories. Hence a priori it would appear that features that are good for categorization...
In cartographic symbology, the evaluation of a symbol set's visual similarity is an important and frequently required task. Usually, after a set of map symbols is designed, the visual similarity of the symbols needs to be examined manually. To fully automate this task, in this paper we propose two approaches based on entropy calculated on the SOM surface for quantifying and visualizing the visual similarities...
Objective: Patients with chronic diseases and complications may frequently visit different specialists. Analytics could help deliver patient-centric seamless care by providing insights on the visit patterns of this group of patients, so that utilization of healthcare resources can be optimized. A new perspective focusing on patients' specialist utilization records combined with statistical learning...
A novel approach to steady-state visual evoked potential (SSVEP) based brain-computer interfaces (BCI) is presented in this paper. To minimize possible side effects of monochromatic-light SSVEP-based BCI, we propose to use chromatic green-blue flicker stimuli at frequencies higher than those traditionally used. The developed safer SSVEP responses are processed and classified with features...
Due to the semantic gap between low-level visual features and the high-level semantic content of images, image annotation methods based on low-level visual features cannot adequately meet the requirements of knowledge discovery from web images. Therefore, the automatic acquisition of the high-level semantic content of images has become a hot research topic. The traditional image annotation methods represent...
This paper analyses the attributes of the population dynamics of the Differential Evolution algorithm using Complex Network Analysis tools. The population is visualised as an evolving complex network, which exhibits non-trivial features. Among the complex network attributes, the adjacency graph gives the interconnectivity, centralities give an overview of convergence and stagnation, whereas cliques outline the depth...