The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In bit patterned magnetic recording (BPMR), The two-dimensional (2D) interference composed of inter-symbol and inter-track interference is a major problem especially at high areal density (AD). One way to alleviate the destructive effect of 2D interference is to deploy a 2D coding scheme on an input data sequence before recording in order to avoid some data patterns that easily cause an error at the...
Dimensional emotion representation such as valence and arousal (VA) space has been an emerging way to represent emotions. In this representation, emotion words can be projected to the VA space according to their valence and arousal values. Sentence and document-level emotions can then be projected based on the emotion words within them. However, emotion expressions in sentences and documents usually...
The written-in errors in bit patterned media (BPM) recording systems cause the erroneous bits during the writing process, leading to performance degradation. In previous works, the BPM read/write channel is modeled as a binary symmetric channel (BSC) with an additive white Gaussian noise (AWGN) channel or the cascaded BSC-AWGN channels. The initial channel information including the written-in errors...
This paper presents a single-camera HDR imaging algorithm, which is able to solve all the extreme conditions by controlling the exposure time to get a well-exposed image and then enhancing it. The proposed system is divided into two parts, i.e. single-image HDR imaging technique and an exposure control method. The single-image HDR imaging technique integrates two methods, i.e. Dynamic Local Contrast...
We propose a novel contrast enhancement method for dark images using the value gap expansion force (VGEF) and the sorted histogram equalization. Based on the observation that the inter-pixel relationship is analogous to the electrostatic force, we define the pixel field spread around each pixel and the pixel mass at each pixel position. We compute the VGEF exerted to a pixel by multiplying the pixel...
The conventional broadband active noise control (ANC) system may not be able to function well when the target noise contains both wideband and narrowband components simultaneously. Feed-forward hybrid active control has attracted quite a lot attention recently, as it can effectively suppress both components at the same time. Such hybrid ANC system consists of three tightly integrated subsystems, namely,...
It is still not fully revealed how people convey and understand whispered speech without critical deterioration of intelligibility. Typical questions are how the height of sound is produced and how voiced consonants are produced without vibration of vocal folds. In this paper, we have investigated the deterioration of intelligibility of whispered speech and tried to reveal the mechanism to convey...
In this paper, we propose a process of 3D object reconstruction using a pair of Kinect cameras. After we refine raw depth images from two Kinect cameras using a joint bilateral filter, we find intrinsic and extrinsic parameters by camera calibration. Then, we apply 3D warping to obtain a point cloud model in the 3D space and acquire a smooth surface model of the 3D object. In order to accelerate a...
In High Efficiency Video Coding (HEVC), the coding efficiency of infra-frames is lower than inter-frames, which will cause the flicker artifact and perceptual fluctuation among CTUs in low bitrates applications. Therefore, this paper proposes a region-based infra-frame rate-control scheme to improve the objective quality and to reduce PSNR fluctuation among CTUs. Firstly, the CTUs in infra-frame are...
This paper introduces a novel vision-based approach for realistic interaction between user and display's content. An extremely accurate motion capture system is proposed to measure and track the user's head motion in 3D space. Video frames captured by the low-cost head-mounted camera are processed to retrieve the 3D motion parameters. The retrieved information facilitates the real-time 3D interaction...
Although refractory epileptic patients suffer from uncontrolled seizures, their quality of life (QoL) may be improved if the seizure can be predicted in advance. On the hypothesis that the excessive neuronal activity of epilepsy affects the autonomie nervous system and the fluctuation of the R-R interval (RRI) of an electrocardiogram (ECG), called heart rate variability (HRV), reflects the autonomie...
Anchorperson segment detection enables efficient video content indexing for information retrieval. Anchorperson detection based on audio analysis has gained popularity due to lower computational complexity and satisfactory performance. This paper presents a robust framework using a hybrid I-vector and deep neural network (DNN) system to perform anchorperson detection based on audio streams of video...
In this work, we present a novel fountain code-based hybrid storage system that combines cloud storage with P2P storage. In the cloud storage system, all the data are consistently stored at server clusters but the data may be exposed to others accessing server clusters. On the other hand, in the P2P storage system, the data are distributed to a group of participating peers but the data retrieval may...
In this paper, we present an algorithm to infer foreground segmentation from given a sequence of images. In our system, we can capture the interested object on a planar background with a handheld camera. There are two main assumptions are mentioned: 1) the region of interest appears entirely in all images; 2) the background pixels have a similar plane projective transformation, i.e., the foreground...
Monitoring health conditions and events of grandparent-headed family is important to increase their quality of life and reduce care burdens. Affective episodes are significant indexes in monitoring behavior changes. In this paper, we propose an information retrieval approach to extract affect words from speech and written text to provide quantitative evidence of physical functions and social interactivity...
Depth map is a kind of video clip that contains 3D object's depth information, and is an important coding feature in the recently 3D video coding standards, which has been applied for the latest 3D coding approaches, e.g. MV-HEVC and 3D-HEVC. It has been approved that the support of depth map coding can significantly improve the coding performance for 3D videos, and provide more flexibility for 3D...
A method to adjust the mean-squared-errors (MSE) value for coded video quality assessment is investigated in this work by incorporating subjective human visual experience. First, we propose a linear model between the mean opinioin score (MOS) and a logarithmic function of the MSE value of coded video under a range of coding rates. This model is validated by experimental data. With further simplification,...
In this paper, we propose a feature-based approach to address the challenging task of recognising overlapping sound events from single channel audio. Our approach is based on our previous work on Local Spectrogram Features (LSFs), where we combined a local spectral representation of the spectrogram with the Generalised Hough Transform (GHT) voting system for recognition. Here we propose to take the...
We present the design of approximated discrete cosine transform (DCT) with large size and its performance. The approximated DCT is designed using Wang's matrix factorization algorithms with fixed-point parameters and the size greater than 2k, k > 4 is focused. So far, the specific parameters and performance of approximated DCT with large size have not been presented. In addition, we apply the sign...
This paper attempts to provide some insights about the relationship between the differentiability and the classification importance of consonants in Chinese speech communication. The two characteristics can be modelled by the perceptual distance and the functional load respectively. We have a clustering analysis of Chinese consonants based on functional load (FL) relied on mutual information (MI)...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.