The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper describes experimental evaluations of encoding parameters that are appropriate for MPEG-4 Audio Lossless Coding (ALS) to compress high-resolution audio. MPEG-4 ALS Simple Profile defines the values of encoding parameters, such as the maximum sampling frequency and quantization bit depth, for making it easier to implement in the receiving applications. However, ALS Simple Profile does not...
This paper presents a new segmentation-based compression scheme for 3D dynamic models. The segmentation process is preformed based on the heat diffusion properties, while exploiting temporal and spatial dependencies of the geometry component. 3D affine transforms are used to describe the motion clusters. Weighting of these affine transforms allows to faithfully determine the vertex motions in each...
Recently sparse coding have been highly successful in image classification mainly due to its capability of incorporating the sparsity of image representation. In this paper, we propose an improved sparse coding model based on linear spatial pyramid matching(SPM) and Scale Invariant Feature Transform (SIFT) descriptors. The novelty is the simultaneous non-convex and non-negative characters added to...
Common e-commerce websites rely heavily on JPEG images for product presentation. In this paper we present a new coding scheme and file format that is tailored to the presentation of single-color products. A JPEG image file can be transcoded into this new format leading to substantial reduction in file size (Average of 28%) with practically no quality degradation. We describe how we can take advantage...
Very high-resolution video systems, such as 4K (4096×2160), enable a very close viewing distance that is almost the same as the picture height. This technology enables high-reality systems in homes to be realized. However, the very short distance causes significantly different views across the picture, and uniform processing for the picture may not always be the best choice. Here, we calculate the...
This paper presents a novel cloud model which is designed for the inter prediction coding using virtual frame technique. The virtual frame obtained by typical dynamic texture synthesis methods can have a better prediction result in some regions of the encoding frame than the common reference frames, because the non-linear motion and global illumination change between frames is taken into account....
In this study, compressed sensing concepts are applied to multi-view video coding. Existing work from single view video is utilized to develop efficient GOP patterns and reference framing for stereo coding. It has been observed that the most typical choice of pattern improved the characteristics 0.4 dB with respect to the model that do not benefit from interview sparsity for all frames. Alternatives...
We present a video compression scheme using epitome based texture coding that uses a low quality video as side information and improves it by using the epitome. The side information is sent at different levels of quantization and resolution to optimize the quality against the bit rate. The concept of motion threading is used for propagation of epitome information from one frame to another. The proposed...
A rate control algorithm for hierarchical B-pictures in Scalable Video Coding (SVC) is proposed in this work. The complex inter-frame dependency issue is effectively addressed by the Q-distance policy decision rule while the statistical smoothing effect enables the GOP-based precise bit rate control. The simplicity of the decision processes greatly reduces the encoder complexity providing an efficient...
Intra coding in the current H.264/AVC video coding standard achieves high compression efficiency, in part due to the highly effective intra prediction process that exploits spatial directional correlation. However, intra prediction of chroma components in YUV 4:2:0 videos uses a limited set of possible predictions available for coding of luma components. Furthermore, coding of chroma components proceeds...
The image patches learned by recent works are usually only bar-like or Gabor-like patterns. However those simple patterns are not meaningful enough to capture higher level information. In this study, we try to learn more complex image patterns from unaligned images. We propose Scale And Shift Invariant Sparse Coding (SASISC), which aligns basis patches at proper locations and scales to reconstruct...
Multiple Description Coding (MDC) is one of the most efficient methods to combat error-prone channels especially when retransmission is unacceptable. In applications involving scalable, multicast and P2P environments, it is advantageous to use more than two descriptions. In this paper, we present an adaptable temporal-spatial error concealment method based on error tracking to improve the performance...
In this paper, we propose a novel scheme of coupled distributed arithmetic coding to overcome the de-synchronization problem caused by causal decoding in existing distributed arithmetic coding system. Simulation results show that decoding performance is significantly improved and longer sequences outperform shorter sequences using this approach.
In this paper, we propose a fast mode decision scheme for H.264 video coding to address the requirements of both low complexity and error resilience in realtime video communications. Traditional fast mode decision schemes are usually designed based on feature analysis on the source videos. However, the rate-distortion behaviors of the coding modes change when channel errors are involved. Therefore,...
Intra-prediction improves coding performance by reducing inter-pixel redundancy. However, to accommodate the use of block transforms, not all pixels can be predicted from reconstructed pixels that are located close to themselves. This causes prediction performance to suffer as pixel values further apart are less correlated. This paper presents additional intra-prediction modes designed with the goal...
Multiview video coding systems are characterized by their high encoding complexity. In this paper, we propose to reduce the encoding complexity by omitting frames at the encoder in a pattern that keeps the ability to reconstruct an interpolated version of each omitted frame, by motion-compensated and inter-view interpolation. Since our goal is maintaining good visual quality, we propose a method to...
H.264/AVC adopts intra coding technique to reduce spatial redundancies. In order to achieve better coding performance, there are nine candidate modes for 4×4 block unit to be selected by Rate-Distortion Optimization(RDO) process in the H.264/AVC encoder-side. However, too many intra candidate modes undoubtedly increase the transmitted bits that represent them. To solve this problem, a novel method...
We present a minimum message length (MML) framework for trajectory partitioning by point selection, and use it to automatically select the tolerance parameter є for Douglas-Peucker partitioning, adapting to local trajectory complexity. By examining a range of є for synthetic and real trajectories, it is easy to see that the best є does vary by trajectory, and that the MML encoding makes sensible choices...
Our objective is to learn invariant color features directly from data via unsupervised learning. In this paper, we introduce a method to regularize restricted Boltzmann machines during training to obtain features that are sparse and topographically organized. Upon analysis, the features learned are Gabor-like and demonstrate a coding of orientation, spatial position, frequency and color that vary...
Slice group coding using Flexible Macroblock Ordering (FMO) based on Macroblock (MB) importance in H.264/AVC has been studied for providing higher error robustness in error prone environments. However, the efficiency still needs to be improved and the high computational and time cost it causes remains to be an issue. In this paper, an Adaptive Macroblock-to-slice Allocation map (MBAmap) Updating scheme...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.