The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper presents an efficient image exploration scheme for the unshaped object using semantic modelling. The local regions of an image have been classified with respect to the frequency of occurrences. The semantic concept is evaluated using RGB histogram dissimilarity factor, overall dissimilarity factor and regional dissimilarity factor. The dissimilarities determine the local concept with accuracy...
Content based image retrieval has become a major research interest recently. This paper presents an improved image similarity measure for image retrieval system. In the region based image comparison, two images are usually compared in terms of sum of the Euclidean distances among their regions. In this work, the image similarity measure is enhanced through a fuzzyfication of regions' importance and...
Fast and robust traffic sign recognition is very important but difficult for the safety driving assist systems. This study addresses the fast and robust traffic sign recognition to enhance safety driving. We first adopt the typical Hough transform methods to implement coarse-grained locating of the candidate regions (shapes of rectangle, triangle and circle, etc.) of the traffic signs; and then propose...
Recently, approaches utilizing spatial-temporal features have achieved great success in human action classification. However, they typically rely on bag-of-words (BoWs) model, and ignore the spatial and temporal structure information of visual words, bringing ambiguities among similar actions. In this paper, we present a novel approach called sequential BoWs for efficient human action classification...
In order to avoid the over-segmentation problem caused by original watershed transform and improve the segmentation precision of Mycobacterium Tuberculosis (MTB) images, a novel segmentation algorithm is proposed based on automatic-marker watershed transform. The automatic marker is accomplished by Gaussian weighted adaptive threshold segmentation and local minimum points search within gradient image...
Locally normalized Histogram of Oriented Gradient (HOG) algorithm originally proposed by Dalal & Triggs presents excellent results for pedestrian detection. However, as the demand of accuracy and speed in real-time application increase, the detection speed and robustness of this method is becoming insufficient. Over the years, improvements have been proposed by different researchers in order to...
Image matching is widely used in machine vision, computer technology, industry and agriculture and other fields. Matching the characteristics of traditional methods such as SIFT has advantages of compressed information quantity, high precision, but there are also some deficiencies such as the large of calculation, long time, the object to be measured of position should be required accurately and etc...
Text detection in compressed video has received much attention in recent years due to the effectiveness of DCT coefficients and motion vectors in realizing several applications. In this paper, a new text detection, which utilizes AC coefficients in the H.264/AVC compressed video, is proposed. The proposed median deviation of coefficients from a specific subband is first computed, then the k-means...
This paper presents a novel video copy detection system. The kernel of the approach is based on our proposed extended local descriptor WLD to three orthogonal planes (WLD-TOP). Indeed, in the aim to extract features vector, key-frames are generated and then a perceptual hash is performed using the WLD-TOP descriptor. The proposed method is applied on three databases and evaluated against several attacks...
This paper addresses the problem of human action detection/recognition by investigating interest points (IP) trajectory cues and by reducing undesirable small camera motion. We first detect speed up robust feature (SURF) to segment video into frame volume (FV) that contains small actions. This segmentation relies on IP trajectory tracking. Then, for each FV, we extract optical flow of every detected...
Raga plays an important role in Indian classical music. Raga is made up from the swara or note. According to characteristics of raga, Indian classical music is further divided into two systems Hindustani / North Indian classical music, Carnatic / South India classical music. This paper introduces us with some basic terms in Indian classical music and terms associated with raga. Then we discussed different...
The current trend of growth of information reveals that it is inevitable that large-scale learning problems become the norm. In this paper, we propose and analyze a novel Low-density Cut based tree Decomposition method for large-scale SVM problems, called LCD-SVM. The basic idea here is divide and conquer: use a decision tree to decompose the data space and train SVMs on the decomposed regions. Specifically,...
In this paper, we propose a normalized cone histogram features method to recognize human actions in video clips. The cone features are extracted based not on the center of gravity as is common, but on the head position of the extracted human region. Initially, the head, hands and legs positions are determined. Thereafter, the distances and orientations between the head and the hands and legs are the...
The paper presents Echo State Network (ESN) as classifier to diagnose the abnormalities in mammogram images. Abnormalities in mammograms can be of different types. An efficient system which can handle these abnormalities and draw correct diagnosis is vital. We experimented with wavelet and Local Energy based Shape Histogram (LESH) features combined with Echo State Network classifier. The suggested...
In this preliminary research we examine the suitability of hierarchical strategies of multi-class support vector machines for classification of induced pluripotent stem cell (iPSC) colony images. The iPSC technology gives incredible possibilities for safe and patient specific drug therapy without any ethical problems. However, growing of iPSCs is a sensitive process and abnormalities may occur during...
In recent years, large-scale image retrieval has been shown remarkable potential in real-life applications. To reduce retrieval time as searched database may contain thousands of images, Inverted Indexing is the basic technique, given images are represented by Bag-of-Words model. However, one major limitation of both standard Inverted Index and Bag-of-Words model is that they ignore spatial information...
This paper presents a GPU-based system for real-time traffic sign detection and recognition which can classify 48 different traffic signs included in the library. The proposed design implementation has three stages: pre-processing, feature extraction and classification. For high-speed processing, we propose a window-based histogram of gradient algorithm that is highly optimized for parallel processing...
As one tool for structuring a massive volume of archived news videos based on their semantic contents, this paper proposes a method to detect scene duplicates from news videos. A scene duplicate is a pair of video segments taken at the same event from different viewpoints. Referring to the audio channel is effective to detect scene duplicates regardless of viewpoints, but it cannot be relied on when...
Local spatio-temporal features with a Bag-of-visual words model is a popular approach used in human action recognition. Bag-of-features methods suffer from several challenges such as extracting appropriate appearance and motion features from videos, converting extracted features appropriate for classification and designing a suitable classification framework. In this paper we address the problem of...
The objective of this paper is to evaluate Bag-of-Colors (BoC) descriptor for land use classification. BoC can be used either as a global or local descriptor. In this paper we present and evaluate both approaches. We analyze the influence of different parameters on classification accuracy and introduce a modification of descriptor extraction process, which significantly influences the classification...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.