The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Image search engines commonly employ the Bag Of Features (BOF) method to represent each database image with a feature vector and retrieve the best candidate using a measure of similarity to a query image vector. The BOF vector, which specifies the occurrence frequency of features, is used with Soft Assignment (SA) to find the most similar candidates which are further analyzed using geometric information...
This paper deals with automatic estimation of the horizon in videos from fixed surveillance cameras. The proposed algorithm is fully automatic in the sense that no user input is needed per-camera and it works with various scenes (indoor, outdoor, traffic, pedestrian, livestock, etc.). The algorithm detects moving objects, tracks them in time, assesses some of their geometric properties related to...
In order to achieve unmanned monitoring in the smart substation, the video monitoring technology has been introduced. However, it is still lacking of real-time image processing in its actual use. In this paper, a system based on the image tracking technology for fault monitoring of insulators in the substation is developed. In this system, an unmanned aerial vehicle (UAV) moving in a special trace...
The core feature of any intelligent transportation system (ITS) is the vehicle measurement technique incorporated to measure road traffic volumes. Sensors or detectors placed along roads are used to acquire traffic flow data for further processing either manually or through fully automated intelligent systems. Although several different technologies exist, no single vehicle count method is absolutely...
Due to variations in pose and illumination condition, the appearance of can be significantly different in different cameras and the performance of person re-identification is degraded. In this paper, a person re-identification based on multi-level and multi-feature fusion for this phenomenon is proposed. Firstly, we divided each sample into three parts and multi-layer sampling. Secondly, we extracted...
The popularity of applications using Augmented Reality, especially due to the dissemination of smartphones with high processing power, introduces the need for Fiducial Markers that can be detected quickly, with good accuracy and can deal with partial occlusion. Fiducial Markers can have different shapes, sizes, structure and colors, and are inserted into a scene to facilitate the detection and consequent...
Obstacle avoidance and navigation are major problems for blind people. They require help to pass safely. This paper presents an obstacle avoidance system for blind people using Kinect depth camera. This assistive technology recognizes the medium in front of the user using Kinect depth camera. The system receives the depth images from the Kinect camera and processes it using a windowing-based mean...
In recent years, there have been a variety of proposals for high quality 3D reconstruction by fusion of depth and normal maps that contain good low and high frequency information respectively. Typically, these methods create an initial mesh representation of the complete object or scene being scanned. Subsequently, normal estimates are assigned to each mesh vertex and a mesh-normal fusion step is...
We present a novel deep learning framework that models the scene dependent image processing inside cameras. Often called as the radiometric calibration, the process of recovering RAWimages from processed images (JPEG format in the sRGB color space) is essential for many computer vision tasks that rely on physically accurate radiance values. All previous works rely on the deterministic imaging model...
Person re-identification via cross-camera is a difficult problem in the field of target discovery and tracking. Traditional solutions depending on the characteristics of a target's appearance have low reliability and can easily lead to low matching rate because they use simple metric functions. This work proposes a more reliable measurement: correlative degree of a target's features among different...
In action recognition, local motion descriptors contribute to effectively representing video sequences where target actions appear in localized spatio-temporal regions. For robust recognition, those fundamental descriptors are required to be invariant against horizontal (mirror) flipping in video frames which frequently occurs due to changes of camera viewpoints and action directions, deteriorating...
A method used for estimating the distance between objects in photographic images is the depth from focus (DFF) method. The method has reportedly been implemented many times using only image-side telecentric optical systems because a change in image magnification becomes a problem due to differences in focus positions. Therefore, there have been no examples of this method having been implemented using...
During conflicts such as climatic catastrophe, war etc. in various countries, including underdeveloped countries, there has been a lack of experienced medical personnel leading to a significant rise in the death toll. To overcome this crisis, a real time automatic injecting device (AID) with robust vein detection is presented in this paper. This device is capable of automatically injecting medicines...
This paper addresses the problem of estimating the alignment pose between two models by using structure-specific local descriptors which are generated by combining 2D image data and 3D contextual shape data. The 2D texture information is represented by a robust SIFT descriptor, and the geometric information is represented by a histogram supported by the orders of curvature and angles between normal...
We present a novel approach to 6DOF pose estimation and segmentation of rigid 3D objects using a single monocular RGB camera based on temporally consistent, local color histograms. We show that this approach outperforms previous methods in cases of cluttered backgrounds, heterogenous objects, and occlusions. The proposed histograms can be used as statistical object descriptors within a template matching...
In this paper we propose a fast method for detecting the ground plane in 3D scenes for an arbitrary roll angle rotation of a stereo vision camera. The method is based on the analysis of the disparity map and its “V-disparity” representation. First, the roll angle of the camera is identified from the disparity map. Then, the image is rotated to a zero-roll angle position and the ground plane is detected...
In this paper, we propose to fuse the geometric information of a 3D Lidar and a monocular camera to detect the urban road region ahead of an autonomous vehicle. Our method takes advantage of both the high definition of 3D Lidar data and the continuity of road in image representation. First, we obtain an efficient representation of Lidar data, an organized 2D inverse depth map, by projecting the spatially...
Object detection and classification are two very important tasks for quality control in industrial process. The first step of a quality algorithm is the detection of the moving object. Afterwards, the detected object is classified according to its size and color related properties. In this study, an interactive image segmentation method is proposed to detect a moving object. The segmentation method...
The present paper describes a dual approach using pseudo-3D vision for Machine-Awareness in indoor environment. Provided by color and depth cameras of the a Kinect system, the aforementioned duality presents an appealing solution for robots' 3D-vision. Placing the human-robot and in a more general way the human-machine interactions as a key outcome of the expected visual Machine-Awareness, the proposed...
The communication barrier of deaf and dumb community with the society still remains a matter of concern due to lack of perfect sign language translators. Usage of mobile phones for communication remains a dream for deaf and dumb community. We propose an android application that converts sign language to natural language and enable deaf and dumb community to talk over mobile phones. Developing Sign...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.