The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Augmented Reality (AR) has become a popular technology for both the public and industries. Serious Games (SG) incorporating pedagogical elements for learning, training, or inform. The creation of SG can use AR techniques for their development, with the prospect of new possibilities due to this integration. A Systematic Literature Mapping (SLM) allows a synthesized view of research carried out in the...
Affordance learning in general, is to identify the purpose, use, and ways to interact with an object, based on information gained from observing the object. Most of the existing affordance learning approaches assume the object target has been cropped individually from images. However, the object could not be easily separated from others due to occlusion or noise. Actually, two or more neighboring...
In this paper, problem of identifying and tracking the power port of remote radio unit (RRU) is addressed. The testing of RRU requires the inspection robot to insert the probes into its power and network ports. In order to solve this problem, an experimental setup of visual servoing with 6 degrees of freedom (DoF) manipulator has been established. The initial problem of recognizing and tracking the...
Computer Science students usually carry practical activities for the identification of software requirements and for understanding the organization business rules. Within this context, during the last two years we have conducted a project with software industry and Computer Science students, using comic strips to support the software requirements specification. We created a method of scenario simulation...
Salient object detection aims to correctly highlight the most salient object(s) in an image. Combining fine-grained contrast prior with rough-grained object consistency, this paper proposes a Focusness Guided Salient object detection (FGS) algorithm. To obtain clean and precise contrast map, FGS uses the focusness prior to guide the contrast map. Combing different saliency priors, FGS utilizes a unified...
Loop closure detection is an important part of visual simultaneous location and mapping (SLAM) system. Most of traditional loop closure detection approaches using hand-crafted features often lack robustness with respect to object occlusions and illumination changes, especially for the complicated indoor environment. Recently, convolutional neural network (CNN) makes a huge impact on many computer...
Nowadays, people have to face many challenges In order to become familiar with all the functionalities of electronical devices. The training process becomes thus essential, especially when we are referring to consumer electronics created for people with disabilities. Our paper presents the importance of training visually impaired people for an advanced sensory substitution device. By using the advantages...
This paper describes an approach for analyzing the project team members' expectations to achieve the personal goals as well as the project objectives. The article describes four types of expectations and suggests the expectation map as an analytical tool. The paper introduces the important patterns and the process of expectation map analysis.
Detecting potential aerial threats like drones with computer vision is at the paramount of interest for the protection of critical locations. This type of a system should prevent efficiently the false alarms caused by non-malign objects such as birds, which intrude the image plane. In this paper, we propose an improved version of a previously presented Speeded-up Robust Feature Transform (SURF) based...
With the tremendous advances made by Convolutional Neural Networks (ConvNets) on object recognition, we can now easily obtain adequately reliable machine-labeled annotations easily from predictions by off-the-shelf ConvNets. In this work, we present an abstraction memory based framework for few-shot learning, building upon machine-labeled image annotations. Our method takes large-scale machine-annotated...
Given a convolutional neural network (CNN) that is pre-trained for object classification, this paper proposes to use active question-answering to semanticize neural patterns in conv-layers of the CNN and mine part concepts. For each part concept, we mine neural patterns in the pre-trained CNN, which are related to the target part, and use these patterns to construct an And-Or graph (AOG) to represent...
Referring expressions are natural language constructions used to identify particular objects within a scene. In this paper, we propose a unified framework for the tasks of referring expression comprehension and generation. Our model is composed of three modules: speaker, listener, and reinforcer. The speaker generates referring expressions, the listener comprehends referring expressions, and the reinforcer...
This paper addresses three issues in integrating part-based representations into convolutional neural networks (CNNs) for object recognition. First, most part-based models rely on a few pre-specified object parts. However, the optimal object parts for recognition often vary from category to category. Second, acquiring training data with part-level annotation is labor-intensive. Third, modeling spatial...
Image captioning often requires a large set of training image-sentence pairs. In practice, however, acquiring sufficient training pairs is always expensive, making the recent captioning models limited in their ability to describe objects outside of training corpora (i.e., novel objects). In this paper, we present Long Short-Term Memory with Copying Mechanism (LSTM-C) — a new architecture...
It is well-established by cognitive neuroscience that human perception of objects constitutes a complex process, where object appearance information is combined with evidence about the so-called object affordances, namely the types of actions that humans typically perform when interacting with them. This fact has recently motivated the sensorimotor approach to the challenging task of automatic object...
The deployments of deep neural network models on mobile or embedded devices have been challenged due to two main reasons: 1) the large model size for storage, and 2) the large memory bandwidth for inference. To address these issues, this paper develops a deep neural network compression framework to reduce the resource usage for efficient visual inference. By reviewing the trained deep model, we propose...
Robotic graspable object recognition is a crucial ingredient in many exciting autonomous manipulation applications. However, identifying complex image features from limited data remains largely unsolved. In this paper, we leverage the advantages of two kinds of feature representation approaches, kernel descriptors and deep neural networks, to present a novel hierarchical feature learning framework...
Reliable occluded skeletal posture estimation is a fundamentally challenging problem for vision-based monitoring techniques. This is due to several imaging related challenges introduced by existing depth-based pose estimation techniques that fail to provide accurate joint position estimates when the line of sight between the imaging device and the patient is obscured by an occluding material. In this...
Behaviors of an object-oriented system can be visualized as reverse-engineered sequence diagrams from execution traces. This approach is a valuable tool for program comprehension tasks. However, owing to the massiveness of information contained in an execution trace, a reverse-engineered sequence diagram is often afflicted by a scalability issue. To address this issue, we present in this paper a method...
Humans can recognise objects under partial occlusion. Machine-based approaches cannot reliably recognise objects and scenes in the presence of occlusion. This paper investigates the use of the elastic net hierarchical MAX (En-HMAX) model to handle occlusions. Our experiments show that the En-HMAX model achieves an accuracy of ∼70%, when ∼50% artificial occlusions are applied to the centre of the visual...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.