Neural networks are powerful function approximators, able to model any continuous function with arbitrary precision. Given input data and target outputs, they can reconstruct an arbitrary black-box function. However, there are problems where the target is at least partially unknown. In such cases it is impossible for a traditional neural network to compute the gradient...
This paper introduces SmartCare, a project revolving around a smart environment built specifically to enable aging in place. The paper describes the vision behind SmartCare as well as its translation into a deployed system. The physical incarnation of SmartCare is the SmartCare apartment, an actual apartment in a retirement community. We provide a description of the technologies that are deployed in the...
Heuristic search is considered state-of-the-art for classical planning. However, the performance of search heuristics varies significantly from problem to problem and no single heuristic is superior to all others. As a result, it is highly desirable to identify and utilize the best available heuristic for a particular planning problem. This paper presents a novel approach for planning that monitors...
A major challenge in multiagent reinforcement learning is scale: increasing the number of agents in a system dramatically increases both the cost of representing the problem and the cost of computing a solution. In single-agent systems, temporal abstractions in the form of options have been used to address part of the scaling...
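The temporal abstractions mentioned above follow the standard options framework, in which an option is a triple of an initiation set, an intra-option policy, and a termination condition, executed as a single temporally extended action. The sketch below is a generic illustration of that framework, not the method of the paper; the corridor environment and all names (`Option`, `go_right`, `run_option`) are made up for illustration.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Option:
    can_start: Callable[[int], bool]   # initiation set I: where the option may begin
    policy: Callable[[int], int]       # intra-option policy pi(s): action to take
    terminates: Callable[[int], bool]  # termination condition beta(s)

def step(state: int, action: int) -> int:
    # 1-D corridor of 6 cells; the action moves +1/-1, clipped at the walls
    return max(0, min(5, state + action))

# An option that walks to the rightmost cell, whatever the start state
go_right = Option(
    can_start=lambda s: s < 5,
    policy=lambda s: +1,
    terminates=lambda s: s == 5,
)

def run_option(state: int, option: Option):
    # Execute the option as one temporally extended action,
    # returning the final state and the number of primitive steps taken.
    assert option.can_start(state)
    steps = 0
    while not option.terminates(state):
        state = step(state, option.policy(state))
        steps += 1
    return state, steps

final, k = run_option(0, go_right)  # → (5, 5)
```

An agent that plans over `go_right` rather than over single `+1` moves has collapsed five decisions into one, which is exactly the representational saving temporal abstraction provides.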
With the availability of more data, classification is increasingly important. However, traditional classification algorithms do not scale well to large data sets and are often ill-suited to settings where only limited samples of the dataset are available at any point in time. The latter arises, for example, in streaming data, where accumulating the data a priori is infeasible due to limitations in memory...
The goal of the SmartCare project is to design, develop, and evaluate an intelligent sensor-driven living environment for the elderly. The core objectives are to provide emergency detection, improve quality of life, extend independence, and detect patterns of behavior that could suggest early signs of a physical or cognitive issue, all in an unobtrusive manner. This paper specifically focuses...
Over their lifetime, intelligent agents face multiple tasks that require simultaneous modeling and control of complex, initially unknown environments that are accessible only through incomplete and uncertain observations. In such scenarios, policy learning is subject to the curse of dimensionality, leading to scaling problems for traditional Reinforcement Learning (RL). To address this, the agent has to efficiently acquire...
Sparse coding is a powerful method for learning high-level features from raw input data. It can learn an overcomplete basis with the potential to capture robust and discriminative patterns within the data. However, like many other feature learning algorithms, it is unable to detect very similar features or stimuli on different input channels. In this paper, we propose a novel method to...
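The overcomplete-basis idea can be made concrete with the inference half of sparse coding alone: given a fixed dictionary with more atoms than signal dimensions, find a sparse coefficient vector that reconstructs the input. The sketch below uses ISTA (iterative soft-thresholding) to solve the lasso objective; it is a generic illustration under assumed sizes, not the paper's method, and the dictionary here is random rather than learned.

```python
import numpy as np

def ista(D, x, lam=0.05, n_iter=500):
    """Solve min_a 0.5*||x - D @ a||^2 + lam*||a||_1 by iterative soft-thresholding."""
    L = np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the smooth part
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        a = a - D.T @ (D @ a - x) / L      # gradient step on the quadratic term
        a = np.sign(a) * np.maximum(np.abs(a) - lam / L, 0.0)  # soft threshold
    return a

rng = np.random.default_rng(0)
D = rng.normal(size=(20, 50))
D /= np.linalg.norm(D, axis=0)             # unit-norm atoms; overcomplete: 50 atoms in R^20
a_true = np.zeros(50)
a_true[[3, 17, 41]] = [1.5, -2.0, 1.0]     # signal built from 3 atoms
x = D @ a_true
a_hat = ista(D, x)                         # sparse code recovering those atoms
```

The l1 penalty is what makes an overcomplete basis usable: without it, the underdetermined system `x = D @ a` has infinitely many solutions, and sparsity selects the few atoms that actually explain the input.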
In this paper, we propose a method to reduce the complexity of solving POMDPs in continuous state spaces by decomposing them into separate, coupled perceptual and decision processes, which reduces the state space size of the decision learning problem. In our method, we shrink the POMDP's state space by handling some aspects of the state outside of the decision POMDP. To achieve...
Communication is an important element of multiagent systems (MAS). In fully decentralized systems it is what allows the agents to coordinate their actions to achieve certain goals. When agents have no means to coordinate, they generally choose actions that minimize their chance of losses. If the agents are allowed to coordinate, on the other hand, they can choose actions that...
Markov Decision Processes (MDPs) and Partially Observable Markov Decision Processes (POMDPs) are powerful frameworks for modeling decision-making and decision-learning tasks across a wide range of problem domains. As a result, they are widely used in complex, real-world settings such as robot control. However, the modeling power and generality of these frameworks come at a cost: the complexity of...
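To make the MDP framework concrete, the sketch below solves a tiny, fully observable MDP by value iteration. The 3-state, 2-action model, its transition matrix `P`, and reward table `R` are all invented for illustration; real robot-control MDPs are vastly larger, which is exactly the complexity cost the abstract refers to (and POMDPs, which add observation uncertainty, are harder still).

```python
import numpy as np

# Hypothetical MDP: 3 states, 2 actions, discount factor gamma.
n_states, n_actions, gamma = 3, 2, 0.9
P = np.zeros((n_actions, n_states, n_states))   # P[a, s, s'] = transition prob.
P[0] = [[0.9, 0.1, 0.0], [0.0, 0.9, 0.1], [0.1, 0.0, 0.9]]
P[1] = [[0.1, 0.9, 0.0], [0.0, 0.1, 0.9], [0.9, 0.0, 0.1]]
R = np.array([[0.0, 1.0], [0.0, 1.0], [1.0, 0.0]])  # R[s, a] = immediate reward

# Value iteration: repeatedly apply the Bellman optimality operator.
V = np.zeros(n_states)
for _ in range(500):
    Q = R + gamma * np.einsum('ast,t->sa', P, V)    # Q[s, a]
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-8:            # converged to a fixed point
        break
    V = V_new
policy = Q.argmax(axis=1)                            # greedy policy w.r.t. Q
```

The state-value table `V` and the policy are arrays indexed by state; this tabular representation is precisely what stops scaling once the state space is continuous or partially observable, motivating the approximation methods these papers study.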
One of the challenges for artificial agents is managing the complexity of their environment and task domain as they learn increasingly difficult tasks. This is especially true of agents that are grounded in the physical world, which contains a vast number of features and potentially very complex dynamics. A scalable solution to this problem in terms of forming, managing, and re-using compact, grounded...
The objective of inverse reinforcement learning (IRL) is to learn an agent's reward function from either the agent's policy or observations of that policy. In this paper we address the use of inverse reinforcement learning to learn the reward function in a multiagent setting, where the agents can either cooperate or be strictly non-cooperative. The case of cooperating agents is...
Combining expert knowledge and user explanation with automated reasoning in domains with uncertain information poses significant challenges in terms of representation and reasoning mechanisms. In particular, reasoning structures that humans can understand and use are often different from those employed by automated reasoning and data mining systems. Rules with certainty factors represent one possible...
Complex control tasks involving varying or evolving system dynamics often pose a great challenge to mainstream reinforcement learning algorithms. Specifically, in most standard methods, actions are often assumed to be a concrete and fixed set that applies to the state space in a predefined manner. Consequently, without resorting to a substantial re-learning procedure, the derived policy lacks the...
A life-long learning agent must have the ability to learn new tasks, adapt the policies of already learned tasks, and extract and reuse knowledge from previous tasks for future use. To do the latter, it needs methods that can autonomously identify, categorize and generalize control and representational knowledge. This paper presents a novel approach to achieve this by combining the policy homomorphism...
The standard reinforcement learning framework often struggles in a varying or evolving environment due to an inherent limitation in its representation. In particular, the actions available for decision making are usually assumed to be a set fixed prior to the learning process. Consequently, the derived policy in general lacks the ability to adapt to possible variations in the action outcomes or the...
The human–computer interface remains a mostly visual environment with little or no haptic interaction. While haptics is making inroads in specialized areas such as surgery, gaming, and robotics, there has been little work to bring haptics to the computer desktop, which is largely dominated today by the GUI/mouse relationship. The mouse as an input device, however, poses many challenges for users...
The behavior and self-organization of ant colonies has been widely studied to address distributed clustering. However, most models that directly mimic ants produce too many clusters and converge too slowly. A wide range of research has attempted to address this through various means, but a number of sources of inefficiency remain, including: i) ants must physically move from one cluster to another...