Search results for: Quan Liu

Items from 1 to 5 out of 5 results

chapter

ACIS: An Improved Actor-Critic Method for POMDPs with Internal State

Dan Xu, Quan Liu

2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI) > 369 - 376

2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI)

Partially observable Markov decision processes (POMDPs) provide a rich mathematical model for sequential decision making in partially observable and stochastic environments. Model-free methods use the internal state as a substitute for the belief state, which is a sufficient statistic of all past action-observation history in model-based techniques. A main drawback of previous model-free techniques,...

article

Experience replay for least-squares policy iteration

Quan Liu, Xin Zhou, Fei Zhu, Qiming Fu, more

IEEE/CAA Journal of Automatica Sinica > 2014 > 1 > 3 > 274 - 281

Policy iteration, which evaluates and improves the control policy iteratively, is a reinforcement learning method. Policy evaluation with the least-squares method can draw more useful information from the empirical data and therefore improve the data validity. However, most existing online least-squares policy iteration methods use each sample only once, resulting in the low utilization rate...

chapter

Stabilization of Networked Control Systems with Data Rate Constraints

Qing-Quan Liu, Fang Jin

2010 International Conference on E-Product E-Service and E-Entertainment > 1 - 4

2010 International Conference on E-Product E-Service and E-Entertainment (ICEEE 2010)

This paper investigates the stabilization problem for networked control systems (NCSs) with limited data rates over an additive white Gaussian noise (AWGN) channel. The notion of control with limited data rates means specifying the lower bound of data rates, above which there exists a coding and control scheme for stabilization of linear time-invariant systems. Unlike the existing literature, the...

chapter

A hierarchical reinforcement learning algorithm based on heuristic reward function

Qicui Yan, Quan Liu, Daojing Hu

2010 2nd International Conference on Advanced Computer Control > 3 > 371 - 376

2010 2nd International Conference on Advanced Computer Control (ICACC 2010)

A hierarchical reinforcement learning method based on a heuristic reward function is proposed to address two problems: the “curse of dimensionality”, in which the state space grows exponentially with the number of features, and slow convergence. The method can greatly reduce the state space and speed up learning. Choose actions with favorable purpose and efficiency so as to optimize...

chapter

A Particle Swarm Optimization Based on Improved Multi-Swarm and Analysis

Minghua Li, Quan Liu, Wangshu Yao, Ming Chen

2008 3rd International Conference on Innovative Computing Information and Control > 30

2008 3rd International Conference on Innovative Computing Information and Control (ICICIC)

Based on the differentially perturbed velocity particle swarm optimization, an improved multi-swarm particle swarm optimization (MSPSO) is presented to address slow convergence and loss of diversity. The algorithm makes the number of populations search at the same time in the same

INFONA - science communication portal
