Search results

chapter

Decentralized reinforcement social learning based on cooperative policy exploration in multi-agent systems

Chi Wang, Xin Chen

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1575 - 1580

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Coordination problems including miscoordination and relative overgeneralization are difficult to overcome especially in dynamic and stochastic environments. In the practical scenario, there may be a large number of agents, and the interactions between agents may be sparse and unfixed. In this paper, we study the coordination problems and stochastic rewards under the social learning framework where...

chapter

Evolutionary game theoretic approach for optimal resource allocation in multi-agent systems

Changhao Sun, Xiaochu Wang, Jiaxin Liu

2017 Chinese Automation Congress (CAC) > 5588 - 5592

2017 Chinese Automation Congress (CAC)

For task completion in distributed environments, a set of resources is required and a group of agents must cooperate in deciding the share each should provide to maximize the system performance. We address the problem from an evolutionary game-theoretic perspective and present a fully distributed algorithm based on local replicator dynamics. By using the optimality condition, we prove the convergence...

chapter

Distributed and Adaptive Routing Based on Game Theory

Baptiste Jonglez, Bruno Gaujal

2017 29th International Teletraffic Congress (ITC 29) > 1 > 1 - 9

2017 29th International Teletraffic Congress (ITC 29)

In this paper, we present a new adaptive multiflow routing algorithm to select end-to-end paths in packetswitched networks. This algorithm provides provable optimality guarantees in the following game theoretic sense: The network configuration converges to a configuration arbitrarily close to a pure Nash equilibrium. In this context, a Nash equilibrium is a configuration in which no flow can improve...

chapter

Using single Conspiracy Number to analyze game progress patterns

Zhang Song, Hiroyuki Iida

2017 International Conference on Computer, Information and Telecommunication Systems (CITS) > 219 - 222

2017 International Conference on Computer, Information and Telecommunication Systems (CITS)

Conspiracy Number Search (CNS) is a MIN/MAX tree search algorithm, trying to guarantee the accuracy of the MIN/MAX value of a root node. It suffers from a low efficiency because of its slow convergence and a big cost of computing conspiracy numbers. However, the conspiracy number is still a promising concept for measuring the “stability”, which can be used to analyze game progress patterns. In this...

chapter

Embedding symbol algorithm for fast hit rate convergence in slot machine games

Yen-Han Chen, Shin-Hung Chang, Guan-Yun Wang

2017 2nd International Conference on Computer and Communication Systems (ICCCS) > 11 - 15

2017 2nd International Conference on Computer and Communication Systems (ICCCS)

Slot machines are the most popular facility in casinos worldwide. With the advancement of computer technology, the operating reel spinning of the current slot machine is presented by computer software emulation instead of rotating mechanical iron reels. The reel strip table of a slot machine has many special pictures embedded for different attractive themes. Each slot machine achieves a hit rate based...

chapter

Utility-Based resource allocation for underlay D2D networks

Susan Dominic, Lillykutty Jacob

2017 IEEE Region 10 Symposium (TENSYMP) > 1 - 5

2017 IEEE Region 10 Symposium (TENSYMP)

This paper investigates distributed resource allocation in next-generation underlay Device-to-Device (D2D) networks. The joint channel and power allocation for a D2D network underlaying a cellular network is formulated as a non-cooperative game. A utility-based learning algorithm which does not require information exchange between device pairs is proposed to determine the channel index and power level...

chapter

Distributed algorithm for generalized Nash equilibria seeking of network aggregative game

Guanpu Chen, Xianlin Zeng, Peng Yi, Yiguang Hong

2017 36th Chinese Control Conference (CCC) > 11319 - 11324

2017 36th Chinese Control Conference (CCC)

This paper concentrates on seeking the generalized Nash equilibria of network aggregative games by using a distributed continuous-time algorithm. By considering the variational inequality related to the problem, we design a distributed algorithm seeking the variational equilibria, which are practically an essential part of generalized Nash equilibrium points. Then the novel distributed projected continuous-time...

chapter

Multi-leader multi-follower game-based ADMM for big data processing

Zijie Zheng, Lingyang Song, Zhu Han, Geoffrey Ye Li, more

2017 IEEE 18th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC) > 1 - 5

2017 IEEE 18th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC)

Alternating direction method of multipliers (ADMM) is a promising approach to solve “big data” problems due to its efficient variable decomposition and fast convergence. However, it is subject to the following two fundamental assumptions: no contradiction among multiple controllers' objectives and ideal feedback from the agents to the controllers. In this paper, a multiple-leader multiple-follower...

chapter

Distributed nash equilibrium seeking in multi-agent games with partially coupled payoff functions

Maojiao Ye, Guoqiang Hu

2017 13th IEEE International Conference on Control & Automation (ICCA) > 265 - 270

2017 13th IEEE International Conference on Control & Automation (ICCA)

In this paper, distributed Nash equilibrium seeking for multi-agent games, particularly for games where the players' payoff functions are partially coupled, is investigated. To model the (partial, explicit) dependence of the players' payoff functions on the players' actions, an interference graph is introduced. Besides, the players are supposed to be equipped with a communication graph to achieve...

chapter

A distributed method for simultaneous social cost minimization and nash equilibrium seeking in multi-agent games

Maojiao Ye, Guoqiang Hu

2017 13th IEEE International Conference on Control & Automation (ICCA) > 799 - 804

2017 13th IEEE International Conference on Control & Automation (ICCA)

In this paper, a fully distributed strategy is proposed to solve the N-coalition multi-agent games. The agents in the considered N-coalition multi-agent games are supposed to have limited access to the other players' actions. Consensus protocols, including a leader-following consensus protocol and a dynamic average consensus protocol, are leveraged to search for the Nash equilibrium of the N-coalition...

chapter

Distributed Nash equilibrium seeking of a class of aggregative games

Shu Liang, Peng Yi, Yiguang Hong

2017 13th IEEE International Conference on Control & Automation (ICCA) > 58 - 63

2017 13th IEEE International Conference on Control & Automation (ICCA)

In this paper, we investigate a distributed Nash equilibrium seeking problem for a class of aggregative games that the strategic interaction is characterized by a sum of nonlinear mapping of heterogeneous local decisions. We consider non-quadratic local cost functions and constrained strategy sets. We propose a novel continuous-time distributed algorithm for equilibrium seeking based on dynamic average...

chapter

Analysis of naming game based collective behavior with biased assimilation over adaptive networks

Guiyuan Fu, Weidong Zhang

2017 36th Chinese Control Conference (CCC) > 1486 - 1490

2017 36th Chinese Control Conference (CCC)

The dynamics of two-word naming game incorporating the influence of biased assimilation is investigated in this paper. Firstly an extended naming game with biased assimilation (NGBA) is proposed. The hearer in NGBA accepts the received information in a biased manner, where he will refuse to accept the conveyed word with a predefined probability, if it is different from his own current memory. Secondly,...

chapter

Controlling the motion of a group of unmanned flight vehicles in a perturbed environment based on the rules

M. V. Khachumov

2017 International Siberian Conference on Control and Communications (SIBCON) > 1 - 5

2017 International Siberian Conference on Control and Communications (SIBCON)

Controlling the motion of a group of unmanned flight vehicles (FVs) in a perturbed environment is considered on the example of two similar problems: tracking of the dynamic target and moving along the given path. The problem of the target tracking implies that a randomly-arranged FV group approaches close to the target and flies near it during a specified time period. The low-velocity target seeks...

chapter

Building Endgame Data set to Improve Opponent Modeling Approach

Zhang Jiajia, Liu Hong

2017 IEEE Second International Conference on Data Science in Cyberspace (DSC) > 255 - 260

2017 IEEE Second International Conference on Data Science in Cyberspace (DSC)

Opponent modeling is an essential approach for building competitive computer agents in imperfect information games. This paper presents a novel approach to accelerate the convergence process in opponent modeling. The approach applies neural network (ANN) to abstract and build an endgame data set of imperfect information game. Based on a labeled database of author's previous work, several parameters...

chapter

An analysis of the stability of evolution game between P2P platforms and regulators

Huang Jiamin, Liu Qi

2017 International Conference on Service Systems and Service Management > 1 - 5

2017 14th International Conference on Service Systems and Service Management (ICSSSM)

Under the background of the rapid development and the continuous exposure of risks in peer to peer lending platforms industry in China, this paper attempts to construct an evolutionary game model between the peer to peer lending platforms and the regulators, to analyze the evolution and stability of the two groups under different circumstances. The results show that the convergence state of peer to...

chapter

A multi-agent reinforcement learning algorithm based on Stackelberg game

Chi Cheng, Zhangqing Zhu, Bo Xin, Chunlin Chen

2017 6th Data Driven Control and Learning Systems (DDCLS) > 727 - 732

2017 IEEE 6th Data Driven Control and Learning Systems Conference (DDCLS)

Multi-agent reinforcement learning has been paid much attention due to its wide applications in various engineering systems. In this paper, the control problems of large-scale multi-agent systems with multiple roles are formulated into a multiplayer Stackelberg game, which provides a new perspective on cooperative issues. Then a Stackelberg Q-learning algorithm is proposed and knowledge transfer is...

chapter

Optimal distributed channel assignment in D2D networks using learning in noisy potential games

Mohd. Shabbir Ali, Pierre Coucheney, Marceau Coupechoux

2017 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) > 151 - 156

IEEE INFOCOM 2017 -IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)

We present a novel solution for Channel Assignment Problem (CAP) in Device-to-Device (D2D) wireless networks that takes into account the throughput estimation noise. CAP is known to be NP-hard in the literature and there is no practical optimal learning algorithm that takes into account the estimation noise. In this paper, we first formulate the CAP as a Stochastic Optimization Problem (SOP) to maximize...

chapter

Multicommodity games in public-cloud markets considering subadditive resource demands

G. Kesidis, N. Nasiriani, Y. Shan, B. Urgaonkar, more

2017 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) > 654 - 658

IEEE INFOCOM 2017 -IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)

In still developing, public cloud-computing markets, prices for virtual machine (VM) offerings fluctuate, and not just for spot/preemptible instances. Moreover, some (particularly derivative) providers allow for fine-grain initial resource provisioning and dynamic reprovisioning of VMs. In this preliminary study, we consider long-lived tenants of a public cloud under resource-based service-level agreements...

chapter

Bridging the gap between big data and game theory: A general hierarchical pricing framework

Zijie Zheng, Lingyang Song, Zhu Han

2017 IEEE International Conference on Communications (ICC) > 1 - 6

ICC 2017 - 2017 IEEE International Conference on Communications

In this paper, we propose a general pricing framework, helping the controller promote agents to achieve its objective, for a big data network with one controller and a large number of agents. The convergence of the framework is guaranteed for a general class of objective functions: a separable convex function for the controller and a convex function for each agent. Specially, the proposed framework...

chapter

Distributed power control vs power control game: A comparison study of performance in cognitive femtocell network

Anggun Fitrian Isnawati, Risanuri Hidayat, Selo Sulistyo, I Wayan Mustika

2017 International Conference on Applied System Innovation (ICASI) > 1841 - 1844

2017 International Conference on Applied System Innovation (ICASI)

System performance on cognitive femtocell network depends on power control method. Therefore, power control comparison between DPC and PCG is needed. Result showed that DPC had higher convergence rate than PCG but maximum values of DPC was only equal to SINR target. The proposed PCG at the convergent condition was able to exceed the SINR target, yet it had higher power than the previous PCG which...

INFONA - science communication portal

Search results

Decentralized reinforcement social learning based on cooperative policy exploration in multi-agent systems

Evolutionary game theoretic approach for optimal resource allocation in multi-agent systems

Distributed and Adaptive Routing Based on Game Theory

Using single Conspiracy Number to analyze game progress patterns

Embedding symbol algorithm for fast hit rate convergence in slot machine games

Utility-Based resource allocation for underlay D2D networks

Distributed algorithm for generalized Nash equilibria seeking of network aggregative game

Multi-leader multi-follower game-based ADMM for big data processing

Distributed nash equilibrium seeking in multi-agent games with partially coupled payoff functions

A distributed method for simultaneous social cost minimization and nash equilibrium seeking in multi-agent games

Distributed Nash equilibrium seeking of a class of aggregative games

Analysis of naming game based collective behavior with biased assimilation over adaptive networks

Controlling the motion of a group of unmanned flight vehicles in a perturbed environment based on the rules

Building Endgame Data set to Improve Opponent Modeling Approach

An analysis of the stability of evolution game between P2P platforms and regulators

A multi-agent reinforcement learning algorithm based on Stackelberg game

Optimal distributed channel assignment in D2D networks using learning in noisy potential games

Multicommodity games in public-cloud markets considering subadditive resource demands

Bridging the gap between big data and game theory: A general hierarchical pricing framework

Distributed power control vs power control game: A comparison study of performance in cognitive femtocell network

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options