Multi-Agent Fingerprints-Enhanced Distributed Intelligent Handover Algorithm in LEO Satellite Networks - Details

Author：

Indexed by：

EI Scopus SCIE

Abstract：

The　next-generation　wireless　network　is　expected　to　use　low-earth　orbit　(LEO)　satellite　networks　to　deliver　seamless　and　high-capacity　global　communications　services.　Due　to　the　high-speed　mobility　of　LEO　satellites,　massive　and　frequent　handovers　inevitably　occur.　Moreover,　handover　becomes　more　complicated　with　the　ever-growing　constellation　scale,　number　of　mobile　terminals　(MTs),　and　demands　for　emerging　delay-sensitive　applications.　In　this　paper,　a　decentralized　Markov　decision　process　(DEC-MDP)　is　adopted　to　formulate　the　handover　problem　in　the　LEO　satellite　network　with　finite　bursty　traffic.　The　target　is　maximizing　the　total　reward　associated　with　the　service　revenue　and　the　cost　of　handover　and　packet　loss.　To　deal　with　the　high　computational　complexity　caused　by　the　large　state　space　and　action　space,　the　solution　is　designed　using　a　multi-agent　double　deep　Q-network　(MADDQN)　with　fully　decentralized　framework,　which　also　allows　each　MT　to　train　and　use　an　individual　local　DDQN　to　avoid　load　imbalance　between　satellites.　Further,　to　alleviate　the　non-stationary　issue　of　the　environment　in　parallel　learning,　multi-agent　fingerprints　are　applied　in　MADDQN,　and　the　proposed　algorithm　is　called　multi-agent　fingerprints-enhanced　double　deep　Q-network-based　distributed　intelligent　handover　(MAF-DDQN-DIH)　mechanism.　The　implementation　of　MAF-DDQN-DIH　in　practical　communication　systems　are　discussed,　and　the　corresponding　communication　overhead　and　computational　complexity　are　analyzed.　Simulation　results　demonstrate　that　the　designed　multi-agent　fingerprints　are　effective　and　the　proposed　MAF-DDQN-DIH　algorithm　outperforms　the　comparison　handover　algorithms　in　terms　of　the　total　reward.

Keyword：

Low earth orbit satellites decentralized Markov decision process handover Delays LEO satellite networks Handover multi-agent double deep Q-network multi-agent fingerprints Decision making Orbits Satellites Heuristic algorithms

Author Community：

[ 1 ] [Yang, Feng]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Wu, Wenjun]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Gao, Yang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 4 ] [Sun, Yang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 5 ] [Si, Pengbo]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 6 ] [Sun, Teng]China Elect Technol Grp Corp, Res Inst 54, Shijiazhuang 050081, Peoples R China

Reprint Author's Address：

[Wu, Wenjun]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China;;

Email：

Show more details

Related Keywords：

Handover for Multi-Beam LEO Satellite Networks: A Multi-Objective Reinforcement Learning Method
2024，IEEE COMMUNICATIONS LETTERS
Distributed Handoff Problem in Heterogeneous Networks With End-to-End Network Slicing: Decentralized Markov Decision Process-Based Modeling and Solution
2022，IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS
An Adaptive Dual-Mode Task-Oriented Resource Management Strategy for GEO Relay Systems
2024，IEEE TRANSACTIONS ON MOBILE COMPUTING
Self-learning congestion control of MPTCP in satellites communications
2019，15th IEEE International Wireless Communications and Mobile Computing Conference, IWCMC 2019

Source ：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY

ISSN： 0018-9545

Year： 2024

Issue： 10

Volume： 73

Page： 15255-15269

6 . 8 0 0

JCR@2022

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 9

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to