• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Yang, Feng (Yang, Feng.) | Wu, Wenjun (Wu, Wenjun.) | Gao, Yang (Gao, Yang.) | Sun, Yang (Sun, Yang.) | Sun, Teng (Sun, Teng.) | Si, Pengbo (Si, Pengbo.)

Indexed by:

EI Scopus SCIE

Abstract:

The next-generation wireless network is expected to use low-earth orbit (LEO) satellite networks to deliver seamless and high-capacity global communications services. Due to the high-speed mobility of LEO satellites, massive and frequent handovers inevitably occur. Moreover, handover becomes more complicated with the ever-growing constellation scale, number of mobile terminals (MTs), and demands for emerging delay-sensitive applications. In this paper, a decentralized Markov decision process (DEC-MDP) is adopted to formulate the handover problem in the LEO satellite network with finite bursty traffic. The target is maximizing the total reward associated with the service revenue and the cost of handover and packet loss. To deal with the high computational complexity caused by the large state space and action space, the solution is designed using a multi-agent double deep Q-network (MADDQN) with fully decentralized framework, which also allows each MT to train and use an individual local DDQN to avoid load imbalance between satellites. Further, to alleviate the non-stationary issue of the environment in parallel learning, multi-agent fingerprints are applied in MADDQN, and the proposed algorithm is called multi-agent fingerprints-enhanced double deep Q-network-based distributed intelligent handover (MAF-DDQN-DIH) mechanism. The implementation of MAF-DDQN-DIH in practical communication systems are discussed, and the corresponding communication overhead and computational complexity are analyzed. Simulation results demonstrate that the designed multi-agent fingerprints are effective and the proposed MAF-DDQN-DIH algorithm outperforms the comparison handover algorithms in terms of the total reward.

Keyword:

Low earth orbit satellites decentralized Markov decision process handover Delays LEO satellite networks Handover multi-agent double deep Q-network multi-agent fingerprints Decision making Orbits Satellites Heuristic algorithms

Author Community:

  • [ 1 ] [Yang, Feng]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Wu, Wenjun]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Gao, Yang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Sun, Yang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 5 ] [Si, Pengbo]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 6 ] [Sun, Teng]China Elect Technol Grp Corp, Res Inst 54, Shijiazhuang 050081, Peoples R China

Reprint Author's Address:

  • [Wu, Wenjun]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China;;

Show more details

Related Keywords:

Source :

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY

ISSN: 0018-9545

Year: 2024

Issue: 10

Volume: 73

Page: 15255-15269

6 . 8 0 0

JCR@2022

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 2

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Affiliated Colleges:

Online/Total:526/5317299
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.