A configurable multiplex data transfer model for asynchronous and heterogeneous FPGA accelerators on single DMA device - Details

Author：

Huang, Zhangqin (Huang, Zhangqin.) (Scholars：黄樟钦) | Zhang, Shuo (Zhang, Shuo.) | Gao, Han (Gao, Han.) | Zhang, Xiaobo (Zhang, Xiaobo.) | Yang, Shengqi (Yang, Shengqi.)

Indexed by：

EI Scopus SCIE

Abstract：

To　reduce　DMA　utilization　for　multiple　algorithm　IPs　on　FPGA,　a　channel　configurable　and　multiplex　DMA　device　(CMDMA)　is　proposed　for　asynchronous　and　heterogeneous　algorithm　IPs.　Firstly,　we　abstract　the　entities　and　data-flow　in　CMDMA　system　with　a　formal　description　for　function　definition　and　work-flow　analysis.　Then　based　on　the　functions　and　work-flow,　we　design　and　implement　a　prototype　of　CMDMA,　which　includes　CMDMA　software　driver　(SW)　and　hardware　circuits　(HW)　of　one　DMA　IP,　a　configurable　input　switch　(CISwitch),　algorithm　IPs,　and　an　asynchronous　output　switch　(AOSwitch).　The　configurable　function　of　CMDMA　is　implemented　by　CISwitch　through　a　configuration　port　in　HW-level,　and　a　configurable　Round-Robin　(CRR)　algorithm　is　proposed　to　implement　channel　and　input　data　schedule　in　SW-level.　For　output,　a　channel　distinguishable　output　buffer　(ChnDistBuf)　is　proposed,　which　is　able　to　deliver　channel　ID　and　data　size　to　SW　earlier　than　the　end　time　of　an　algorithm　IP.　With　a　double　interrupt　coordination　method　of　both　ChnDistBuf　and　algorithm　IPs,　CMDMA　is　able　to　successively　store　complete　output　data　from　different　algorithm　IPs.　With　a　double　interrupt　coordination　method　of　both　ChnDistBuf　and　algorithm　IPs,　CMDMA　is　able　to　successively　store　complete　output　data　from　different　algorithm　IPs.　The　experiments　based　on　4　heterogeneous　matrix　multiplication　algorithm　IPs　on　Xilinx　Zynq　platform　show　that　CMDMA　is　able　to　improve　about　8%　29%　average　algorithm　acceleration　rates　on　single　algorithm　IP　compared　to　the　exclusive　method　that　one　DMA　works　for　one　algorithm　IP　only,　and　it　is　able　to　increase　about　10-40　MB/s　and　5-15　MB/s　of　DMA　input　and　output　data　throughput　with　multiple　algorithm　IPs　running　in　parallel.　Moreover,　the　extended　LUT　and　FF　resources　in　CMDMA　are　756　and　1219　,　both　of　which　are　about　1%　of　Zynq　platform.　Besides,　in　a　double　CNN　algorithm　IPs　test　on　Mnist　application,　an　enhanced　function　of　data　broadcasting　in　CMDMA　is　able　to　improve　4　s　than　the　system　with　4　exclusive　DMA　running　in　parallel,　meanwhile　reduce　3　DMA　utilization　and　0.　03　W　power　consumption.　(c)　2020　Elsevier　B.V.　All　rights　reserved.

Keyword：

DMA Switch FPGA System architecture Multiplex

Author Community：

[ 1 ] [Huang, Zhangqin]Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China
[ 2 ] [Zhang, Shuo]Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China
[ 3 ] [Gao, Han]Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China
[ 4 ] [Zhang, Xiaobo]Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China
[ 5 ] [Yang, Shengqi]Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China

Reprint Author's Address：

[Zhang, Shuo]Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China

Email：

zhuang@bjut.edu.cn |
15210880237@139.com

Show more details

Related Keywords：

Design and Implementation of Multiport Ethernet Data Arbiter Based on AXI4-Stream
2018，2nd International Conference on Advances in Image Processing (ICAIP) / 2nd International Conference on Software Engineering and Development (ICSED
基于AHB总线的SD/SDHC/MMC控制器设计
2010，微电子学与计算机
供水管网独立计量区域管理数据库系统开发
2015，给水排水
氯胺消毒中DMA生成UDMH反应机理的理论研究及DFT方法的比较
2017，第九届全国环境化学大会

Source ：

MICROPROCESSORS AND MICROSYSTEMS

ISSN： 0141-9331

Year： 2020

Volume： 77

2 . 6 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：132

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息学部软件学院

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to