Recent Advances in Stochastic Gradient Descent in Deep Learning - Details

Author：

Tian, Yingjie (Tian, Yingjie.) | Zhang, Yuqi (Zhang, Yuqi.) | Zhang, Haibin (Zhang, Haibin.)

Indexed by：

Scopus SCIE

Abstract：

In　the　age　of　artificial　intelligence,　the　best　approach　to　handling　huge　amounts　of　data　is　a　tremendously　motivating　and　hard　problem.　Among　machine　learning　models,　stochastic　gradient　descent　(SGD)　is　not　only　simple　but　also　very　effective.　This　study　provides　a　detailed　analysis　of　contemporary　state-of-the-art　deep　learning　applications,　such　as　natural　language　processing　(NLP),　visual　data　processing,　and　voice　and　audio　processing.　Following　that,　this　study　introduces　several　versions　of　SGD　and　its　variant,　which　are　already　in　the　PyTorch　optimizer,　including　SGD,　Adagrad,　adadelta,　RMSprop,　Adam,　AdamW,　and　so　on.　Finally,　we　propose　theoretical　conditions　under　which　these　methods　are　applicable　and　discover　that　there　is　still　a　gap　between　theoretical　conditions　under　which　the　algorithms　converge　and　practical　applications,　and　how　to　bridge　this　gap　is　a　question　for　the　future.

Keyword：

deep learning machine learning stochastic gradient descent

Author Community：

[ 1 ] [Tian, Yingjie]Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
[ 2 ] [Tian, Yingjie]Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[ 3 ] [Tian, Yingjie]Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
[ 4 ] [Zhang, Yuqi]Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
[ 5 ] [Zhang, Haibin]Beijing Univ Technol, Beijing Inst Sci & Engn Comp, Fac Sci, Beijing 100124, Peoples R China

Reprint Author's Address：

[Tian, Yingjie]Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China;;[Tian, Yingjie]Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China;;[Tian, Yingjie]Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China;;

Email：

tyj@ucas.ac.cn

Show more details

Related Keywords：

State-of-the-Art CNN Optimizer for Brain Tumor Segmentation in Magnetic Resonance Images
2020，BRAIN SCIENCES
Neural Network-Based Side Channel Attacks and Countermeasures
2020，57th ACM/IEEE Design Automation Conference (DAC)
Implementation and Use of Disease Diagnosis Systems for Electronic Medical Records Based on Machine Learning: A Complete Review
2020，IEEE ACCESS
A Method for Mining Granger Causality Relationship on Atmospheric Visibility
2021，ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

Source ：

MATHEMATICS

Year： 2023

Issue： 3

Volume： 11

2 . 4 0 0

JCR@2022

ESI Discipline： MATHEMATICS;

ESI HC Threshold：9

Cited Count：

WoS CC Cited Count： 86

SCOPUS Cited Count： 164

ESI Highly Cited Papers on the List： 8 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to