Exponential Discretization of Weights of Neural Network Connections in Pre-Trained Neural Networks



Abstract

To reduce random access memory (RAM) requirements and to increase the speed of recognition algorithms, we consider the weight discretization problem for trained neural networks. We show that exponential discretization is preferable to linear discretization, since it achieves the same accuracy with 1-2 fewer bits. The quality of the VGG-16 network is already satisfactory (top-5 accuracy 69%) with 3-bit exponential discretization, and ResNet50 shows top-5 accuracy 84% at 4 bits. Other neural networks perform fairly well at 5 bits (top-5 accuracies of Xception, Inception-v3, and MobileNet-v2 were 87%, 90%, and 77%, respectively). With fewer bits, the accuracy decreases rapidly.
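To make the comparison concrete, below is a minimal NumPy sketch contrasting linear (uniform) quantization levels with exponential (power-of-two) levels. The exact codebook here (level placement, treatment of zero and sign) is an illustrative assumption, not the authors' scheme from the paper; the intuition it reflects is that exponential levels concentrate near zero, where most trained weights lie.

import numpy as np

def discretize_linear(w: np.ndarray, bits: int) -> np.ndarray:
    """Uniform levels spanning [-max|w|, +max|w|] (illustrative codebook)."""
    n_levels = 2 ** bits
    w_max = np.abs(w).max()
    step = 2 * w_max / (n_levels - 1)
    return np.round(w / step) * step

def discretize_exponential(w: np.ndarray, bits: int) -> np.ndarray:
    """Signed powers of two: w -> sign(w) * 2^k (illustrative codebook).
    One code is reserved for zero; the rest split between the two signs."""
    n_exp = 2 ** (bits - 1) - 1                # exponent codes per sign
    w_max = np.abs(w).max()
    k_hi = int(np.ceil(np.log2(w_max)))        # largest exponent covers w_max
    k_lo = k_hi - n_exp + 1                    # smallest representable exponent
    sign = np.sign(w)
    mag = np.abs(w)
    # Round |w| to the nearest power of two in the log domain, then clip.
    k = np.clip(np.round(np.log2(np.maximum(mag, 2.0 ** k_lo))), k_lo, k_hi)
    q = sign * 2.0 ** k
    q[mag < 2.0 ** (k_lo - 1)] = 0.0           # underflow maps to the zero code
    return q

rng = np.random.default_rng(0)
w = rng.normal(scale=0.05, size=10_000)        # stand-in for a trained layer
for bits in (3, 4, 5):
    err_lin = np.abs(w - discretize_linear(w, bits)).mean()
    err_exp = np.abs(w - discretize_exponential(w, bits)).mean()
    print(f"{bits} bits: linear MAE {err_lin:.5f}, exponential MAE {err_exp:.5f}")

With weights concentrated near zero, the exponential codebook typically yields a smaller mean quantization error at the same bit width, which is consistent with the abstract's claim of saving 1-2 bits.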

About the authors

M. Yu. Malsagov

Scientific Research Institute for System Analysis, Russian Academy of Sciences

Author for correspondence.
Email: malsagov@niisi.ras.ru
Russian Federation, Moscow, 117218

E. M. Khayrov

Scientific Research Institute for System Analysis, Russian Academy of Sciences

Author for correspondence.
Email: emil.khayrov@gmail.com
Russian Federation, Moscow, 117218

M. M. Pushkareva

Scientific Research Institute for System Analysis, Russian Academy of Sciences

Author for correspondence.
Email: mariaratko@gmail.com
Russian Federation, Moscow, 117218

I. M. Karandashev

Scientific Research Institute for System Analysis, Russian Academy of Sciences; Peoples' Friendship University of Russia (RUDN University), Moscow

Author for correspondence.
Email: karandashev@niisi.ras.ru
Russian Federation, Moscow, 117218; Moscow, 117198

Copyright (c) 2019 Allerton Press, Inc.