Deep learning: an overview and main paradigms
- Autores: Golovko V.A.1,2
-
Afiliações:
- Brest State Technical University
- National Research Nuclear University MEPhI (Moscow Engineering Physics Institute)
- Edição: Volume 26, Nº 1 (2017)
- Páginas: 1-17
- Seção: Article
- URL: https://journals.rcsi.science/1060-992X/article/view/194933
- DOI: https://doi.org/10.3103/S1060992X16040081
- ID: 194933
Citar
Resumo
In the present paper, we examine and analyze main paradigms of learning of multilayer neural networks starting with a single layer perceptron and ending with deep neural networks, which are considered regarded as a breakthrough in the field of the intelligent data processing. The baselessness of some ideas about the capacity of multilayer neural networks is shown and transition to deep neural networks is justified. We discuss the principal learning models of deep neural networks based on the restricted Boltzmann machine (RBM), an autoassociative approach and a stochastic gradient method with a Rectified Linear Unit (ReLU) activation function of neural elements.
Palavras-chave
Sobre autores
V. Golovko
Brest State Technical University; National Research Nuclear University MEPhI (Moscow Engineering Physics Institute)
Autor responsável pela correspondência
Email: gva@bstu.by
Belarus, Brest; Moscow
Arquivos suplementares
