Skip navigation
Please use this identifier to cite or link to this item: https://repositorio.ufpe.br/handle/123456789/38541

Share on

Title: Time Aware Sigmoid Optimization : a new learning rate scheduling method
Authors: LEUCHTENBERG, Pedro Henrique Dreyer
Keywords: Inteligência computacional; Aprendizagem de máquinas; Redes neurais profundas; Taxa de aprendizado
Issue Date: 6-Sep-2019
Publisher: Universidade Federal de Pernambuco
Citation: LEUCHTENBERG, Pedro Henrique Dreyer​. Time Aware Sigmoid Optimization: a new learning rate scheduling method. 2019. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de Pernambuco, Recife, 2019.
Abstract: The correct choice of hyperparameters for the training of a deep neural network is a critical step to achieve a good result. Good hyperparameters would give rise to faster training and a lower error rate, while bad choices could make the network not even converge, rendering the whole training process useless. Among all the existing hyperparameters, perhaps the one with the greatest importance is the learning rate, which controls how the weights of a neural network are going to change at each interaction. In that context, by analyzing some theoretical findings in the area of information theory and topology of the loss function in deep learning, the author was able to come up with a new training rate decay method called Training Aware Sigmoid Optimization (TASO), which proposes a dual-phase during training. The proposed method aims to improve training, achieving a better inference performance in a reduced amount of time. A series of tests were done to evaluate this hypothesis, comparing TASO with different training methods such as Adam, ADAGrad, RMSProp, and SGD. Results obtained on three datasets (MNIST, CIFAR10, and CIFAR100) and with three different architectures (Lenet, VGG, and RESNET) have shown that TASO presents, in fact, an overall better performance than the other evaluated methods.
Description: LEUCHTENBERG, Pedro Henrique Dreyer​, também é conhecido em citações bibliográficas por: ​DREYER, Pedro Henrique
URI: https://repositorio.ufpe.br/handle/123456789/38541
Appears in Collections:Dissertações de Mestrado - Ciência da Computação

Files in This Item:
File Description SizeFormat 
DISSERTAÇÃO Pedro Henrique Dreyer Leuchtenberg.pdf1,85 MBAdobe PDFThumbnail
View/Open


This item is protected by original copyright



This item is licensed under a Creative Commons License Creative Commons