Use este identificador para citar ou linkar para este item:
https://repositorio.ufpe.br/handle/123456789/38541
Compartilhe esta página
Título: | Time Aware Sigmoid Optimization : a new learning rate scheduling method |
Autor(es): | LEUCHTENBERG, Pedro Henrique Dreyer |
Palavras-chave: | Inteligência computacional; Aprendizagem de máquinas; Redes neurais profundas; Taxa de aprendizado |
Data do documento: | 6-Set-2019 |
Editor: | Universidade Federal de Pernambuco |
Citação: | LEUCHTENBERG, Pedro Henrique Dreyer. Time Aware Sigmoid Optimization: a new learning rate scheduling method. 2019. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de Pernambuco, Recife, 2019. |
Abstract: | The correct choice of hyperparameters for the training of a deep neural network is a critical step to achieve a good result. Good hyperparameters would give rise to faster training and a lower error rate, while bad choices could make the network not even converge, rendering the whole training process useless. Among all the existing hyperparameters, perhaps the one with the greatest importance is the learning rate, which controls how the weights of a neural network are going to change at each interaction. In that context, by analyzing some theoretical findings in the area of information theory and topology of the loss function in deep learning, the author was able to come up with a new training rate decay method called Training Aware Sigmoid Optimization (TASO), which proposes a dual-phase during training. The proposed method aims to improve training, achieving a better inference performance in a reduced amount of time. A series of tests were done to evaluate this hypothesis, comparing TASO with different training methods such as Adam, ADAGrad, RMSProp, and SGD. Results obtained on three datasets (MNIST, CIFAR10, and CIFAR100) and with three different architectures (Lenet, VGG, and RESNET) have shown that TASO presents, in fact, an overall better performance than the other evaluated methods. |
Descrição: | LEUCHTENBERG, Pedro Henrique Dreyer, também é conhecido em citações bibliográficas por: DREYER, Pedro Henrique |
URI: | https://repositorio.ufpe.br/handle/123456789/38541 |
Aparece nas coleções: | Dissertações de Mestrado - Ciência da Computação |
Arquivos associados a este item:
Arquivo | Descrição | Tamanho | Formato | |
---|---|---|---|---|
DISSERTAÇÃO Pedro Henrique Dreyer Leuchtenberg.pdf | 1,85 MB | Adobe PDF | ![]() Visualizar/Abrir |
Este arquivo é protegido por direitos autorais |
Este item está licenciada sob uma Licença Creative Commons