Use este identificador para citar ou linkar para este item:
https://repositorio.ufpe.br/handle/123456789/53563
Compartilhe esta página
Título: | Improving binary classifiers on imbalanced data using large language models |
Autor(es): | BARBOSA, José Matheus Lacerda |
Palavras-chave: | Banco de dados; Aprendizado por indução |
Data do documento: | 12-Jul-2023 |
Editor: | Universidade Federal de Pernambuco |
Citação: | BARBOSA, José Matheus Lacerda. Improving binary classifiers on imbalanced data using large language models. 2023. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de Pernambuco, Recife, 2023. |
Abstract: | In the realm of real-world classification tasks, the challenge of imbalanced data fre- quently hinders the efficacy of machine learning models in performing accurate binary classifications. To address this issue directly, this study introduces "BALANCE," a novel framework designed to rectify data imbalance in text datasets for binary classification. BALANCE leverages prompt-based learning to efficiently generate synthetic data that mimics the characteristics of the minority class. This is achieved by optimizing the de- coding parameters of a specific natural language generation model and tailoring text gen- eration to the minority class. A customized prompt is subsequently employed to generate instances using the fine-tuned language model. We conducted a comprehensive experimen- tal evaluation using three imbalanced real-world text classification datasets. The findings of our study reveal that BALANCE consistently outperforms existing methods for data creation and imbalance correction in the majority of scenarios. These results underscore the high quality of the generated instances and the potential of BALANCE to significantly enhance the performance of text classification models when dealing with imbalanced data. |
URI: | https://repositorio.ufpe.br/handle/123456789/53563 |
Aparece nas coleções: | Dissertações de Mestrado - Ciência da Computação |
Arquivos associados a este item:
Arquivo | Descrição | Tamanho | Formato | |
---|---|---|---|---|
DISSERTAÇÃO José Matheus Lacerda Barbosa.pdf | 1,69 MB | Adobe PDF | ![]() Visualizar/Abrir |
Este arquivo é protegido por direitos autorais |
Este item está licenciada sob uma Licença Creative Commons