Please use this identifier to cite or link to this item:
https://repositorio.ufpe.br/handle/123456789/58289
Share on
Title: | Robust Video Plagiarism Detection Using Word Embeddings from Audio Transcriptions |
Authors: | BARROS SILVA, Lucas Leonardo |
Keywords: | Video plagiarism detection; Word embeddings; Audio transcription; Semantic Similarity; Multimedia Content Protection |
Issue Date: | 22-Oct-2024 |
Citation: | BARROS SILVA, L.L. Robust Video Plagiarism Detection Using Word Embeddings from Audio Transcriptions. 2024. Trabalho de Conclusão do Curso de Ciência da Computação - Universidade Federal de Pernambuco, Recife, 2024. |
Abstract: | Video piracy presents a significant challenge in the digital era, requiring effective detection methods to protect intellectual property. This paper proposes a novel approach for detecting video plagiarism by leveraging word embeddings derived from audio transcriptions. Our method begins by extracting audio streams from videos and transcribing the audio content. We then generate semantic embeddings, storing these embeddings in a vector store for efficient similarity searches. To identify potential plagiarism, query videos are processed through the same pipeline, and their embeddings are compared against reference embeddings. A Euclidean distance below a predefined threshold indicates possible plagiarism, enabling accurate classification and identification of plagiarized videos. Experimental evaluations demonstrate the method’s scalability and efficiency, particularly in detecting complete video copies with explicit English speech content. This approach offers a robust and scalable solution against joint video manipulations, providing a practical framework for combating video piracy in large-scale content environments. |
URI: | https://repositorio.ufpe.br/handle/123456789/58289 |
Appears in Collections: | (TCC) - Ciência da Computação |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
tg-llbs.pdf | 866,17 kB | Adobe PDF | ![]() View/Open |
This item is protected by original copyright |
This item is licensed under a Creative Commons License