Use este identificador para citar ou linkar para este item: https://repositorio.idp.edu.br//handle/123456789/5666
Título: Application of LLMS in source code recovery from Python bytecode
Autor(es): Brandão-Martins, João Vitor Moreira
Orientador(es): Gomes, Jeremias Moreira
Palavras-chave: Engenharia reversa de software;Python - Linguagem de programação;Geração de código - Ciência da Computação
Editor: Idp
Citação: BRANDÃO-MARTINS, João Vitor Moreira. Application of LLMS in source code recovery from Python bytecode. 2025. 75 f. Monografia (Graduação em Ciência da Computação) – Instituto Brasileiro de Ensino, Desenvolvimento e Pesquisa, Brasília, 2026.
Resumo: Source code recovery from intermediate representations, such as bytecode or binary code, plays a fundamental role in reverse engineering, especially in scenarios where the original source is unavailable. Although Python is typically referred to as an inter preted language, its execution involves compilation into bytecode, an intermediate form executed by the Python Virtual Machine (PVM). This process removes high-level infor mation and introduces challenges for accurate decompilation. Traditional tools often produce code that is syntactically valid but semantically limited or difficult to interpret. In recent years, Large Language Models (LLMs) based on transformer architectures have shown promising results in tasks involving source code understanding, genera tion, and even binary analysis. This study investigates the application of LLMs to the task of recovering Python source code from bytecode, an area still largely unexplored in the literature. Through a systematic review of related work, the research identifies a gap in the use of LLMs for Python bytecode decompilation. This work proposes an approach centered on modern LLMs. The hypothesis is that such models can assist in both syntactic and semantic reconstruction of the original source code. The expected contributions include evaluating the success rate of this approach and offering new insights into the intersection between machine learning and reverse engineering.
URI: https://repositorio.idp.edu.br//handle/123456789/5666
Aparece nas coleções:Trabalhos de Conclusão de Curso (Graduação em Ciência da Computação)

Arquivos associados a este item:
Arquivo Descrição TamanhoFormato 
Monografia_JOÃO VITOR MOREIRA BRANDÃO-MARTINS_Graduação em Ciência da Computação.pdf1.52 MBAdobe PDFVisualizar/Abrir


Os itens no repositório estão protegidos por copyright, com todos os direitos reservados, salvo quando é indicado o contrário.