AVALIAÇÃO DE QUALIDADE DE CONJUNTOS DE DADOS DE MALWARE PARA APRENDIZADO DE MÁQUINA

HERNANDEZ, Thaisa da Silva; NETO, Augusto Parisot de Gusmão; GANDOLFI, Caroline Duarte; BENTO, Lucila M. S.; MACHADO, Raphael C. S.; SANTOS, Luiz Olavo Bonino da Silva; SANTOS, Anderson Fernandes Pereira dos; CAVALCANTI, Maria Cláudia Reis

QUALITY ASSESSMENT OF MALWARE DATASETS FOR MACHINE LEARNING

- 326182

Poster

Abstract

As cyberspace grows, so does the damage caused by malware, one of the main tools used by malicious actors. Machine learning algorithms have become established as important tools for threat detection, and the models they produce depend on data for training and testing. Malware datasets have therefore become valuable assets in the deployment of modern anti-malware systems. However, these datasets suffer from problems with sample quality, and they become obsolete because they do not keep pace with technological change. In addition, many of the datasets used in research are not publicly accessible. This paper proposes a quality assessment framework based on metrics focused on sampling and data temporality. The framework also incorporates criteria aligned with the FAIR principles (Findable, Accessible, Interoperable, Reusable), with the aim of encouraging the publication of more reliable and reusable datasets.
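
The abstract does not define the framework's metrics, so the following is only a minimal sketch of what sampling and temporality checks on a malware dataset could look like. The three functions (`class_balance`, `duplicate_rate`, `median_age_years`) and their formulas are hypothetical illustrations chosen for this example; they are not the authors' metrics.

```python
from collections import Counter
from datetime import date
from math import log2
from statistics import median


def class_balance(labels):
    """Normalized Shannon entropy of the labels (1.0 = perfectly balanced).

    Hypothetical sampling metric, not taken from the paper.
    """
    counts = Counter(labels)
    if len(counts) < 2:
        return 0.0  # a single class carries no balance information
    total = sum(counts.values())
    entropy = -sum((c / total) * log2(c / total) for c in counts.values())
    return entropy / log2(len(counts))


def duplicate_rate(hashes):
    """Fraction of samples whose hash repeats another sample.

    Hypothetical sampling metric, not taken from the paper.
    """
    if not hashes:
        return 0.0
    return 1 - len(set(hashes)) / len(hashes)


def median_age_years(first_seen, reference):
    """Median sample age in years relative to a reference date.

    Hypothetical temporality metric, not taken from the paper.
    """
    ages = [(reference - d).days / 365.25 for d in first_seen]
    return median(ages)


# Toy data, invented purely for illustration.
labels = ["malware", "benign", "malware", "benign"]
hashes = ["a1", "b2", "a1", "c3"]
first_seen = [date(2015, 1, 1), date(2020, 1, 1)]

print(class_balance(labels))
print(duplicate_rate(hashes))
print(median_age_years(first_seen, date(2025, 1, 1)))
```

Under these assumptions, a low balance score or a high duplicate rate would point to sampling problems, and a high median age would suggest that the dataset no longer reflects current threats.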

Programme

16:00 to 16:30 on 10/06/2025

Foyer Terreo

Institutions

¹ Instituto Militar de Engenharia (IME) and Diretoria de Comunicações e Tecnologia da Informação da Marinha (DCTIM)
² CASNAV
³ Instituto Militar de Engenharia (IME)
⁴ Universidade do Estado do Rio de Janeiro
⁵ Universidade Federal Fluminense (UFF)
⁶ University of Twente

Track

25. SE-PODMAR

Keywords

Datasets

Malware Analysis

FAIR

SBPO 2025

Anais do Simpósio Brasileiro de Pesquisa Operacional
Book of abstracts of the LVII Brazilian Symposium on Operations Research
