DOBEL: detecting backdoors in ensemble learning

Research output: Contribution to journalArticlepeer-review

Abstract

In recent years, machine learning’s rapid growth has sparked security concerns, notably around backdoor attacks (a.k.a., Trojan attacks). However, while previous research has examined these attacks across domains like neural networks, there’s been little focus on backdoors in ensemble learning, despite their heightened risk. This paper presents DOBEL, the first method specialized to detect backdoor attacks in ensemble learning, especially those enabled by embedded triggers in training data. DOBEL employs carefully crafted test ensembles and analyzes feature vector magnitudes to distinguish benign models from malicious ones. Crucially, it addresses limitations of existing defenses which rely on sensitive training data. Experimental results show DOBEL’s effectiveness, with 98.9% accuracy in identifying Trojaned ensembles and rapid decision-making for a 50-model ensemble in 0.024 milliseconds.

Original languageEnglish
Article number288
JournalCluster Computing
Volume28
Issue number5
DOIs
StatePublished - Oct 2025

Keywords

  • AI security
  • Backdoor/Trojan attack detection
  • Distributed learning
  • Ensemble learning

Fingerprint

Dive into the research topics of 'DOBEL: detecting backdoors in ensemble learning'. Together they form a unique fingerprint.

Cite this