GenTune-CyberDB: Workload-Generative, Cross-Family Auto-Tuning for Cybersecurity Vector Databases

  • Muhammad Tayyab Faculty of Computer Science, University of Central Punjab, Faisalabad, Pakistan
  • Afrooz Amjad Faculty of Computer Science, University of Central Punjab, Faisalabad, Pakistan
  • Ali Hussain Department of Computer Science & IT, The University of Lahore, Lahore, Pakistan
Keywords: Vector databases, intrusion detection, anomaly detection, threat intelligence, network traffic patterns, multi-fidelity optimization, GenTune-CyberDB, security infrastructure

Abstract

Vector databases are essential for AI-driven cybersecurity tasks, such as intrusion detection, anomaly detection, and threat intelligence retrieval, where high-dimensional security data like network traffic patterns, user behavior analytics, and security event logs are processed. However, the performance of these systems often relies on manual selection and tuning of indexing families (e.g., HNSW, IVF-PQ, ScaNN) and hyperparameters, which is inefficient and impractical in dynamic security environments. In this paper, we propose GenTune-CyberDB, a workload-generative, cross-family auto-tuning framework specifically designed for cybersecurity applications. GenTune-CyberDB leverages workload generation to create realistic attack and anomaly detection queries, optimizing database performance for real-time security data processing. It performs multi-objective, multi-fidelity optimization on index families, execution plans, and hyperparameters, considering constraints like latency, memory, and build time, ultimately improving detection efficiency and resource usage. GenTune-CyberDB demonstrates significant gains in recall and latency optimization, achieving up to 60% memory reduction with minimal recall loss (≤1%). The system adapts to evolving attack patterns and workloads, ensuring robustness even with shifts in data distribution. By automating the tuning process, GenTune-CyberDB offers superior performance for cybersecurity deployments compared to traditional, manually-tuned systems, delivering better recall-latency-memory trade-offs and improving overall security infrastructure.

Published
2025-11-13
How to Cite
Muhammad Tayyab, Afrooz Amjad, & Hussain, A. (2025). GenTune-CyberDB: Workload-Generative, Cross-Family Auto-Tuning for Cybersecurity Vector Databases. International Journal for Electronic Crime Investigation, 9(2). https://doi.org/10.54692/ijeci.2025.0902/258