Table 8 Scalability and Resource Metrics Under Load Replay.

From: Hybrid MLOps framework for automated lifecycle management of adaptive phishing detection models

Metric

Result

Load replay volume

5 million labeled URLs

Ingestion rate

\(\approx\) 2,300 requests per second

p99 inference latency

41.6 milliseconds

Horizontal Pod Autoscaler (HPA) scaling behavior

Scaled from 3 to 12 pods (CPU target = 70%)

GPU utilization (peak during retraining)

74%

Storage overhead after 60 days

\(\le\) 1.4 TB (MinIO lifecycle-managed)