A Paired Testing Protocol for Batch-Conditioned Refusal Robustness in LLM Serving
AcceptedPhase 1 safety flips at ~0.58% vs capability ~0.14% under controlled batching. Refusal-to-compliance dominant direction. Reduced true-batching validation reaches ~99.4% agreement with synchronized dispatch.