Which metric is most appropriate for evaluating a binary classifier on an imbalanced dataset where the positive class is rare?
Correct: B
F1-score balances precision and recall, making it robust when class prevalence is skewed, unlike accuracy which can be misleadingly high if the model predicts the majority class.