Model evaluation metrics (accuracy, precision, recall, F1-score)
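
As a minimal sketch of how these four metrics relate, the plain-Python example below computes them for a binary classifier from the counts of true positives (TP), false positives (FP), false negatives (FN), and true negatives (TN). The function names `confusion_counts` and `classification_metrics`, the `positive` parameter, and the choice to return 0.0 when a denominator is zero are assumptions made for illustration, not part of the original text.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count TP, FP, FN, TN for a binary task (hypothetical helper)."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1  # predicted positive, actually positive
        elif pred == positive:
            fp += 1  # predicted positive, actually negative
        elif truth == positive:
            fn += 1  # predicted negative, actually positive
        else:
            tn += 1  # predicted negative, actually negative
    return tp, fp, fn, tn


def classification_metrics(y_true, y_pred, positive=1):
    """Return accuracy, precision, recall, and F1-score as a dict.

    Assumption: any metric with a zero denominator is reported as 0.0.
    """
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn

    # Accuracy: share of all predictions that are correct.
    accuracy = (tp + tn) / total if total else 0.0
    # Precision: share of predicted positives that are truly positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: share of actual positives that were found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1-score: harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Made-up labels, for illustration only.
    y_true = [1, 0, 1, 1, 0, 0, 1, 0]
    y_pred = [1, 0, 0, 1, 1, 1, 1, 0]
    print(classification_metrics(y_true, y_pred))
```

With these made-up labels the counts are TP = 3, FP = 2, FN = 1, TN = 2, so accuracy is 5/8, precision is 3/5, recall is 3/4, and the F1-score is 2/3. The four values differ on the same predictions, which is why they are usually reported together rather than accuracy alone.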