◾전통적인 평가방법

🔻Confusion Matrix

e.g. ground truth(gt) : Here is a breakdown of what is happening and why prediction : Here is a what happening and also why

🔸Precision

🔸Recall

🔸F1-score

◾NLP Benchmarks

🔸BLEU (Bilingual Evaluation Understudy Score)