Machine Learning Paper Reviews (Mostly NLP)

Korean (1)

KMMLU: A Korean Benchmark for LLMs

KMMLU: Measuring Massive Multitask Language Understanding in Korean
Guijin Son, Hanwool Lee, Sungdong Kim, et al.
18 Feb 2024

MMLU

Various benchmarks exist for evaluating and understanding the capabilities of Large Language Models (LLMs), such as commonsense reasoning, code generation, and multi-turn conversation. Massive Multitask Language Understanding (MMLU) is one of these benchmarks, c..

2024. 3. 29.
