Korean1 KMMLU: A Korean Benchmark for LLMs KMMLU: Measuring Massive Multitask Language Understanding in Korean Guijin Son, Hanwool Lee, Sungdong Kim, etc. 18 Feb 2024 MMLU There exist various benchmarks for evaluating and understanding the capabilities of Large Language Models (LLMs), such as commonsense reasoning, code generation, and multi-turn conversations. Massive Multitask Language Understanding (MMLU) is one of these benchmarks, c.. 2024. 3. 29. 이전 1 다음