Polish Language Benchmarks
Polish Linguistic and Cultural Competency Benchmark (PLCC)
Polish
Evaluates linguistic and cultural understanding in Polish
Polish Cultural Vision Benchmark
Polish
SpeakLeash
Vision Language Model benchmark for Polish cultural understanding
International Benchmarks (Bielik Evaluated)
European LLM Leaderboard
Multi-language European language model evaluation
EuroEval
European multilingual model evaluation platform
Open LLM Leaderboard
Original comprehensive LLM evaluation leaderboard
Open LLM Leaderboard v2
Updated version of the Open LLM Leaderboard
MixEval
Mixed evaluation benchmark for language models
Berkeley Function-Calling Leaderboard
Evaluates function calling capabilities of LLMs
FLORES200 Translation Benchmark
Large-scale multilingual translation evaluation
BenCzechMark
Czech language model benchmark suite
Portuguese Benchmark (Open PT LLM Leaderboard)
Portuguese language model evaluation platform