This demo allows you to generate a radial plot comparing the performance of different language models on different tasks. It is based on the generative results from the EuroEval benchmark.