Loading HumanEval benchmark...