Kai Wu d74507ace8 use pip install for lm_eval 1 rok temu
..
benchmarks d74507ace8 use pip install for lm_eval 1 rok temu