For a benchmark named terminal bench, I would assume it would require some terminal "interaction", not giving the code and command.