add reacher env and all mujoco envs now support COT, SPP, SELF-REFLEXION, EXE methods under L1&L3 setting. 8f842da CharlesZhang commited on Jan 8, 2024