ahq1993/inverse_rl

What does max_itrs mean for the target empowerment model?

Closed · 1 comment

The code in ant_irl.py:

    empw_model = Empowerment(env=env, fusion=True, max_itrs=4)
    t_empw_model = Empowerment(env=env, scope='t_efn', fusion=True, max_itrs=2, name='empowerment2')

However, t_empw_model.fit(...) is never called.
Why is max_itrs=2 set for the target empowerment model?
Where is the target empowerment model updated?

That must be a mistake. You are right: we do not use max_itrs for the target model.
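For readers wondering how a target model can change without fit being called: a common pattern is to overwrite the target's parameters with a copy of the trained model's parameters. This thread does not confirm that the repo does exactly this, so the sketch below is only an illustration. TinyModel, get_params, and set_params are hypothetical stand-ins and not the repo's Empowerment API. It shows that, under this pattern, the target's max_itrs value has no effect.

```python
import copy


class TinyModel:
    """Hypothetical stand-in for a trainable model (not the repo's Empowerment class)."""

    def __init__(self, max_itrs=1, lr=0.1):
        self.max_itrs = max_itrs  # only read inside fit()
        self.lr = lr
        self.params = {"w": 0.0}

    def fit(self, target_value):
        # Run max_itrs gradient steps on the loss (w - target_value)^2.
        for _ in range(self.max_itrs):
            grad = 2.0 * (self.params["w"] - target_value)
            self.params["w"] -= self.lr * grad

    def get_params(self):
        return copy.deepcopy(self.params)

    def set_params(self, params):
        self.params = copy.deepcopy(params)


online = TinyModel(max_itrs=4)
target_a = TinyModel(max_itrs=2)    # max_itrs is never read for a target
target_b = TinyModel(max_itrs=100)  # a different value changes nothing

online.fit(target_value=1.0)

# Assumed update rule: sync each target by copying the online parameters.
for target in (target_a, target_b):
    target.set_params(online.get_params())

print(target_a.params == target_b.params == online.params)
```

Because fit is the only place that reads max_itrs in this sketch, a model that is updated purely by parameter copying ends up in the same state whatever value was passed to its constructor.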