pixel obs

Question

pixel obs

haoyu-x opened this issue 4 years ago · 5 comments

Hi @keiohta , thanks for sharing such a great project!

I'm wondering if I want to set observation to be image-based using tf2rl, do you have any advice?

Answer 1 · 2020-07-31T12:57:58.000Z

I'm not sure what you want to do but I guess Atari example with DQN might help you.
https://github.com/keiohta/tf2rl/blob/master/examples/run_dqn_atari.py

Answer 2 · 2020-08-03T15:15:34.000Z

hi @keiohta, sorry for the late reply.

basically, I'd like to use GAIfO+SAC on image-based dm_control and robosuite tasks.
What I'm trying to say is that does tf2rl have any support of image-based RL/IL. or let's say can I pull a request, for example, SAC+AE working on the image-based dm_conttrol suite. https://sites.google.com/view/sac-ae/home. also something like CURL. https://arxiv.org/abs/2004.04136
example of IL, the experiment of the original GAIfO paper is also based on image-based tasks, it uses CNN policy and CNN discriminator. also, VAIL explored video imitation in their experiments.

if there could be a CNN-GAIfO or VAE-GAIfO-SAC. that would be great!
thanks!

Haoyu

Answer 3 · 2020-08-12T13:29:53.000Z

Hi @haoyu-x , sorry I completely forgot to answer your question...

Current tf2rl does not officially support image-based RL/IL (I mean I don't provide no example of such scripts under examples), but we've already implemented the codes to do that.
For example, I think you can solve an easy image-based RL (let's say Pendulum-v0) using SAC + VAE, or do IRL in similar way.

I strongly suggest you use VAE instead of directly inputting image to your policy. Refs are:

Of course it would be awesome if you can send the PR that supports CURL or other image-based RL/IL!

Answer 4 · 2020-08-12T13:35:34.000Z

Thank you！ Kei Ohta <notifications@github.com>于2020年8月12日周三21:30写道：

…

Hi @haoyu-x <https://github.com/haoyu-x> , sorry I completely forgot to answer your question... Current tf2rl does not officially support image-based RL/IL (I mean I don't provide no example of such scripts under examples <https://github.com/keiohta/tf2rl/tree/master/examples>), but we've already implemented the codes to do that. For example, I think you can solve an easy image-based RL (let's say Pendulum-v0) using SAC <https://github.com/keiohta/tf2rl/blob/master/tf2rl/algos/sac.py> + VAE <https://github.com/keiohta/tf2rl/blob/master/tf2rl/tools/vae.py>, or do IRL in similar way. I strongly suggest you use VAE instead of directly inputting image to your policy. Refs are: - Improving Sample Efficiency in Model-Free Reinforcement Learning from Images <https://arxiv.org/abs/1910.01741> - Learning to drive in a day <https://arxiv.org/abs/1807.00412> Of course it would be awesome if you can send the PR that supports CURL or other image-based RL/IL! — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#93 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/APACPZSDJ4VEWI5IABFFTYTSAKKOHANCNFSM4PQGTADQ> .

Answer 5 · 2020-08-12T14:18:28.000Z

I close this issue since it seems to be resolved.
Please reopen this issue if you find other problem related to this.