[Re] Exploration in Model-based Reinforcement Learning by Empirically Estimating Learning Progress
AugustinChrtn opened this issue · 50 comments
Original article: Lopes, M., Lang, T., Toussaint, M., & Oudeyer, P. Y. (2012). Exploration in model-based reinforcement learning by empirically estimating learning progress. Advances in Neural Information Processing Systems, 25.
PDF URL: https://github.com/AugustinChrtn/Reproduction/blob/master/article.pdf
Metadata URL: https://github.com/AugustinChrtn/Reproduction/blob/master/metadata.yaml
Code URL: https://github.com/AugustinChrtn/Reproduction/
Scientific domain: Computational Neuroscience
Programming language: Python
Suggested editor:
Thank you for your submission. An editor will soon be assigned. @benoit-girard Can you handle this submission (either editing it or assigning an editor)?
Thank you for your answer! @benoit-girard is my PhD supervisor and cannot edit this submission due to conflict of interest.
Oh sorry, I didn't check the authors... @oliviaguest Can you handle this submission (either editing it or assigning an editor)?
Yes, I can, @rougier.
Thank you @oliviaguest!
Hi @hhihn @kkhetarpal @ghost-nn-machine @MA-Ramirez can any/all of you review this for ReScience C? Please let me know as soon as possible if you can take this on. I would appreciate that of course, but also be aware I might be harder to reach than usual till next week. Thank you!
Dear editors, Dear @rougier, Dear @oliviaguest,
Any news from the reviewing process?
I wish you all a great summer!
Best regards
mehdi
Hi @MehdiKhamassi I guess August is not the best month to find reviewers. Let's hope we'll find them early September.
@oliviaguest Any progress on that?
Sorry indeed. It's so slow-going. As you see, nobody even replied above; I'll keep trying. If you have any suggestions, please give them to us.
Dear all,
thank you very much for the invitation to review the article.
Nonetheless, this time I won't be able to review it due to time constraints.
All the best
@oliviaguest You can try asking all the reviewers at once to check whether someone is willing to review.
Dear @ReScience/reviewers, can anybody take this on?
I would be willing to review this submission. But do note that I might be largely unreachable between Oct 20 and end-November. I can have my initial review submitted prior to this period.
[Edit on Sep 26]
As the upcoming weeks are going to be busy, I would no longer be available for review during this period. I can revisit this in December if needed.
I would be happy to review this. However, we should also have a "compute requirements" section in the submission process, to better understand what resources reviewers might need.
@oliviaguest I think you have two potential reviewers.
I am sorry for taking so long to check this; I have been unwell. Apologies to all above.
@HaoZeke and @appukuttan-shailesh thank you for offering to review this. Please let me know if you need any help. Please see here for some information: https://rescience.github.io/edit/
Hi @oliviaguest,
As I updated in my post above, I am currently away and won't be able to take this up before December. Also, once back I will be moving to a new city and therefore might be slow in proceeding with this. I would therefore recommend appointing another reviewer to speed things along. If nobody volunteers by December, I will be willing to take it up on my return.
@appukuttan-shailesh thank you for the heads up.
@oliviaguest @appukuttan-shailesh @HaoZeke Happy New Year all, and a gentle reminder for the review. If we can target end of January that would be cool, since we'll soon transition to a new website.
@rougier, @oliviaguest: I am afraid I won't be able to attend to this in the short term. As mentioned previously, I have just moved to take up a new position and am therefore a bit occupied at the moment. It would be best to appoint another reviewer for a faster evaluation.
@appukuttan-shailesh ok
@oliviaguest can you find another reviewer?
Dear editors, Dear @rougier, Dear @oliviaguest,
Any news from the reviewing process?
I wish you all a nice day!
Best regards
mehdi
@HaoZeke Are you still available to review? If so you can start (don't wait for the second reviewer, I mean). If you could do it in two weeks that would be wonderful.
@oliviaguest Can you find another reviewer, or should we broadcast to all reviewers?
@rougier can reviewers be tagged like this: @ReScience/reviewers? If so, that's great. If anyone has the time and capacity, please let us know.
Another solution would be to ask one of the ReScience published authors if they are willing to review.
I can review this work, should be able to return the review by the end of March.
In this submission, the authors attempt to reproduce the work of Lopes et al. (Exploration in Model-based Reinforcement Learning by Empirically Estimating Learning Progress). They discuss the challenges, discrepancies, and potential solutions for resolving the identified issues. While the authors of the original paper have not released their source code, they have assisted the authors of this paper and have clarified some aspects of the work. Although this paper does not fall within my area of expertise, I have attempted to read, understand, and follow the authors' instructions.
I started by forking and cloning the source code released by submission 73. The list of dependencies used by the authors is rather short, which reduces the chance of dependency-management issues (dependency hell). Containerizing the environment to run everything inside a Docker container was straightforward. The Dockerfile, the steps I took to reproduce the results, and the output of the scripts I ran have been added to the fork inside a directory called artifact.
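For illustration, a minimal sketch of what such a container setup can look like is below. This is not the actual Dockerfile from my fork; the base image tag and the `requirements.txt` / `main.py` file names are assumptions here rather than the repository's exact layout.

```dockerfile
# Illustrative sketch only; the real Dockerfile lives in the fork's
# artifact directory. Assumes pinned dependencies in requirements.txt
# and main.py as the entry point.
FROM python:3.9-slim

WORKDIR /app

# Install the pinned dependencies first, so this layer is cached
# when only the source files change.
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Copy the repository and run the main experiment script.
COPY . .
CMD ["python", "main.py"]
```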
The repository released by the authors is sufficiently documented and properly structured. It took approximately two hours to run all the scripts on a modern workstation. While re-running the scripts provided by the authors was easy, sifting through the results to find the corresponding figure was more convoluted than it should be. To put it simply, the code generates too many figures, and the naming schema is not clear enough to identify the matching figure in the paper. This is not a major issue, but the authors could refactor the naming schema to include the figure number whenever a figure appears in their paper.
The authors extensively discuss their findings and the challenges they have faced. In particular, Table 2 provides a great overview of all the challenges, the resolutions, and the results. Although they did not manage to reproduce all of the results, their investigation is thorough and the clarifications provided by the original authors are helpful. Overall, I believe this submission complements the original paper.
Question for the authors:
Can you provide some information on how to match the figures generated by the code to the figures in the paper?
@oliviaguest I can do the review, give me a few days (and a reminder if I'm late)
@AugustinChrtn I've cloned and installed your dependencies but I get a problem running main. I'll open an issue on your repo.
Dear @mo-arvan, thank you for your detailed feedback, the Docker container, and your question.
I modified the structure of the repository and clarified how to match each plot to a figure of the article. I separated the different metrics (or agents, for parameter fitting) into different folders. In addition, the plots the code generates now include in their file names the number of the figure they correspond to. I also added a clearer output for the computation time and for the figures the code generates as it runs. Thank you for pointing this out, and I hope that these changes will make the results of the code easier to reuse and understand. Please let me know if I should change anything else in the code or in the article!
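To illustrate the idea, here is a minimal sketch of such a naming schema. This is not the actual code from the repository; the `save_figure` helper and its parameters are hypothetical.

```python
import os
import matplotlib.pyplot as plt

def save_figure(fig, figure_number, description, out_dir="figures"):
    """Save a plot with the article's figure number in its file name,
    e.g. figures/figure_3_learning_progress.png (illustrative only)."""
    os.makedirs(out_dir, exist_ok=True)
    filename = f"figure_{figure_number}_{description}.png"
    fig.savefig(os.path.join(out_dir, filename), dpi=300)
    plt.close(fig)

# Example: a plot corresponding to Figure 3 of the article.
fig, ax = plt.subplots()
ax.plot([0, 1], [0, 1])
save_figure(fig, 3, "learning_progress")
```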
Dear @rougier, I responded to your issue and I hope that you can run main now. Please let me know if you cannot.
Thanks, seems to be working flawlessly now (and good point with the estimated time)
I've read the article and I would like to congratulate you on this work. The paper is really clear, it explains both the different difficulties and how you managed to solve them, and it represents a fair amount of work. I had only a small problem during testing, but it has been fixed since then. I have only some minor points:
- Your requirements file is a bit strict in terms of versions for the different libraries. On the one hand this ensures reproducibility (which is good), but on the other hand you need to install a specific Python version, which I found a bit annoying. This does not need correction and your precise specification is OK. Just to let you know.
- You need to add a license to your code, because no license means no right for anyone to do anything with your code (see https://choosealicense.com/). We have no recommendation for ReScience as long as it is an open license.
The "major" point concerns the title/conclusion. The semantic is [Re] for replication, [Β¬Re] for failed replication and [~Re] for approximate replication which I think correspond to your case since you did not manage to replicate all the results in spite of a lot of efforts you put in this work. The idea with the title is to quickly warn the reader that the work is not totally reproducible. If you disagree with the change [Re] to [~Re], we can discuss it.
Once this is done, I think the paper can be published. I'll let @oliviaguest decide on the next step.
Dear @rougier, thank you very much for your feedback. We did spend a lot of time and effort replicating this paper and we're very happy that you liked it. I answer all of your points below:
Minor points:
- I agree that the requirements file is rather strict. It was to make sure that people would be able to generate the exact same plots and results as the ones we present.
- Thank you for pointing it out. I added an MIT license.
Major point: I do agree that the paper is ~Re rather than Re. I changed the metadata (and the title).
We would like to thank the two reviewers again for their feedback, and we are ready to proceed depending on @oliviaguest's instructions.
Thanks. Let's wait until Monday for @oliviaguest to react, else I'll publish the paper.
Dear @rougier, @oliviaguest,
I am ready to proceed depending on your instructions.
Sorry for the delay. @oliviaguest can we publish?
@AugustinChrtn In the meantime, you can start filling the metadata with all the information from this review.
@rougier Thank you for the instructions. I updated the metadata.yaml file!
Dear Editors, Dear @rougier , Dear @oliviaguest ,
This is my yearly message asking whether there is any news from the reviewing process :-)
I wish you all a great summer!
Best regards
mehdi
Sorry for the loooong delay. I'll publish the paper today. @AugustinChrtn Do you have a link to the LaTeX files? It'll make my life easier.
Sure! I downloaded the source files from Overleaf. Please let me know if you need something else!
Augustin
[Rescience Chartouny.zip](url)
Thanks. Can you check https://sandbox.zenodo.org/records/105346 if everything looks right? This is a sandbox version, not the final publication.
I checked the sandbox version and everything looks right to me! Thank you!
OK, so let's try to publish the final version.
It's online https://zenodo.org/records/13627804 !!! Congratulations!
Thank you very much @rougier!