juzraai/cordis-projects-crawler

H2020 compatible

Closed this issue · 7 comments

Is this crawler compatible with the H2020 file format?
Also, some results briefs are published for FP7.

Thank you for your interest in my project! Unfortunately, the CORDIS website has changed a lot in the past few years, and my application currently cannot parse any of the project pages.

CORDIS also publishes project information as downloadable datasets. You may want to try this one: https://data.europa.eu/euodp/data/dataset/cordisH2020projects

If that doesn't meet your needs, and updating my crawler would be useful for you and others, I can try to do it in the coming days or weeks.

@davidpitl OK, I'll upgrade the crawler, and I'd appreciate any ideas you have. :) I've just created issue #2 for planning v2.0 of the crawler. We can discuss in the comments here or there, or I'm open to using any other service (Trello, Slack, etc.) for sharing ideas.

These objectives seem OK. I think I will need some help with the unified view, but let's see what I can do in the upgrade first.

Kotlin: I tried it about a year ago and I fell in love with it. :) It has a lot of cool features, and I can work faster in Kotlin than in Java. (But it's still compatible with Java.)

I've just released v2.0.0, which can now fetch and parse any project from CORDIS: old ones, new ones, including H2020 ones. So I'm closing this ticket.

Please try it out, and if you still need a "unified view", please open a new ticket and give details about it.