Git-Mediawiki/Git-Mediawiki

How to get pages even if namespace is undefined?

egonw opened this issue · 5 comments

egonw commented

Thanks for this really interesting tool! Can you please point me to some instructions how I can get pages in a namespace, even if the MediaWiki does not formally defines the namespace?

Particularly, I am trying to get pages like http://www.wikipathways.org/index.php/Pathway:WP22

And while it picks up nicely the rest of this MediaWiki website, but not the Pathway:WPxxxx pages and I note this message:

Namespace Pathway not found in cache, querying the wiki ...
No such namespace Pathway on MediaWiki.

Is there anyway to still download the history of these pages too?

moy commented

The message just tells you that the tool tried to see if Pathname was a namespace. But this still does the right thing:

git clone -c remote.origin.pages='Pathway:WP22' mediawiki::http://www.wikipathways.org/
egonw commented

Is there also some way of using some regular expression here? Because this MediaWiki installation has almost 3000 WP pages with some history... Something like the following?

git clone -c remote.origin.pages='Pathway:WP*' mediawiki::http://www.wikipathways.org/

i believe namespace support in #10 may do exactly what you need here.

egonw commented

@anarcat, not sure how to use that... note there is no matching namespace... I am not sure what command to use. Help would be appreciated.

oh. well i thought it was a namespace thing, so i blindly closed this, my apologies. :)