fedwiki/wiki

How to make this work with some hosted database?

sergueif opened this issue · 9 comments

Hi, I really love this project and would love to write a whole bunch of good stuff on my wiki node, but because pages are stored in flat files, a single EC2 crash wiped half of my pages.

Rather than cobble together backup/restore processes, I'd love to use a durable database that's hosted, backed up, and monitored by someone else (like Amazon or a DB-as-a-service provider).

Hence the question: Has anyone gotten fedwiki to work with a hosted database?

Preferably Amazon, but some other provider is fine too.

I'm sorry you have lost pages.

The ReadMe describes support we have for databases other than flat files.
https://github.com/fedwiki/wiki#datastore-options

This is the script I use to make daily backups of my farms.

# nightly backup of asia.wiki.org sites, run from cron:
# 40 4 * * * (cd wiki/asia; sh backup.sh)
when=$(date +%a)                      # day name, e.g. Mon; rotates a week of backups
how='tar -chz .wiki'                  # tar the farm's data directory to stdout
ssh root@asia.wiki.org $how > "store/$when.tgz"

There are some storage plugins, but they have not been looked at for a very long time, and do not work with the wiki server in farm mode.

An alternative might be to look at using an EBS-backed instance to provide persistence.

I notice that all the existing storage plugins are blob-store like.

Is there anything about fedwiki that would combine poorly with a Postgres/RDS relational database? If it's really JSON blobs that fit fedwiki best, Postgres has blob and JSON column types.

I'd like to take a stab, within the next 1-3 months, at contributing either an RDS/PostgreSQL or a DynamoDB storage plugin. Both are fully managed Amazon services, and both offer scheduled backups to S3.

Has anyone you know already made any efforts on either storage plugin?

We don't share the server's store other than by web requests for json structures. The structures are more like sentences in a grammar than rows in a table which is what leads us to blob storage.

There is a page that sketches this grammar and calls it the 'json schema' for pages.
http://ward.asia.wiki.org/json-schema.html

This and a few small details, like how we convert titles to slugs, are the essence of the federation.
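For what it's worth, the slug rule can be sketched in a few lines of shell. This is my reading of it (whitespace to hyphens, drop anything that isn't a letter, digit, or hyphen, then lowercase), not an authoritative copy of the client's code, and edge cases may differ:

```shell
#!/bin/sh
# Sketch of the title-to-slug conversion as I understand it.
slug() {
  printf '%s' "$1" \
    | tr '[:space:]' '-' \
    | tr -cd 'A-Za-z0-9-' \
    | tr 'A-Z' 'a-z'
}

echo "$(slug 'Welcome Visitors')"   # prints welcome-visitors
```

So `Welcome Visitors` maps to `welcome-visitors`, which is also why the flat-file store can use the slug directly as a file name.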

I didn't fully answer your question because I've not used those Amazon services. If you know your way around AWS and you're just doing personal stuff, you might try backing up your EC2 instance to S3 on a timer. I find that I go to my own backups when I accidentally trash a page and want to recover just that page; with my system, that is a single ssh/tar command. I don't know how that would work with RDS/PostgreSQL automatic backups: probably full restores, nothing surgical.
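To make "surgical" concrete, here is a sketch of recovering one page from such a tarball. It assumes the flat-file layout where each page is a JSON file at .wiki/pages/<slug>, and it builds a throwaway directory so it is safe to try as-is:

```shell
#!/bin/sh
# Sketch: recover a single page from a nightly tarball.
set -e
demo=$(mktemp -d)
cd "$demo"

# Stand-in for a live wiki: one flat file per page, named by slug.
mkdir -p .wiki/pages
echo '{"title": "Welcome Visitors", "story": []}' > .wiki/pages/welcome-visitors

# Nightly backup, as in the cron script above.
tar -chzf backup.tgz .wiki

# Oops: the page gets trashed.
rm .wiki/pages/welcome-visitors

# Surgical restore: extract just the one file from the archive.
tar -xzf backup.tgz .wiki/pages/welcome-visitors
cat .wiki/pages/welcome-visitors
```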

We do offer end-user full site backups to a single json file. These can be dropped onto any site to bring up a page viewer from which backed-up pages can be 'forked' into the site. Here, for example, is an export of our glossary:

http://glossary.asia.wiki.org/system/export.json

If that were your site you could 'Save As' to, say, glossary-dec-2016.json to have a full backup with support for selective restore.
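A sketch of what selective restore could look like outside the browser, assuming the export is one JSON object keyed by slug (my reading of the /system/export.json format). It builds a tiny stand-in export file so it runs anywhere; python3 does the JSON handling, though jq would work just as well:

```shell
#!/bin/sh
# Sketch: pull a single page back out of a full-site export by slug.
set -e
work=$(mktemp -d)
cd "$work"

# Stand-in for a downloaded glossary-dec-2016.json:
cat > glossary-dec-2016.json <<'EOF'
{"welcome-visitors": {"title": "Welcome Visitors", "story": []},
 "glossary": {"title": "Glossary", "story": []}}
EOF

# Extract one page object into its own file:
python3 - glossary-dec-2016.json glossary glossary.json <<'EOF'
import json, sys
export, slug, out = sys.argv[1], sys.argv[2], sys.argv[3]
with open(export) as f:
    pages = json.load(f)
with open(out, "w") as f:
    json.dump(pages[slug], f)
EOF

cat glossary.json
```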

Thank you @WardCunningham . I am now backing up my wiki farm to an s3 bucket from a cronjob.
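For anyone landing here later, the shape of that cron job is roughly this. The bucket name is a placeholder, the aws CLI upload is commented out because it needs configured credentials, and the sketch builds a throwaway .wiki so the tar step can be tried anywhere:

```shell
#!/bin/sh
# nightly backup of a wiki farm to S3, run from cron, e.g.:
# 40 4 * * * sh backup-to-s3.sh
set -e

home=$(mktemp -d)                     # stand-in for $HOME in this sketch
mkdir -p "$home/.wiki/pages"
echo '{"title": "Demo"}' > "$home/.wiki/pages/demo"

when=$(date +%a)                      # day name rotates a week of backups
tar -chzf "/tmp/wiki-$when.tgz" -C "$home" .wiki

# Upload step (placeholder bucket; requires configured AWS credentials):
# aws s3 cp "/tmp/wiki-$when.tgz" "s3://my-wiki-backups/wiki-$when.tgz"
echo "wrote /tmp/wiki-$when.tgz"
```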

It was uncomfortable at first to settle on something like cron and scripts because, coming from industry, it's standard practice to separate software like this wiki from its database host so the two can be scaled out independently. But for this application, that is perhaps overkill.

On the other hand, I do wish for adoption of federated wiki in corporate settings, where such deployment practice might be a policy.

I'll keep this issue open, because technically farm mode is unsupported, or at best poorly supported, with a remote external database. But I'm OK if activity dies down; the responses have been very helpful and I'm now running the solutions described above.

The linked issues in #24 suggest that at least the CouchDB adaptor is capable of farming. fedwiki/wiki-storage-couchdb#4

Is anyone using any database adapter successfully? How?