How to get all data from dbpedia/page (Using Python and SPRQL) #Question
Closed this issue · 1 comments
Hello everyone,
I'm trying to get all the data from https://dbpedia.org/page/Asturias (Abstract, Place, Areacode...etc) using python but without success.
I tried with SPARQL (and Python) making some requests and the result was the same. Does someone know if there is a way to get the all data in a request?
Thanks very much in advance,
David
Sorry for asking in a closed issue but I seem to have the same problem / misunderstanding. I cloned the repo and started the docker container with latest-core
from the databus - which downloaded 5GB of data in the /data directory (the repo has a total of 9GB).
However, the pages (e.g., Berlin) do not show dbo:abstract and other relations - do I need to use another databus repository, is the downloader still loading (however, after 5,2GB nothing changed) or is the full dbpedia not available and I need to use the public endpoints?
Btw. After ~4 hours the two container dbpedia/virtuoso-sparql-endpoint-quickstart:latest
and openlink/virtuoso-opensource-7
are still running and the "log" in the terminal shows lines like this:
store_1 | 12:29:18 * Monitor: CPU% is low while there are large numbers of runnable threads
store_1 | 12:30:20 * Monitor: High disk read (1)
store_1 | 12:30:45 *** read-ahead of a free or out of range page dp L=260741, database not necessarily corrupted.
store_1 | 12:31:18 * Monitor: CPU% is low while there are large numbers of runnable threads
store_1 | 12:31:56 Write wait on column page 568030. Waits should be on the index leaf page, except when col page is held for read by background write
And furthermore: The files in the data directory "long-abstracts_lang=de_uris=en_8EC1.ttl.bz2" only contain the following text:
<p>Temporary Redirect. Redirecting to <a href="http://akswnc7.informatik.uni-leipzig.de/dav/vehnem/replaced-iris/long-abstracts/2020.02.01/long-abstracts_lang=de_uris=en.ttl.bz2">http://akswnc7.informatik.uni-leipzig.de/dav/vehnem/replaced-iris/long-abstracts/2020.02.01/long-abstracts_lang=de_uris=en.ttl.bz2</a></p>
Is this intended?