A collection of existing semantic resources which can be linked to build a Global Semantic Knowledge Commons (GSKC). At present (2022-09) "We" is a group developing GSKC for Climate Change, and reaching out to other potential nodes. But the approach is generic and we can see an emerging GSKC community.
Currently much information is "published" as monolithic documents. Scholarly publishing is irretrievably broken (cannot be mended), and the PublisherAcademicComplex has crippled the development of the Semantic Web in creating/distributing knowledge. GSKC (aka SemanticObelisk) is a bottom-up low-profile approach to redress this, in the same way as the BlueObelisk.org in chemistry played an important part in promoting Open Data, Open Source software, and Open Standards. If we promote and coordinate our existing tools and resources we can build an impressive and useful Knowledge Commons . TODAY.
The Blue Obelisk is self-organising and makes nodes and connections organically: . SGKC grows in the same manner.
[https://en.wikipedia.org/wiki/Knowledge_commons] describes a system where everyone has immediate and equitable access to the creation, dissemination, use and re-use of knowledge. "Semantic" means that machines can understand it and help humans to use it. Immediate benefits are increased discoverability, human- and machine-undersstanding of content, filtering, transformation, re-use, etc. As the Open Definition says "free to use, re-use and redistribute (without permission)".
How often have you had to cycle through "click - download - edit - store - search" repeated hundreds of times? A semantic system can automate this - our py/getpapers
can do hundreds of scientific articles a minute. Then search those containing plants and the chemicals they emit. Identify diagrams related to monsoons and extract the places mentioned? Don't understand acronyms like AFOLU? The machine looks these up automatically. Automatically extract clinical trials with over 1000 patients lasting forover a year? etc. etc. Semantics applies to every field - law, psychology, mathematics... all are possible in the Semantic Obelisk.
Or English is not your native language. How is "stubble" related to air pollution? A semantic framework can accommodate languages 1.
The basic tasks for semantic documents include:
- create
- publish
- search
- download
- analyse
- filter
- section and reassemble
- translate
- annotate
- aggregate
- republish
Our emerging community has resources in many of these
Documents in HTML/XML are partially semantic; unfortunately the ubiquitous PDF destroys ALL semantics. We have to build up heuristically, but we've done much of the hard work. So it's possible to download hundreds of PDFs and add semantics to many types of documents.
There are a lot of potential resources - we are starting with those relevant to or project and will contact them to see how to make a semantic connection. No particular order... We'll have a discussion page for each
A huge global collection of identifiers in every discipline. A keystone of our project.
Millions of open access papers, and now preprints
Millions of preprints, mainly in biomedical
https://www.redalyc.org/ . Mexico/Latin America. 770K+ open articles. A focus for the Global South
Indonesia preprints
India preprints
Unsupervised classification of the Open Literature
Sam Moore Egon Willighagen Chris Hartgerink Dasapta Erwin (ID) Arianna Bec (MX) Ross Mounce Daniel Mietchen Gita Yadav (IN) The Shuttleworth Fellowship community Blue Obelisk