common-workflow-language/cwl-website

linkchecker needs to take into account actual root location; add ignore list; and break the build if a link is broken

mr-c opened this issue · 4 comments

mr-c commented

@lunacodes can you open a PR to fix these soonish?
https://github.com/common-workflow-language/common-workflow-language.github.io/blob/21704381d24d5774a12deab9503fc4541c87ea66/linkchecker-report.txt

@tetron can you configure the website build to fail if there are broken links?

@mr-c I think there's something wrong with the link checker tool. If those things were broken, the entire page would appear really messed up. It appears what's happening is that the tool is not picking up the fact that the file path for all these things is under /content/. So it's looking for e.g. /assets/js/bootstrap.min.js without understanding that the file path (not url) is actually /content/assets/js/bootstrap.min.js. You can verify this sort of thing by opening up your browser's Developer Tools (right click and hit inspect, or else F12 often works) and checking the Console tab. You'll see 404 errors there, if there's an issue.

I ran the site through brokenlinkcheck.com and ahrefs.com/broken-link-checker, which did find one broken link for the homepage (which I'll open a PR for in a few), and a handful for the various doc pages. I'm less familiar with those sections of the site, though I can try to take a look at them, if you want.

Here's the output from the ahrefs.com checker. The left column shows the page containing the broken link; the right column is the actual broken link:

Referring page
Anchor and backlink

Common Workflow Language (CWL) Command Line Tool Description, v1.1

www.commonwl.org/v1.1/CommandLineTool.html

field we recommend their RDF encoding: http://schema.org/version/latest/schema.rdf

schema.org/version/latest/schema.rdf
404 NOT FOUND

Common Workflow Language (CWL) Command Line Tool Description, v1.2

www.commonwl.org/v1.2/CommandLineTool.html

field we recommend their RDF encoding: http://schema.org/version/latest/schema.rdf

schema.org/version/latest/schema.rdf
404 NOT FOUND

Common Workflow Language (CWL) Workflow Description, v1.2

www.commonwl.org/v1.2/Workflow.html

field we recommend their RDF encoding: http://schema.org/version/latest/schema.rdf

schema.org/version/latest/schema.rdf
404 NOT FOUND

Common Workflow Language (CWL) Workflow Description, v1.1

www.commonwl.org/v1.1/Workflow.html

field we recommend their RDF encoding: http://schema.org/version/latest/schema.rdf

schema.org/version/latest/schema.rdf
404 NOT FOUND

trick_revsort.cwl - Common Workflow Language Viewer

view.commonwl.org/workflows/github.com/common-workflow-language/cwltool/blob/4700fbee9a5a3271eef8bc9ee595619d0720431b/tests/wf/trick_revsort.cwl

File [format]

www.iana.org/assignments/media-types/text/plain
404 NOT FOUND

prefactor.cwl - Common Workflow Language Viewer

view.commonwl.org/workflows/github.com/EOSC-LOFAR/prefactor-cwl/blob/master/prefactor.cwl

File [format]

docs.scipy.org/doc/numpy-dev/neps/npy-format.html
404 NOT FOUND

trick_revsort.cwl - Common Workflow Language Viewer

view.commonwl.org/workflows/github.com/common-workflow-language/cwltool/blob/9f3b9e7b74d5a904b12674dfd1300b56a48c3d33/tests/wf/trick_revsort.cwl

File [format]

www.iana.org/assignments/media-types/text/plain
404 NOT FOUND

trick_revsort.cwl - Common Workflow Language Viewer

view.commonwl.org/workflows/github.com/common-workflow-language/cwltool/blob/eba80916b5cde8bdbd56c077c94240ddf796a27b/tests/wf/trick_revsort.cwl

File [format]

www.iana.org/assignments/media-types/text/plain
404 NOT FOUND

revsort.cwl - Common Workflow Language Viewer

view.commonwl.org/workflows/github.com/common-workflow-language/cwltool/blob/2710cfe731374cf7244116dd7186fc2b6e4af344/tests/wf/revsort.cwl

File [format]

www.iana.org/assignments/media-types/text/plain
404 NOT FOUND

revsort.cwl - Common Workflow Language Viewer

view.commonwl.org/workflows/github.com/common-workflow-language/cwltool/blob/445f6b3a214f0c48317b25f7af278ae919ce5403/tests/wf/revsort.cwl

File [format]

www.iana.org/assignments/media-types/text/plain
404 NOT FOUND

Broken Link in Code Libraries section of homepage:

Ln. 64 of _includes/home/software-for-working-with-cwl.html currently reads:
|[cwltool](https://github.com/common-workflow-language/cwltool)|cwltool (can be [imported as a Python module](https://www.commonwl.org/(https://github.com/common-workflow-language/cwltool#import-as-a-module)) and [extended to create custom cwl runners](https://github.com/common-workflow-language/cwltool#extension-points)|

Should read:
|[cwltool](https://github.com/common-workflow-language/cwltool)|cwltool (can be [imported as a Python module](https://github.com/common-workflow-language/cwltool#import-as-a-module) and [extended to create custom cwl runners](https://github.com/common-workflow-language/cwltool#extension-points)|

mr-c commented

Thanks @lunacodes for the analaysis! @tetron can you configure the linkchecker to take this other root path into consideration?

mr-c commented

I fixed the link checker, and some broken links, in #86