[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: should we change dead links in source tree to archive.org?
From: |
Kai Torben Ohlhus |
Subject: |
Re: should we change dead links in source tree to archive.org? |
Date: |
Fri, 26 Jul 2019 15:20:55 +0900 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.7.2 |
On 7/25/19 5:26 PM, Andreas Weber wrote:
> Dear all,
>
> there are approx. 500 links in the source tree (gnulib excluded) and
> many of them are dead now, for example
>
> inpolygon.m: http://local.wasp.uwa.edu.au/~pbourke/geometry/insidepoly/
>
> but archive.org has a copy. The newest I've found is
> https://web.archive.org/web/20070923104638/http://local.wasp.uwa.edu.au/~pbourke/geometry/insidepoly/
>
> So my question is: Should we replace dead links with archive.org links?
>
> -- Andy
>
Good catch. For me this is indeed an unexpected large number of links
in the source tree.
Agreed, dead links are of little use and if the link content can be
found on archive.org, why not pointing to this resource instead of
pointing nowhere.
Do you have some fancy script or bash-one-liner to do this work? I
think your detection method is more sophisticated than mine ^^
HTTPS and HTTP
$ grep --exclude-dir=gnulib --exclude-dir=libgnu --exclude-dir=.hg
--exclude-dir=autom4te.cache -R -E "https{0,1}:\/\/" | grep -v
https://www.gnu.org/licenses | grep -v
https://www.gnu.org/software/octave | grep -v https://www.octave.org | wc -l
1767
HTTP
$ grep --exclude-dir=gnulib --exclude-dir=libgnu --exclude-dir=.hg
--exclude-dir=autom4te.cache -R -E "http:\/\/" | grep -v
https://www.gnu.org/licenses | grep -v
https://www.gnu.org/software/octave | grep -v https://www.octave.org | wc -l
1293
We can also hunt for some old HTTP links. This seems like a long term
task. A new contribution guideline might be to only include "permanent"
links to resources within documentation and source code.
Best,
Kai