EPrints Technical Mailing List Archive
See the EPrints wiki for instructions on how to join this mailing list and related information.
Message: #06446
< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First
Re: [EP-tech] Linkcheck
- To: eprints-tech@ecs.soton.ac.uk
- Subject: Re: [EP-tech] Linkcheck
- From: martin.braendle@id.uzh.ch
- Date: Tue, 18 Apr 2017 15:38:05 +0200
OK, it is on GitHub now: https://github.com/eprintsug/LinkCheck
Regards,
Martin
Centro de Documentación ---11/04/2017 18:52:40---Hi Martin, I like it :) It's a very useful tool. No one likes dead links.
Von: Centro de Documentación <cendocu@gmail.com>
An: eprints-tech@ecs.soton.ac.uk
Datum: 11/04/2017 18:52
Betreff: Re: [EP-tech] Linkcheck
Gesendet von: eprints-tech-bounces@ecs.soton.ac.uk
Hi Martin,
I like it :) It's a very useful tool. No one likes dead links.
Regards,
Cristian
On Fri, Apr 7, 2017 at 1:03 PM, <martin.braendle@id.uzh.ch> wrote:
> Hi,
>
> I just wrote a linkcheck crawler that checks the remote URLs stored in an
> EPrints repo and updates the issues list for URLs that have an invalid
> format or report HTTP status codes other than 200.
> Please let me know if there is an interest to have it available, then I will
> put it on GitHub. There's some more work to do, e.g. move some of the
> methods to a plugin so that they can be called from elsewhere.
>
> Please also be aware that by applying a linkcheck crawler your editorial
> team may come under strain to fix all the dead links. Our initial run
> revealed that after 10 years of running our repository, about 25% of the
> URLs (about 7500 in our case) are now working anymore.
>
> The script also produces a report by HTTP status code and that is sorted
> either by eprint id or by URL. The latter allows to identify patterns so
> that URLs can be replaced or removed in batch.
>
> Best regards,
>
> Martin
>
> --
> Dr. Martin Brändle
> Zentrale Informatik
> Universität Zürich
> Stampfenbachstr. 73
> CH-8006 Zürich
>
- References:
- [EP-tech] Linkcheck
- From: martin.braendle@id.uzh.ch
- Re: [EP-tech] Linkcheck
- From: Centro de Documentación <cendocu@gmail.com>
- [EP-tech] Linkcheck
- Prev by Date: [EP-tech] How to migration from 3.3.12 to 3.3.15?
- Next by Date: Re: [EP-tech] Virtual Field derived by SQL query or code and available for searching.
- Previous by thread: Re: [EP-tech] Linkcheck
- Next by thread: Re: [EP-tech] Antwort: Re: BatchEdit permission name
- Index(es):