EPrints Technical Mailing List Archive
See the EPrints wiki for instructions on how to join this mailing list and related information.
Message: #09624
RE: [EP-tech] coversheets
- To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>
- Subject: RE: [EP-tech] coversheets
- From: George Macgregor <George.Macgregor@glasgow.ac.uk>
- Date: Fri, 16 Feb 2024 09:29:17 +0000
CAUTION: This e-mail originated outside the University of Southampton.
Hi Tomasz and Will -- Coversheets intuitively appear to make a lot of sense because, as Will says, they enable a level of branding and can ensure license details are clear. However, they are *explicitly bad* for indexing by Google Scholar [1], [2]. They disrupt the automatic metadata extraction processes used by GS, something GS has become very good at, e.g. to supplement associated metadata. For a time, we were using automatic coversheets at my previous institution (Strathclyde) but I disabled them circa 2016 as it became clearer that they could be problematic (not just for GS, incidentally).
We are using coversheets at my new institution (Glasgow); however, this is something we will seek to review soon. Alternative options:
The team at Strathclyde began adding footers to manuscripts containing any relevant info. They were already performing so much file handling (e.g. improving formatting, accessibility, etc.) that inserting details in a footer wasn’t especially onerous.

One final comment. I undertook some repository discoverability research a few years ago [3]. Coversheets were identified as a potential issue, but I concluded that it was impossible to measure the negative influence they had on repository indexing because repository visibility tends to be about the ‘accumulation of marginal gains’. But given that GS explicitly advise against them, I suppose we must conclude that they are indeed a bad idea.

Hope this helps!

George

[1] Acharya, Anurag. (2015) Indexing repositories: pitfalls and best practices, https://purl.dlib.indiana.edu/iudl/media/6537033b6s
[2] Westin, Monica. (2021) Google Scholar Indexing for Repositories: Best Practices and Fixes for Common Indexing Problems, https://youtu.be/C-miRaROsaE?si=3sGDP-bD9MwDE7BX
[3] Macgregor, George. (2020). "Enhancing Content Discovery of Open Repositories: An Analytics-Based Evaluation of Repository Optimizations" Publications 8, no. 1: 8. https://doi.org/10.3390/publications8010008
--
Dr George Macgregor | Assistant Director – Digital Library Information Services | University of Glasgow
Web: https://purl.org/g3om4c | Fediverse: @g3om4c@code4lib.social
Mobile: +44 (0)7977 858281
The University of Glasgow is a registered Scottish charity: Registration Number SC004401

From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> On Behalf Of Tomasz Neugebauer

Thank you, Will, that's helpful. This plugin has a condition in the code that sends Google indexer to the "original" PDF that doesn't have a cover page, right? Was there a reason for that change? Did it cause problems with Google indexer to have the PDF with the cover page?

Another follow-up question, if we're adding a cover page, wouldn't it also make sense to then add metadata to the PDF file (in the description of it, so that would be the author names, title, etc.)?

Tomasz
________________________________________________
Tomasz Neugebauer Tel. / Tél. 514-848-2424 ext. / poste 7738
Mailing address / adresse postale: 1455 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8
library.concordia.ca

From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> on behalf of Will Fyson <will.fyson@cosector.com>

Attention This email originates from outside the concordia.ca domain. // Ce courriel provient de l'extérieur du domaine de concordia.ca

Hi Tomasz,

There is an alternative coversheets plugin at
https://github.com/eprintsug/coversheets-latex/tree/data_checking which we use at CoSector. This one uses LaTeX for its coverpages (which may be a downside!) and regrettably the documentation isn't super up to date. But if you are familiar with LaTeX, it hopefully isn't too tricky to get up and running.

As to why people are keen to have coverpages, I'm not so sure! I've always been under the impression it's a way for institutions to brand some content as coming from their repository and being affiliated with the institution, but I imagine there are also some Google Scholar indexing benefits.

Many thanks,
Will

Will Fyson
Development & Support Analyst, Research Technologies
CoSector, University of London
Senate House, Malet Street, London WC1E 7HU
t: +44 (0)20 7863 1341
w: https://cosector.com/digital-research/
The University of London is an exempt charity in England and Wales.

From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> on behalf of Tomasz Neugebauer <Tomasz.Neugebauer@concordia.ca>

A question for the community: would you recommend the coversheets plugin? The code hasn't been updated since 2014, so is it working well? What are the main advantages/disadvantages of it? More specifically, does it make it easier for Google Scholar to index the content? Is it a practical way to satisfy some publisher requirements that demand a coverpage statement like this?

Thank you for any comments on this!

Tomasz
________________________________________________
Tomasz Neugebauer Tel. / Tél. 514-848-2424 ext. / poste 7738
Mailing address / adresse postale: 1455 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8
library.concordia.ca

https://bazaar.eprints.org/350/