EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #09624


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

RE: [EP-tech] coversheets


CAUTION: This e-mail originated outside the University of Southampton.

Hi Tomasz and Will -- Coversheets intuitively appear to make a lot of sense because, as Will says, they enable a level of branding and can ensure license details are clear. However, they are *explicitly bad* for indexing by Google Scholar [1], [2]. They disrupt automatic metadata extraction processes used by GS, a feature that GS has become very good at doing, e.g. to supplement associated metadata.

 

For a time, we were using automatic coversheets at my previous institution (Strathclyde) but I disabled them circa 2016 as it became clearer that they could be problematic (not just for GS, incidentally). We are using coversheets at my new institution (Glasgow); however, this is something we will seek to review soon.

 

Alternative options:

  • Instead of pre-pending the coversheet, you could consider post-pending, thereby including relevant branding, copyright info, bibliographic citation, etc. without screwing up GS indexing. The original coversheet plugin (assuming it works with 3.4+) enabled such a configuration.
  • Seek to develop a ‘watermark’, arXiv-esque plugin! 😉 I attempted this while at Strathclyde but, a) I’m a coding charlatan and so it wasn’t very good, and b) even when it was successful, it was spectacularly difficult to anticipate the formatting of accepted author manuscripts such that any watermark could be sized and located appropriately on the page. Watermarks would end up obscuring text, etc. It was a disaster.

 

The team at Strathclyde began added footers to manuscripts containing any relevant info. They were already performing a lot of file handling (e.g. improving formatting, accessibility, etc.) that inserting details in a footer wasn’t especially onerous.

 

One final comment. I undertook some repository discoverability research a few years ago [3]. Coversheets were identified as a potential issue, but I concluded that it was impossible to measure the negative influence they had on repository indexing because repository visibility tends to be about the ‘accumulation of marginal gains’. But given that GS explicitly advise against them, I suppose we must conclude that they are indeed a bad idea.

 

Hope this helps!

 

George

[1] Acharya, Anurag. (2015) Indexing repositories: pitfalls and best practices, https://purl.dlib.indiana.edu/iudl/media/6537033b6s

[2] Westin, Monica. (2021) Google Scholar Indexing for Repositories: Best Practices and Fixes for Common Indexing Problems, https://youtu.be/C-miRaROsaE?si=3sGDP-bD9MwDE7BX

[3] Macgregor, George. (2020). "Enhancing Content Discovery of Open Repositories: An Analytics-Based Evaluation of Repository Optimizations" Publications 8, no. 1: 8. https://doi.org/10.3390/publications8010008

 

--

Dr George Macgregor | Assistant Director – Digital Library

Information Services | University of Glasgow

Web: https://purl.org/g3om4c  | Fediverse: @g3om4c@code4lib.social
orcid.org/0000-0002-8482-3973

Mobile: +44 (0)7977 858281
--

The University of Glasgow is a registered Scottish charity: Registration Number SC004401

 

Athena SWAN Gold logo

 

 

From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> On Behalf Of Tomasz Neugebauer
Sent: Thursday, February 15, 2024 8:09 PM
To: eprints-tech@ecs.soton.ac.uk
Subject: Re: [EP-tech] coversheets

 

CAUTION: This e-mail originated outside the University of Southampton.

CAUTION: This e-mail originated outside the University of Southampton.

 

Thank you, Will, that's helpful.  This plugin has a condition in the code that sends Google indexer to the "original" PDF that doesn't have a cover page, right?  

Was there a reason for that change?  Did it cause problems with Google indexer to have the PDF with the cover page?

 

Another follow-up question, if we're adding a cover page, wouldn't it also make sense to then add metadata to the PDF file (in the description of it, so that would be the author names, title, etc.)?

 

Tomasz

 

 

 

 

________________________________________________

Tomasz Neugebauer
Senior Librarian | Bibliothécaire titulaire
Digital Projects & Systems Development Librarian / Bibliothécaire des Projets Numériques & Développement de Systèmes
Concordia University / Université Concordia

Tel. / Tél. 514-848-2424 ext. / poste 7738
Email / courriel:
tomasz.neugebauer@concordia.ca

Mailing address / adresse postale: 1455 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8
Street address / adresse municipale: 1400 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8

library.concordia.ca


From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> on behalf of Will Fyson <will.fyson@cosector.com>
Sent: Thursday, February 15, 2024 1:30 PM
To: eprints-tech@ecs.soton.ac.uk <eprints-tech@ecs.soton.ac.uk>
Subject: Re: [EP-tech] coversheets

 

Attention This email originates from outside the concordia.ca domain. // Ce courriel provient de l'extérieur du domaine de concordia.ca

 

 

CAUTION: This e-mail originated outside the University of Southampton.

CAUTION: This e-mail originated outside the University of Southampton.

Hi Tomasz,

 

There is an alternative coversheets plugin at https://github.com/eprintsug/coversheets-latex/tree/data_checking which we use at CoSector. This one uses LaTeX for its coverpages (which may be a downside!) and regretfully the documentation isn't super up to date. But if you are familiar with LaTeX it's not too tricky to get up and running hopefully.

 

As to why people are keen to have coverpages, I'm not so sure! I've always been under the impression it's a way for institution's to brand some content as coming from their repository and being affiliated with the institution, but I imagine there are also some Google Scholar indexing benefits too. 

 

Many thanks,

 

Will

 

 

Will Fyson

Development & Support Analyst, Research Technologies

CoSector, University of London

Senate House

Malet Street

London

WC1E 7HU

 

t: +44 (0)20 7863 1341

e: will.fyson@cosector.com

w: https://cosector.com/digital-research/

 

The University of London is an exempt charity in England and Wales.

 


From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> on behalf of Tomasz Neugebauer <Tomasz.Neugebauer@concordia.ca>
Sent: 15 February 2024 18:04
To: eprints-tech@ecs.soton.ac.uk <eprints-tech@ecs.soton.ac.uk>
Subject: [EP-tech] coversheets

 

CAUTION: This e-mail originated outside the University of Southampton.

CAUTION: This e-mail originated outside the University of Southampton.

A question for the community: would you recommend the coversheets plugin?

The code hasn't been updated since 2014, so is it working well?

What are the main advantages/disadvantages of it?

More specifically, does it make it easier for Google Scholar to index the content?

Is it a practical way to satisfy some publisher requirements that demand a coverpage statement like this?

 

Thank you for any comments on this!

 

Tomasz

 

 

 

________________________________________________

Tomasz Neugebauer
Senior Librarian | Bibliothécaire titulaire
Digital Projects & Systems Development Librarian / Bibliothécaire des Projets Numériques & Développement de Systèmes
Concordia University / Université Concordia

Tel. / Tél. 514-848-2424 ext. / poste 7738
Email / courriel:
tomasz.neugebauer@concordia.ca

Mailing address / adresse postale: 1455 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8
Street address / adresse municipale: 1400 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8

library.concordia.cahttps://bazaar.eprints.org/350/