EPrints Technical Mailing List Archive
Message: #01692
< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First
[EP-tech] Re: Eprints-tech Digest, Vol 54, Issue 17
- To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>
- Subject: [EP-tech] Re: Eprints-tech Digest, Vol 54, Issue 17
- From: Alok Khode <alokkhode@yahoo.com>
- Date: Mon, 11 Mar 2013 18:44:26 +0800 (SGT)
Hi,
I have 12 IRs running as separate instances each for different labs of "CSIR India" on a central server csircentral.net with subdomains like: ncl.csircentral.net, cecri.csircentral.net etc.
I am the overall administrator but unable to answer queries on "Embargo expiry date". User complaining that documents are getting opened by registered users even before the "Embargo expiry date".
In my view, registered users can access these document anytime even before expiry of Embargo date. But users are not convinced, please clarify this so that I can satisfy them with the authentic response from Eprints
From: "eprints-tech-request@ecs.soton.ac.uk" <eprints-tech-request@ecs.soton.ac.uk>
To: eprints-tech@ecs.soton.ac.uk
Sent: Monday, 11 March 2013 5:21 AM
Subject: Eprints-tech Digest, Vol 54, Issue 17
Send Eprints-tech mailing list submissions to
eprints-tech@ecs.soton.ac.uk
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
or, via email, send a message with subject or body 'help' to
eprints-tech-request@ecs.soton.ac.uk
You can reach the person managing the list at
eprints-tech-owner@ecs.soton.ac.uk
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Eprints-tech digest..."
Today's Topics:
1. Tabs in Review section; Admin search (Marc Marc)
2. Re: Fwd: Are Closed Access Deposits Indexed by Google
Scholar? (Mark Gregson)
----------------------------------------------------------------------
Message: 1
Date: Sun, 10 Mar 2013 18:42:34 +0100 (CET)
From: "Marc Marc" <marc_3278@gmx.de>
Subject: [EP-tech] Tabs in Review section; Admin search
To: eprints-tech@ecs.soton.ac.uk
Message-ID:
<trinity-69e43c3b-1fc0-4b94-bdc0-54db99c511f2-1362937354358@3capp-gmx-bs01>
Content-Type: text/plain; charset="us-ascii"
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20130310/ff28698a/attachment-0001.html
------------------------------
Message: 2
Date: Mon, 11 Mar 2013 09:50:09 +1000
From: Mark Gregson <mark.gregson@qut.edu.au>
Subject: [EP-tech] Re: Fwd: Are Closed Access Deposits Indexed by
Google Scholar?
To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>
Cc: Paul THIRION <Paul.Thirion@ulg.ac.be>, "Nguyen, Minh-Quang"
<nguyen.minh-quang@uqam.ca>
Message-ID:
<A394AD7795D78648887B45809E4586570100CA62@QUTEXMBX03.qut.edu.au" href="mailto:A394AD7795D78648887B45809E4586570100CA62@QUTEXMBX03.qut.edu.au">A394AD7795D78648887B45809E4586570100CA62@QUTEXMBX03.qut.edu.au>
Content-Type: text/plain; charset="iso-8859-1"
Hi Pierre
Have a look in eprints/<archivename>/cfg/cfg.d/eprint_render.pl in the anonymous function starting:
$c->{eprint_render} = sub {
If the EPrints native metadata is being rendered you should be able to find the following lines:
my $links = $session->make_doc_fragment();
$links->appendChild( $session->plugin('Export::Simple')->dataobj_to_html_header($eprint) );
If it's not there you should be able to add it, reload the configuration/restart httpd and it will just work. There are caveats, e.g., if you have changed the name of fields in the schema or the plugin has been disabled.
Regards
Mark
Mark Gregson | Applications and Development Team Leader
Library eServices | Queensland University of Technology
Level 3 | R Block | Kelvin Grove Campus | GPO Box 2434 | Brisbane 4001
Phone: +61 7 3138 3782 | Web: http://eprints.qut.edu.au/<http://www.qut.edu.au/>
ABN: 83 791 724 622
CRICOS No: 00213J
From: eprints-tech-bounces@ecs.soton.ac.uk [mailto:eprints-tech-bounces@ecs.soton.ac.uk] On Behalf Of Nault, Pierre
Sent: Saturday, 9 March 2013 1:32 AM
To: eprints-tech@ecs.soton.ac.uk
Cc: Paul THIRION; Nguyen, Minh-Quang
Subject: [EP-tech] Re: Fwd: Are Closed Access Deposits Indexed by Google Scholar?
Hi all,
We have removed the <meta name="robots" content="noindex,nofollow" /> from default.xml. We confirm that we are running on Eprints version 3.1-2008-12-03-r3984. Anurag : you mention that all versions of eprints over 3.0 can generate the machine-readable bibliographic metadata. Obviously this is not the case for us. Following your assertion, something is probably missing in the configuration of our repository. I'm not too familiar with the config of eprints: I would much appreciate any help on activating this feature.
Best regards,
Pierre Nault
De : eprints-tech-bounces@ecs.soton.ac.uk<mailto:eprints-tech-bounces@ecs.soton.ac.uk> [mailto:eprints-tech-bounces@ecs.soton.ac.uk] De la part de Stevan Harnad
Envoy? : 7 mars 2013 13:24
? : eprints-tech@ecs.soton.ac.uk<mailto:eprints-tech@ecs.soton.ac.uk> List
Cc : Paul THIRION; Nguyen, Minh-Quang
Objet : [EP-tech] Fwd: Are Closed Access Deposits Indexed by Google Scholar?
Begin forwarded message:
From: Anurag Acharya <acha@google.com<mailto:acha@google.com>>
Subject: Re: [EP-tech] Are Closed Access Deposits Indexed by Google Scholar?
Date: 5 March, 2013 10:30:35 PM EST
To: Stevan Harnad <harnad@ecs.soton.ac.uk<mailto:harnad@ecs.soton.ac.uk>>
Cc: eprints-tech@ecs.soton.ac.uk<mailto:eprints-tech@ecs.soton.ac.uk>, Couture Marc <marc.couture@teluq.ca<mailto:marc.couture@teluq.ca>>
Hi Marc: I took a quick look at the examples you mentioned. I noticed couple of issues:
First things first, you are explicitly asking for these pages to not be indexed. For the two examples you mentioned:
view-source:http://www.archipel.uqam.ca/4254/
<meta name="robots" content="noindex,nofollow" />
view-source:http://www.archipel.uqam.ca/4252/
<meta name="robots" content="noindex,nofollow" />
A noindex robots metatag on an html page asks web search services to not index the page.
Second, I don't know if this is an old version of eprints or a custom repository but looks like it doesn't include the machine-readable bibliographic metadata that eprints 3.0 and later embed using metatags. Eg:
view-source:http://eprints.soton.ac.uk/349474/
<meta name="eprints.creators_name" content="Ohka, Seii" />
<meta name="eprints.creators_name" content="Sakai, Mai" />
<meta name="eprints.creators_name" content="Bohnert, Stephanie" />
<meta name="eprints.creators_name" content="Igarashi, Hiroko" />
<meta name="eprints.creators_name" content="Deinhardt, Katrin" />
<meta name="eprints.creators_name" content="Schiavo, Giampietro" />
<meta name="eprints.creators_name" content="Nomoto, Akio" />
[...]
If you are using an older version of eprints, I would recommend upgrading to a version later than 3.0. If you are using a different repository software, I would recommend http://roar.eprints.org/help/google_scholar.html and http://scholar.google.com/intl/en/scholar/inclusion.html
cheers,
anurag
On Tue, Mar 5, 2013 at 5:17 AM, Stevan Harnad <harnad@ecs.soton.ac.uk<mailto:harnad@ecs.soton.ac.uk>> wrote:
On 2013-03-05, at 5:12 AM, Tim Brody <tdb2@ecs.soton.ac.uk<mailto:tdb2@ecs.soton.ac.uk>> wrote:
On Mon, 4 Mar 2013 15:23:06 -0500, Stevan Harnad <harnad@ecs.soton.ac.uk<mailto:harnad@ecs.soton.ac.uk>>
wrote:
I have been told that closed access deposits for
http://www.archipel.uqam.ca<http://www.archipel.uqam.ca/> are not being indexed by Google Scholar: Is
there any way around this?
(I mean the metadata, of course, not the full-text, which I know is
unharvestable till access is re-set as OA).
There's no reason that the metadata pages shouldn't be indexed, but I don't
think (?) Google Scholar will list metadata-only records from repositories.
A specific example would be useful.
It's bad news (for the Button) if GS does not index the metadata of Closed Access deposits. (GS certainly indexes plenty of papers that do not have a free full-text version on the web).
Could this (if it's true) be fixed by optimizing the way an EPrints IR presents itself to google and GS (levels of embedding or something like that)? I seem to remember Les saying that the depth of documents was important.
A DSpace IR, Orbi, has 50% Closed Access contents (for example, here<http://orbi.ulg.ac.be/browse?type=datepublished&rpp=20&value=2012>).
These are all picked up by Google, for example this one: "Tubulin isoforms identified in the brain by MALDI in-source decay"
but they appear very late in the Google hit list (especially for much-sited or multi-cited papers)
and the Orbi version does not seem to be picked up by GS at all.
This is extremely important, because it affects the efficacy of the Button, and thereby the power of an immediate-deposit mandate (and the incentive to adopt one).
Is there any way to address this problem directly in EPrints (plus advice for our cousins in DSpace)?
Many thanks,
Stevan
From: Couture Marc <marc.couture@teluq.ca<mailto:marc.couture@teluq.ca>>
Subject: RE: [EP-tech] Are Closed Access Deposits Indexed by Google Scholar?
Date: 4 March, 2013 6:17:13 PM EST
To: Stevan Harnad <harnad@ecs.soton.ac.uk<mailto:harnad@ecs.soton.ac.uk>>, Leslie Carr <lac@ecs.soton.ac.uk<mailto:lac@ecs.soton.ac.uk>>
Hi,
My belief that Google / Scholar doesn't index closed access documents (more precisely, the HTML page with the metadata) is based upon a simple check with two closed access documents in Archipel :
1. http://www.archipel.uqam.ca/4252
This is the manuscript of a published article (Title : So into it they forget what time it is?)
If I put the title (between quotes) in Google or Google Scholar, all I see is the published (toll access) version :
http://www.igi-global.com/chapter/into-they-forget-time/67430
2. http://www.archipel.uqam.ca/4254
The title is: Discretionary power of project managers in knowledge intensive firms and gender issues
Again, Google Scholar finds only the published version (Google doesn't even find it):
http://onlinelibrary.wiley.com/doi/10.1002/cjas.147/abstract
On the same results page, one sees another paper, available in open acces in Archipel, citing this one.
Both manuscripts have been in Archipel for more than one year (deposit date: Nov 2011).
Marc Couture
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20130311/00f733df/attachment.html
------------------------------
_______________________________________________
Eprints-tech mailing list
Eprints-tech@ecs.soton.ac.uk
http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
End of Eprints-tech Digest, Vol 54, Issue 17
********************************************
I have 12 IRs running as separate instances each for different labs of "CSIR India" on a central server csircentral.net with subdomains like: ncl.csircentral.net, cecri.csircentral.net etc.
I am the overall administrator but unable to answer queries on "Embargo expiry date". User complaining that documents are getting opened by registered users even before the "Embargo expiry date".
In my view, registered users can access these document anytime even before expiry of Embargo date. But users are not convinced, please clarify this so that I can satisfy them with the authentic response from Eprints
Regards,
Alok Khode
Scientist (IT)
CSIR-Unit for R & D of Information Productss
85/1,'Jopasana', Paud Road, Kothrud
Pune (India)-411038
--------------------------------------
Tele: +91-20-25383557 (Ext. 321)
Mob: +91-9960268260
--------------------------------------
Alok Khode
Scientist (IT)
CSIR-Unit for R & D of Information Productss
85/1,'Jopasana', Paud Road, Kothrud
Pune (India)-411038
--------------------------------------
Tele: +91-20-25383557 (Ext. 321)
Mob: +91-9960268260
--------------------------------------
From: "eprints-tech-request@ecs.soton.ac.uk" <eprints-tech-request@ecs.soton.ac.uk>
To: eprints-tech@ecs.soton.ac.uk
Sent: Monday, 11 March 2013 5:21 AM
Subject: Eprints-tech Digest, Vol 54, Issue 17
Send Eprints-tech mailing list submissions to
eprints-tech@ecs.soton.ac.uk
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
or, via email, send a message with subject or body 'help' to
eprints-tech-request@ecs.soton.ac.uk
You can reach the person managing the list at
eprints-tech-owner@ecs.soton.ac.uk
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Eprints-tech digest..."
Today's Topics:
1. Tabs in Review section; Admin search (Marc Marc)
2. Re: Fwd: Are Closed Access Deposits Indexed by Google
Scholar? (Mark Gregson)
----------------------------------------------------------------------
Message: 1
Date: Sun, 10 Mar 2013 18:42:34 +0100 (CET)
From: "Marc Marc" <marc_3278@gmx.de>
Subject: [EP-tech] Tabs in Review section; Admin search
To: eprints-tech@ecs.soton.ac.uk
Message-ID:
<trinity-69e43c3b-1fc0-4b94-bdc0-54db99c511f2-1362937354358@3capp-gmx-bs01>
Content-Type: text/plain; charset="us-ascii"
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20130310/ff28698a/attachment-0001.html
------------------------------
Message: 2
Date: Mon, 11 Mar 2013 09:50:09 +1000
From: Mark Gregson <mark.gregson@qut.edu.au>
Subject: [EP-tech] Re: Fwd: Are Closed Access Deposits Indexed by
Google Scholar?
To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>
Cc: Paul THIRION <Paul.Thirion@ulg.ac.be>, "Nguyen, Minh-Quang"
<nguyen.minh-quang@uqam.ca>
Message-ID:
<A394AD7795D78648887B45809E4586570100CA62@QUTEXMBX03.qut.edu.au" href="mailto:A394AD7795D78648887B45809E4586570100CA62@QUTEXMBX03.qut.edu.au">A394AD7795D78648887B45809E4586570100CA62@QUTEXMBX03.qut.edu.au>
Content-Type: text/plain; charset="iso-8859-1"
Hi Pierre
Have a look in eprints/<archivename>/cfg/cfg.d/eprint_render.pl in the anonymous function starting:
$c->{eprint_render} = sub {
If the EPrints native metadata is being rendered you should be able to find the following lines:
my $links = $session->make_doc_fragment();
$links->appendChild( $session->plugin('Export::Simple')->dataobj_to_html_header($eprint) );
If it's not there you should be able to add it, reload the configuration/restart httpd and it will just work. There are caveats, e.g., if you have changed the name of fields in the schema or the plugin has been disabled.
Regards
Mark
Mark Gregson | Applications and Development Team Leader
Library eServices | Queensland University of Technology
Level 3 | R Block | Kelvin Grove Campus | GPO Box 2434 | Brisbane 4001
Phone: +61 7 3138 3782 | Web: http://eprints.qut.edu.au/<http://www.qut.edu.au/>
ABN: 83 791 724 622
CRICOS No: 00213J
From: eprints-tech-bounces@ecs.soton.ac.uk [mailto:eprints-tech-bounces@ecs.soton.ac.uk] On Behalf Of Nault, Pierre
Sent: Saturday, 9 March 2013 1:32 AM
To: eprints-tech@ecs.soton.ac.uk
Cc: Paul THIRION; Nguyen, Minh-Quang
Subject: [EP-tech] Re: Fwd: Are Closed Access Deposits Indexed by Google Scholar?
Hi all,
We have removed the <meta name="robots" content="noindex,nofollow" /> from default.xml. We confirm that we are running on Eprints version 3.1-2008-12-03-r3984. Anurag : you mention that all versions of eprints over 3.0 can generate the machine-readable bibliographic metadata. Obviously this is not the case for us. Following your assertion, something is probably missing in the configuration of our repository. I'm not too familiar with the config of eprints: I would much appreciate any help on activating this feature.
Best regards,
Pierre Nault
De : eprints-tech-bounces@ecs.soton.ac.uk<mailto:eprints-tech-bounces@ecs.soton.ac.uk> [mailto:eprints-tech-bounces@ecs.soton.ac.uk] De la part de Stevan Harnad
Envoy? : 7 mars 2013 13:24
? : eprints-tech@ecs.soton.ac.uk<mailto:eprints-tech@ecs.soton.ac.uk> List
Cc : Paul THIRION; Nguyen, Minh-Quang
Objet : [EP-tech] Fwd: Are Closed Access Deposits Indexed by Google Scholar?
Begin forwarded message:
From: Anurag Acharya <acha@google.com<mailto:acha@google.com>>
Subject: Re: [EP-tech] Are Closed Access Deposits Indexed by Google Scholar?
Date: 5 March, 2013 10:30:35 PM EST
To: Stevan Harnad <harnad@ecs.soton.ac.uk<mailto:harnad@ecs.soton.ac.uk>>
Cc: eprints-tech@ecs.soton.ac.uk<mailto:eprints-tech@ecs.soton.ac.uk>, Couture Marc <marc.couture@teluq.ca<mailto:marc.couture@teluq.ca>>
Hi Marc: I took a quick look at the examples you mentioned. I noticed couple of issues:
First things first, you are explicitly asking for these pages to not be indexed. For the two examples you mentioned:
view-source:http://www.archipel.uqam.ca/4254/
<meta name="robots" content="noindex,nofollow" />
view-source:http://www.archipel.uqam.ca/4252/
<meta name="robots" content="noindex,nofollow" />
A noindex robots metatag on an html page asks web search services to not index the page.
Second, I don't know if this is an old version of eprints or a custom repository but looks like it doesn't include the machine-readable bibliographic metadata that eprints 3.0 and later embed using metatags. Eg:
view-source:http://eprints.soton.ac.uk/349474/
<meta name="eprints.creators_name" content="Ohka, Seii" />
<meta name="eprints.creators_name" content="Sakai, Mai" />
<meta name="eprints.creators_name" content="Bohnert, Stephanie" />
<meta name="eprints.creators_name" content="Igarashi, Hiroko" />
<meta name="eprints.creators_name" content="Deinhardt, Katrin" />
<meta name="eprints.creators_name" content="Schiavo, Giampietro" />
<meta name="eprints.creators_name" content="Nomoto, Akio" />
[...]
If you are using an older version of eprints, I would recommend upgrading to a version later than 3.0. If you are using a different repository software, I would recommend http://roar.eprints.org/help/google_scholar.html and http://scholar.google.com/intl/en/scholar/inclusion.html
cheers,
anurag
On Tue, Mar 5, 2013 at 5:17 AM, Stevan Harnad <harnad@ecs.soton.ac.uk<mailto:harnad@ecs.soton.ac.uk>> wrote:
On 2013-03-05, at 5:12 AM, Tim Brody <tdb2@ecs.soton.ac.uk<mailto:tdb2@ecs.soton.ac.uk>> wrote:
On Mon, 4 Mar 2013 15:23:06 -0500, Stevan Harnad <harnad@ecs.soton.ac.uk<mailto:harnad@ecs.soton.ac.uk>>
wrote:
I have been told that closed access deposits for
http://www.archipel.uqam.ca<http://www.archipel.uqam.ca/> are not being indexed by Google Scholar: Is
there any way around this?
(I mean the metadata, of course, not the full-text, which I know is
unharvestable till access is re-set as OA).
There's no reason that the metadata pages shouldn't be indexed, but I don't
think (?) Google Scholar will list metadata-only records from repositories.
A specific example would be useful.
It's bad news (for the Button) if GS does not index the metadata of Closed Access deposits. (GS certainly indexes plenty of papers that do not have a free full-text version on the web).
Could this (if it's true) be fixed by optimizing the way an EPrints IR presents itself to google and GS (levels of embedding or something like that)? I seem to remember Les saying that the depth of documents was important.
A DSpace IR, Orbi, has 50% Closed Access contents (for example, here<http://orbi.ulg.ac.be/browse?type=datepublished&rpp=20&value=2012>).
These are all picked up by Google, for example this one: "Tubulin isoforms identified in the brain by MALDI in-source decay"
but they appear very late in the Google hit list (especially for much-sited or multi-cited papers)
and the Orbi version does not seem to be picked up by GS at all.
This is extremely important, because it affects the efficacy of the Button, and thereby the power of an immediate-deposit mandate (and the incentive to adopt one).
Is there any way to address this problem directly in EPrints (plus advice for our cousins in DSpace)?
Many thanks,
Stevan
From: Couture Marc <marc.couture@teluq.ca<mailto:marc.couture@teluq.ca>>
Subject: RE: [EP-tech] Are Closed Access Deposits Indexed by Google Scholar?
Date: 4 March, 2013 6:17:13 PM EST
To: Stevan Harnad <harnad@ecs.soton.ac.uk<mailto:harnad@ecs.soton.ac.uk>>, Leslie Carr <lac@ecs.soton.ac.uk<mailto:lac@ecs.soton.ac.uk>>
Hi,
My belief that Google / Scholar doesn't index closed access documents (more precisely, the HTML page with the metadata) is based upon a simple check with two closed access documents in Archipel :
1. http://www.archipel.uqam.ca/4252
This is the manuscript of a published article (Title : So into it they forget what time it is?)
If I put the title (between quotes) in Google or Google Scholar, all I see is the published (toll access) version :
http://www.igi-global.com/chapter/into-they-forget-time/67430
2. http://www.archipel.uqam.ca/4254
The title is: Discretionary power of project managers in knowledge intensive firms and gender issues
Again, Google Scholar finds only the published version (Google doesn't even find it):
http://onlinelibrary.wiley.com/doi/10.1002/cjas.147/abstract
On the same results page, one sees another paper, available in open acces in Archipel, citing this one.
Both manuscripts have been in Archipel for more than one year (deposit date: Nov 2011).
Marc Couture
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20130311/00f733df/attachment.html
------------------------------
_______________________________________________
Eprints-tech mailing list
Eprints-tech@ecs.soton.ac.uk
http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
End of Eprints-tech Digest, Vol 54, Issue 17
********************************************
- Prev by Date: [EP-tech] Re: ePrints SWORD API
- Next by Date: [EP-tech] Missing a recently published item in latest_tool
- Previous by thread: [EP-tech] Solr package!!
- Next by thread: [EP-tech] Missing a recently published item in latest_tool
- Index(es):