EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #04069


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

[EP-tech] Re: Indexing issue


Okay; so there's a bunch of stuff but it's all from October 2013 and October 2014 - nothing that looks like the record that seems to be giving me the problem. Now, if I delete things off this list I presume it won't delete the actual record?

-----Original Message-----
From: eprints-tech-bounces@ecs.soton.ac.uk [mailto:eprints-tech-bounces@ecs.soton.ac.uk] On Behalf Of Alan.Stiles
Sent: 12 March 2015 16:22
To: eprints-tech@ecs.soton.ac.uk
Subject: [EP-tech] Re: Indexing issue

Admin page, System Tools Tab, Status button, Background Task Queue should list what the indexer is working on.
There should be an event for plugin 'Event::Indexer', action 'Index', with Parameters of '/id/eprint/xxxxxx' where xxxxx is the id of the offending eprint.  You should be able to delete or change the status on that item to a different status (e.g. Failed) to stop it being processed.

-----Original Message-----
From: Andrew Beeken [mailto:anbeeken@lincoln.ac.uk]
Sent: 12 March 2015 14:40
To: eprints-tech@ecs.soton.ac.uk
Subject: [EP-tech] Re: Indexing issue

Nope! :) How would I do that?

On 12/03/2015 14:33, "Alan.Stiles" <alan.stiles@open.ac.uk> wrote:

>Hi Andrew, have you removed the index event from the event queue?
>
>-----Original Message-----
>From: Andrew Beeken [mailto:anbeeken@lincoln.ac.uk]
>Sent: 12 March 2015 14:12
>To: eprints-tech@ecs.soton.ac.uk
>Subject: [EP-tech] Re: Indexing issue
>
>Further to this, we¹ve now removed the record in question, however 
>EPrints still seems to be trying to index it - I don¹t know if this is 
>a cache of some kind.
>
>On 11/03/2015 16:11, "Andrew Beeken" <anbeeken@lincoln.ac.uk> wrote:
>
>>Hello!
>>
>>We¹ve got a bit of an issue with our search indexing which, after some 
>>digging, seems to be related to a specific record. Dumping a log of 
>>the ‹verbose index to a file, the offending entry seems to be throwing 
>>the following error:
>>
>>Error: PDF file is damaged - attempting to reconstruct xref table...
>>
>>Error opening /tmp/ep-convertc4Azo/index.txt: No such file or 
>>directory at 
>>/usr/share/eprints3/perl_lib/EPrints/Plugin/Convert/IndexCodes.pm
>>line 57.
>>
>>The issue seems to be that once the indexer hits this record it bails 
>>out and doesn¹t index records after this, meaning that some users have 
>>items which do not show up when a search is conducted based on their 
>>user number ­ as this is used in conjunction with the JSON output on a 
>>search to populate our local staff directory with academics research 
>>outputs, it¹s causing us some issues internally and also concerning us 
>>that the EPrints search index is incomplete.
>>
>>We¹ve removed the PDF that appears to be causing the problem but it is 
>>potentially cached as it still comes up with the error; we¹ve tried 
>>putting the record as ³retired² and will look tomorrow to see if that 
>>does anything, however if anyone has encountered this before or has a 
>>potentially better suggestion, let me know!
>>
>>Cheers!
>>Andrew
>>
>>The University of Lincoln, located in the heart of the city of 
>>Lincoln, has established an international reputation based on high 
>>student satisfaction, excellent graduate employment and world-class research.
>>
>>The information in this e-mail and any attachments may be confidential.
>>If you have received this email in error please notify the sender 
>>immediately and remove it from your system. Do not disclose the 
>>contents to another person or take copies.
>>
>>Email is not secure and may contain viruses. The University of Lincoln 
>>makes every effort to ensure email is sent without viruses, but cannot 
>>guarantee this and recommends recipients take appropriate precautions.
>>
>>The University may monitor email traffic data and content in 
>>accordance with its policies and English law. Further information can be found at:
>>http://www.lincoln.ac.uk/legal.
>>
>>*** Options:
>>http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>>*** Archive: http://www.eprints.org/tech.php/
>>*** EPrints community wiki: http://wiki.eprints.org/
>>*** EPrints developers Forum: http://forum.eprints.org/
>
>
>*** Options:
>http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>*** Archive: http://www.eprints.org/tech.php/
>*** EPrints community wiki: http://wiki.eprints.org/
>*** EPrints developers Forum: http://forum.eprints.org/
>-- The Open University is incorporated by Royal Charter (RC 000391), an 
>exempt charity in England & Wales and a charity registered in Scotland 
>(SC 038302). The Open University is authorised and regulated by the 
>Financial Conduct Authority.
>
>*** Options:
>http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>*** Archive: http://www.eprints.org/tech.php/
>*** EPrints community wiki: http://wiki.eprints.org/
>*** EPrints developers Forum: http://forum.eprints.org/


*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
-- The Open University is incorporated by Royal Charter (RC 000391), an exempt charity in England & Wales and a charity registered in Scotland (SC 038302). The Open University is authorised and regulated by the Financial Conduct Authority.

*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/