EPrints Technical Mailing List Archive
See the EPrints wiki for instructions on how to join this mailing list and related information.
Message: #08805
< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First
Re: [EP-tech] Fulltext (PDF) index
- To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>, "MOHD.IZWAN SALIM" <mohdizwan8733@uitm.edu.my>
 - Subject: Re: [EP-tech] Fulltext (PDF) index
 - From: John Salter <J.Salter@leeds.ac.uk>
 - Date: Mon, 6 Dec 2021 15:23:18 +0000
 
| 
 
CAUTION: This e-mail originated outside the University of Southampton.
 
Hi Mohd, 
I would check to see if the indexer is running, and if the task queue has anything in it. 
The quickest way to do this is to visit: 
https://[your repository URL]/cgi/counter 
This should present a text response. Look for 'event_queue' and 'indexer'. 
In EPrints, the fulltext indexing jobs are placed in the event_queue. 
The 'indexer' works through this queue. Normally, the 'indexer' should report as 'running', and the event_queue should be close to zero - meaning the indexer is doing what is needed. 
If the indexer is either 'stopped' or 'stalled', try running the indexer with one of these parameters: 
~/bin/indexer [status | stop | start] The indexer writes a log to ~/var/indexer.log - if something is causing the indexer to stop, there may be some information in there. 
To see what is actually in the event_queue (rather than just how many items are waiting), in the web interface, go to the Admin menu -> Manage Records -> Tasks. 
If there are a lot of items, you can sort the list, or filter on the start time, status etc. 
Hopefully that helps! 
Cheers, 
John 
From: eprints-tech-bounces@ecs.soton.ac.uk <eprints-tech-bounces@ecs.soton.ac.uk> on behalf of MOHD.IZWAN SALIM via Eprints-tech <eprints-tech@ecs.soton.ac.uk> 
Sent: 06 December 2021 04:04 To: EDER Norbert via Eprints-tech <eprints-tech@ecs.soton.ac.uk> Subject: [EP-tech] Fulltext (PDF) index 
CAUTION: This e-mail originated outside the University of Southampton.
 
Dear EPrints Community
 
I just set up a new repo with the latest Eprints version. 
How searching word in pdf (full text) does not return any result. 
The PDF is already OCR and searchable. 
I already run ./epadmin erase_fulltext_index repo --verbose 
Is there anything should I look at? 
Regards 
Mohd Izwan Bin Salim 
UiTM Digital Library PENAFIAN: E-mel ini dan apa-apa fail yang dihantar bersama-samanya ("Mesej") adalah dihasratkan hanya untuk kegunaan penerima yang dinyatakan di atas dan mungkin mengandungi maklumat yang tidak umum, bermilik, istimewa, sulit dan dikecualikan dari penzahiran di bawah undang-undang yang terpakai termasuklah Akta Rahsia Rasmi 1972. BACA SELANJUTNYA... DISCLAIMER : This e-mail and any files transmitted with it ("Message") is intended only for the use of the recipient(s) named above and may contain information that is non-public, proprietary, privileged, confidential and exempt from disclosure under applicable law including the Official Secrets Act 1972. READ MORE...  | 
- Follow-Ups:
- Re: [EP-tech] Fulltext (PDF) index
- From: John Salter <J.Salter@leeds.ac.uk>
 
 
 - Re: [EP-tech] Fulltext (PDF) index
 
- References:
- [EP-tech] Fulltext (PDF) index
- From: "MOHD.IZWAN SALIM" <mohdizwan8733@uitm.edu.my>
 
 - Re: [EP-tech] Fulltext (PDF) index
- From: John Salter <J.Salter@leeds.ac.uk>
 
 
 - [EP-tech] Fulltext (PDF) index
 
- Prev by Date: [EP-tech] Fulltext (PDF) index
 - Next by Date: Re: [EP-tech] Fulltext (PDF) index
 - Previous by thread: [EP-tech] EPrints/CRIS
 - Next by thread: [EP-tech] DOI handling in orcid_support_advance
 - Index(es):
 
