EPrints Technical Mailing List Archive
See the EPrints wiki for instructions on how to join this mailing list and related information.
Message: #05347
< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First
[EP-tech] Re: Question about full text search (Documents in Advanced Search page)
- To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>
- Subject: [EP-tech] Re: Question about full text search (Documents in Advanced Search page)
- From: "Alan.Stiles" <alan.stiles@open.ac.uk>
- Date: Mon, 25 Jan 2016 09:20:21 +0000
Have you tried to reindex one of the missing items to see if it made a difference? Check the error_log whilst it reindexes in case eprints is having some other issue with opening the pdf (we sometimes have issues with e.g. apostrophes in the filenames). -----Original Message----- From: eprints-tech-bounces@ecs.soton.ac.uk [mailto:eprints-tech-bounces@ecs.soton.ac.uk] On Behalf Of Michael Street Sent: 22 January 2016 21:01 To: eprints-tech@ecs.soton.ac.uk Subject: [EP-tech] Re: Question about full text search (Documents in Advanced Search page) Hi again, Does anyone have any idea why these documents are not showing up in the search results? Any suggestions would really be appreciated. I'm at a loss as to why it's not returning results that clearly have the search term in the pdf (and the converted text document). --Mike Street On 1/15/2016 11:05 AM, Michael Street wrote: > Hi John, > > Thanks very much for your response. Please find my answers below: > > 1) Indexer is running and confirmed to be working. The documents > that don't show up are some of the oldest and are available through > other links. Newly deposited items also show up in the Views. > > 2) I have tried pdftotext on the system and had no issues with > converting it. I also was able to find the search term within the > document easily. > > 3) I run a cronjob that updates the DB and switches everything to be > visible, every 15 minutes. My client does not want anything to be > hidden, especially previous versions of eprints, so this was the > easiest way to achieve that, for me. Also, the eprints in question do > show up in the Views, which shows they're set to visible. > > So if you have any other ideas, I'd really appreciate it. I'm at a > loss here. > > Thanks, > Mike. > > > On 1/14/2016 4:35 PM, John Salter wrote: >> Hi, >> I'd check that you indexer is running, and that the task queue is processed. >> >> I'd also check that the PDFs aren't restricted in some way (maybe see what something like pdftotext returns when run against one of the not-returned PDFs. >> >> Also, as was mentioned in a different thread recently, check what the 'metadata visibility' flag for the EPrint is. >> >> If none of that gets you anywhere, let us know and we'll put our collective thinking caps on! >> >> Cheers, >> John >> >> ________________________________________ >> From: eprints-tech-bounces@ecs.soton.ac.uk >> <eprints-tech-bounces@ecs.soton.ac.uk> on behalf of Michael Street >> <mstreet@yorku.ca> >> Sent: 14 January 2016 16:04 >> To: eprints-tech@ecs.soton.ac.uk >> Subject: [EP-tech] Question about full text search (Documents in Advanced Search page) >> >> Hi, >> >> I've got some pdfs in the repository that include the phrase 'bohm' >> many times but the Advanced Search page is only returning 4 out of >> probably >> 25+ eprints as hits on the phrase. I'm using the Documents search >> 25+ box, >> which I believe it the full-text search box. Is there something I'm >> missing? >> >> Any help would be appreciated thanks, Mike. >> >> *** Options: >> http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech >> *** Archive: http://www.eprints.org/tech.php/ >> *** EPrints community wiki: http://wiki.eprints.org/ >> *** EPrints developers Forum: http://forum.eprints.org/ >> >> *** Options: >> http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech >> *** Archive: http://www.eprints.org/tech.php/ >> *** EPrints community wiki: http://wiki.eprints.org/ >> *** EPrints developers Forum: http://forum.eprints.org/ > *** Options: > http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech > *** Archive: http://www.eprints.org/tech.php/ > *** EPrints community wiki: http://wiki.eprints.org/ > *** EPrints developers Forum: http://forum.eprints.org/ *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech *** Archive: http://www.eprints.org/tech.php/ *** EPrints community wiki: http://wiki.eprints.org/ *** EPrints developers Forum: http://forum.eprints.org/ -- The Open University is incorporated by Royal Charter (RC 000391), an exempt charity in England & Wales and a charity registered in Scotland (SC 038302). The Open University is authorised and regulated by the Financial Conduct Authority.
- References:
- [EP-tech] Perl question
- From: John Salter <J.Salter@leeds.ac.uk>
- [EP-tech] Re: Perl question
- From: Ian Stuart <Ian.Stuart@ed.ac.uk>
- [EP-tech] Re: Perl question
- From: "Field A.N." <af05v@ecs.soton.ac.uk>
- [EP-tech] Question about full text search (Documents in Advanced Search page)
- From: Michael Street <mstreet@yorku.ca>
- [EP-tech] Re: Question about full text search (Documents in Advanced Search page)
- From: John Salter <J.Salter@leeds.ac.uk>
- [EP-tech] Re: Question about full text search (Documents in Advanced Search page)
- From: Michael Street <mstreet@yorku.ca>
- [EP-tech] Re: Question about full text search (Documents in Advanced Search page)
- From: Michael Street <mstreet@yorku.ca>
- [EP-tech] Perl question
- Prev by Date: [EP-tech] Re: Question about full text search (Documents in Advanced Search page)
- Next by Date: [EP-tech] Virus Scan during Upload
- Previous by thread: [EP-tech] Re: Question about full text search (Documents in Advanced Search page)
- Next by thread: [EP-tech] Re: Question about full text search (Documents in Advanced Search page)
- Index(es):