EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #09780


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

Re: [EP-tech] Ask about search result and reindex

  • To: David R Newman <drn@ecs.soton.ac.uk>
  • Subject: Re: [EP-tech] Ask about search result and reindex
  • From: "Agung Prasetyo W." <prazetyo@gmail.com>
  • Date: Wed, 24 Jul 2024 17:49:29 +0700

CAUTION: This e-mail originated outside the University of Southampton.
Hi David,

How do I know we use eprints database or xapian? After I run your script, it shows nothing. After I open the the file /var/eprint_rindex_unindexed.txt, it shows like below :
Copyright (c) 2000, 2021, Oracle and/or its affiliates.

Is my step wrong ??

Thank you.

Regards,
Agung PW





On Wed, 24 Jul 2024 at 17:22, David R Newman <drn@ecs.soton.ac.uk> wrote:

Hi Agung,

If you are using the database (i.e. eprint__rindex) table, then I wrote the following (rather hacky) Bash script to test this:

https://files.eprints.org/3065/

The script will ignore items whose metadata visibility is not set to show.  It is worth manually checking you database for item you expect to be able to find in search but cannot to see if the metadata_visibility field has been changed.  If you create new versions of items this will automatically set the current (now old) version to hide.  (This is a far from ideal situation but it is quite difficult to determine a better way to ensure users only find the latest versions, especially when the "New Version" button gets used in the wrong circumstances).

If you are using a Xapian index, (e.g. typically used for simple search), then I did write a different script for this but it is a lot more complex to deploy.

Regards

David Newman

On 24/07/2024 10:51, Agung Prasetyo W. wrote:
CAUTION: This e-mail originated outside the University of Southampton.
CAUTION: This e-mail originated outside the University of Southampton.
Hi,

Sometimes there are items that don't appear when I do a search, even though they are in the repository. But after I did the command: epadmin reindex [archive_id] eprint [item_id]
As a result, these items can appear in search results.

Is there a way to find out the item IDs that have not been indexed so that we can reindex the item IDs?

Thank you.

Regards,
Agung Prasetyo W.

*** Options: https://wiki.eprints.org/w/Eprints-tech_Mailing_List
*** Archive: https://www.eprints.org/tech.php/
*** EPrints community wiki: https://wiki.eprints.org/