EPrints Technical Mailing List Archive
Message: #09186
< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First
[EP-tech] Searching URLs
- To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>
- Subject: [EP-tech] Searching URLs
- From: Martin Brändle <martin.braendle@uzh.ch>
- Date: Fri, 20 Jan 2023 14:42:15 +0000
CAUTION: This e-mail originated outside the University of Southampton.
Hi, we observe in our repo that only complete URLs can be searched in Url-type fields. As far as I understand from the Metafield definition, text_index => 1, sql_index => 0, and default search behavior is "IN", so it should be possible to search also for single words of an URL or URLs truncated with %,
or not? Also, when I investigated the repository database tables, I see that the eprint__index table only contains complete URLs for an Url-type field. In addition, there is a limit in eprint__index for the ids column, which
might hamper large repositories. The ids column data type is "text", which allows for 64K characters maximum. It stores the eprint ids (concatenated with a colon) for an indexed word. So a maximum of about 10K eprintids is possible for a word. Frequent words
(which are not stopwords) may not be indexed completely … Kind regards, Martin -- Dr. Martin Brändle |
- Follow-Ups:
- [EP-tech] Searching URLs
- From: Martin Brändle <martin.braendle@uzh.ch>
- [EP-tech] Searching URLs
- References:
- [EP-tech] Searching URLs
- From: Martin Brändle <martin.braendle@uzh.ch>
- [EP-tech] Searching URLs
- Prev by Date: Re: [EP-tech] cannot upload file above 1GB
- Next by Date: Re: [EP-tech] cannot upload file above 1GB
- Previous by thread: [EP-tech] Sort view with creators_name and corp_creators
- Index(es):