EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #01454


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

[EP-tech] international character search problem


I may have found a bug in EPrints 3.3.10. One of the authors in our repository is Anıl Zenginoğlu (if the name doesn't come out right in email, his homepage is  http://www.tapir.caltech.edu/~anil/). Searching for the surname works fine with the simple search, but with the advanced search we don't get any results. I believe the problem is with line 230 in perl_lib/EPrints/MetaField/Name.pm:

# remove not a-z characters (except ,)
$v2 =~ s/[^a-z,]/ /ig;

That code splits up "zenginoğlu" to "zengino lu". A possible solution may be

use utf8;
…
$v2 =~ s/[^\p{L},]/ /ig;
…

Maybe someone with a strong encodings-fu can comment?

tommy