EPrints Technical Mailing List Archive
See the EPrints wiki for instructions on how to join this mailing list and related information.
Message: #08513
< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First
[EP-tech] Elements-EPrints Odd Characters stopping upload
- To: eprints-tech@ecs.soton.ac.uk
- Subject: [EP-tech] Elements-EPrints Odd Characters stopping upload
- From: James Kerwin <jkerwin2101@gmail.com>
- Date: Wed, 17 Feb 2021 09:44:54 +0000
CAUTION: This e-mail originated outside the University of Southampton.
Hi All,
This is an Elements/EPrints question. Apologies that it isn't purely EPrints, but this is probably the best place to get an answer. I want to know if others experience this or it's some oddity to our setup.
We are using RT1 (for now) and EPrints 3.3.14 (also for now until upgrade). Occasionally we get an Elements record that is from Scopus, PubMed etc. that has an odd character in it that prevents upload. When I look in the Apache logs it tells me the problem.
Yesterday's one was the presence of:
"Unicode Character “’” (U+2019)"
Which showed in the logs as:
"Can't escape \\x{2019}, try uri_escape_utf8() instead at /opt/eprints3/perl_lib/URI/Escape.pm"
Importantly if I copy the problem characters to the manual elements record it doesn't pose a problem. There appears some processing to properly encode characters entered via text box, but not characters dragged in from other sources into Elements.
I've also had the issue with the files containing "'" or" ";" etc not being accessible via Elements (a very similar, but different problem).
I found where I COULD fix the former issue, but it involves changing EPrints code when I SHOULD be altering the Symplectic connector code on the repo server.
Anyway, I'm not specifically looking for a solution, but has anybody else experienced anything similar? If so, does it stop with RT2? I hope to raise a ticket with Symplectic over this eventually.
"Unicode Character “’” (U+2019)"
Which showed in the logs as:
"Can't escape \\x{2019}, try uri_escape_utf8() instead at /opt/eprints3/perl_lib/URI/Escape.pm"
Importantly if I copy the problem characters to the manual elements record it doesn't pose a problem. There appears some processing to properly encode characters entered via text box, but not characters dragged in from other sources into Elements.
I've also had the issue with the files containing "'" or" ";" etc not being accessible via Elements (a very similar, but different problem).
I found where I COULD fix the former issue, but it involves changing EPrints code when I SHOULD be altering the Symplectic connector code on the repo server.
Anyway, I'm not specifically looking for a solution, but has anybody else experienced anything similar? If so, does it stop with RT2? I hope to raise a ticket with Symplectic over this eventually.
Thanks,
James
- Follow-Ups:
- [EP-tech] Elements-EPrints Odd Characters stopping upload
- From: James Kerwin <jkerwin2101@gmail.com>
- [EP-tech] Elements-EPrints Odd Characters stopping upload
- References:
- [EP-tech] Elements-EPrints Odd Characters stopping upload
- From: James Kerwin <jkerwin2101@gmail.com>
- [EP-tech] Elements-EPrints Odd Characters stopping upload
- Prev by Date: Re: [EP-tech] Availability statement in discovery platforms
- Next by Date: [EP-tech] search filter to show only the last version
- Previous by thread: [EP-tech] EPrints/CRIS
- Next by thread: [EP-tech] DOI handling in orcid_support_advance
- Index(es):