EPrints Technical Mailing List Archive
Message: #04332
< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First
[EP-tech] Re: DSpace import plugin doesn't seem to parse UTF-8 correctly
- To: eprints-tech@ecs.soton.ac.uk
- Subject: [EP-tech] Re: DSpace import plugin doesn't seem to parse UTF-8 correctly
- From: George Mamalakis <mamalos@eng.auth.gr>
- Date: Thu, 18 Jun 2015 17:09:56 +0300
Tim,It did work! What is funny, though, is that if I run the import statement from the console by giving --debug to see the output, without your patch, the correct content is printed in the console (which displays utf8), whereas when I run it with your patch, it prints the escaped UTF-8 characters (e.g.\x{3b1}\x{3c0}\x{3cc}).
Nonetheless, the point is that it worked, so thank you VERY much!I filed an issue, as you suggested (https://github.com/eprints/eprints/issues/326).
Thanks again for the prompt help! On 18/06/2015 03:12 μμ, Timothy Miles-Board wrote:
Hi George, Try this patch: --- a/perl_lib/EPrints/Plugin/Import/DSpace.pm +++ b/perl_lib/EPrints/Plugin/Import/DSpace.pm @@ -235,7 +235,7 @@ sub retrieve_dcq return undef; }- my $dc = $self->find_dc_pairs( $r->content );+ my $dc = $self->find_dc_pairs( $r->decoded_content ); return undef unless defined $dc;$self->{errurl} = $self->{errmsg} = undef;If it works, please pay it forward by submitting a bug report at http://github.com/eprints/eprints and adding the patch to the ticket as a proposed fix. Thanks, Tim Timothy Miles-Board Web & Repositories Development Specialist, University of London Computer Centre 020 7863 1342 | 07742 970 351 | timothy.miles-board@london.ac.uk | @drtjmb The University of London is an exempt charity in England and Wales ________________________________________ From: eprints-tech-bounces@ecs.soton.ac.uk <eprints-tech-bounces@ecs.soton.ac.uk> on behalf of George Mamalakis <mamalos@eng.auth.gr> Sent: 18 June 2015 12:16 PM To: eprints-tech@ecs.soton.ac.uk Subject: [EP-tech] DSpace import plugin doesn't seem to parse UTF-8 correctly Hello everybody, I am trying to use the DSpace Import plugin from EPrints by giving a URL to the web interface. While the system import the record, the character sets seem to not be imported correctly. My import is performed from a Greek DSpace server, and a test record where should be imported is this: https://dspace.lib.uom.gr/handle/2159/323. If you give the above URL into the DSpace import form, you'll see that the record is imported but the character set is messed up. Thanks all for your help in advance, George. -- George Mamalakis IT and Security Officer, Electrical and Computer Engineer (Aristotle Univ. of Thessaloniki), PhD (Aristotle Univ. of Thessaloniki), MSc (Imperial College of London) School of Electrical and Computer Engineering Aristotle University of Thessaloniki phone number : +30 (2310) 994379 *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech *** Archive: http://www.eprints.org/tech.php/ *** EPrints community wiki: http://wiki.eprints.org/ *** EPrints developers Forum: http://forum.eprints.org/ *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech *** Archive: http://www.eprints.org/tech.php/ *** EPrints community wiki: http://wiki.eprints.org/ *** EPrints developers Forum: http://forum.eprints.org/
-- George Mamalakis IT and Security Officer, Electrical and Computer Engineer (Aristotle Univ. of Thessaloniki), PhD (Aristotle Univ. of Thessaloniki), MSc (Imperial College of London) School of Electrical and Computer Engineering Aristotle University of Thessaloniki phone number : +30 (2310) 994379
- References:
- [EP-tech] OAI harvesting / records moving from live to buffer (or other non-deletion datasets).
- From: John Salter <J.Salter@leeds.ac.uk>
- [EP-tech] Re: OAI harvesting / records moving from live to buffer (or other non-deletion datasets).
- From: John Salter <J.Salter@leeds.ac.uk>
- [EP-tech] DSpace import plugin doesn't seem to parse UTF-8 correctly
- From: George Mamalakis <mamalos@eng.auth.gr>
- [EP-tech] Re: DSpace import plugin doesn't seem to parse UTF-8 correctly
- From: Timothy Miles-Board <timothy.miles-board@ulcc.ac.uk>
- [EP-tech] OAI harvesting / records moving from live to buffer (or other non-deletion datasets).
- Prev by Date: [EP-tech] Re: Item Type - magazines and newspapers
- Next by Date: [EP-tech] Re: Problem with dependencies when installing
- Previous by thread: [EP-tech] Re: DSpace import plugin doesn't seem to parse UTF-8 correctly
- Next by thread: [EP-tech] How to set a field to be multilang
- Index(es):