EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #09978


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

Re: [EP-tech] Bulk import plugin


CAUTION: This e-mail originated outside the University of Southampton.
Hi, David

Thanks for the quick response. No, I'm not moving them from one to another Eprints repository. I am moving them from an entirely different source. Currently, I have everything in EndNote and can export as BibTex successfully. And this is just metadata, not a repository as such. I am providing a metadata database with URLs to the original papers and theses. This a specialist subject database for a research association, not an institutional repository.

Sorry, I should have mentioned this in the original question!

Best wishes 

Will
____

From: David R Newman <drn@ecs.soton.ac.uk>
Sent: Saturday, February 15, 2025 7:00:18 PM
To: eprints-tech@ecs.soton.ac.uk <eprints-tech@ecs.soton.ac.uk>; Will Hughes <w.p.hughes@reading.ac.uk>
Subject: Re: [EP-tech] Bulk import plugin
 
You don't often get email from drn@ecs.soton.ac.uk. Learn why this is important
Hi Will,

I am going to assume this is 20,000 records currently in an EPrints repository you want to transfer to a new/different EPrints repository.  If that is not the case please let me know what format you currently have for these records you want to import.

Exporting the existing records from your old EPrints repository should entail carrying out an (admin menu) EPrint search (for presumably all items in the live archive) and then an export as "EP3 XML with Files Embedded".  If you have big files (e.g. videos), as long as all the files you want to import are currently publicly accessible on the old EPrints repository, you can choose the EP3 XML export.

Importing is most easily/efficiently done from the (SSH) command line of the new EPrints repository server.  First, copy the export file generated from above.  Next, you need to run the following command to import the records (substituting EPRINTS_PATH and ARCHIVE_ID and OLD_ARCHIVE_ID as appropriate:

EPRINTS_PATH/bin/import ARCHIVE_ID --enable-file-imports --enable-web-imports eprint XML export_OLD_ARCHIVE_ID_XMLFiles.xml

However, these will be imported into the review buffer rather than the live archive, so you need to (temporarily) add the following to a configuration file in your new archive's cfg/cfg.d/ directory (e.g. z_skip_buffer.pl):

$c->{skip_buffer} = 1;

For more information about the import command see:

https://wiki.eprints.org/w/API:bin/import

In particular, you may want to set a user to import these records.  I would advise creating a special user for this, as having 20,000 records under a user account you regularly want to manage deposits will make this less responsive as it has to evaluate all 20,000 records to determine which to show on the first page of Manage Deposits.

Regards

David Newman

On 15/02/2025 6:13 pm, Will Hughes wrote:
CAUTION: This e-mail originated outside the University of Southampton.
CAUTION: This e-mail originated outside the University of Southampton.

Hi

 

With a new installation I am finding my way around the software. I am looking for the functionality to import records in bulk, straight to the repository.

 

I understand that there is or was a plugin for bulk import, but I cannot find it anywhere. What I want to do is to bring in 20,000 records in a way that make them immediately live. Is there a plugin that can be fired up from the website, or is this a command line interface kind of thing?

 

Any suggestions welcome

 

Thanks

 

Best wishes

 

Will   

 

Will Hughes

Emeritus Professor of Construction Management and Economics

School of the Built Environment     

University of Reading, PO Box 219, Whiteknights

Reading, RG6 6DF, UK

 


*** Options: https://wiki.eprints.org/w/Eprints-tech_Mailing_List
*** Archive: https://www.eprints.org/tech.php/
*** EPrints community wiki: https://wiki.eprints.org/