EPrints Technical Mailing List Archive
Message: #05881
Re: [EP-tech] Antwort: Re: Digital Preservation in EPrints
- To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>
- Subject: Re: [EP-tech] Antwort: Re: Digital Preservation in EPrints
- From: Tomasz Neugebauer <Tomasz.Neugebauer@concordia.ca>
- Date: Fri, 19 Aug 2016 15:03:03 +0000
I asked the National Archives about the PRONOM risk scores for formats. The use of these scores is documented, but the code that uses them is commented out in the EPrints Preservation Plugin. Although PRONOM has the potential to add risk scores, they are all blank now, and adding them is not something the National Archives is looking into.

Tomasz

From: eprints-tech-bounces@ecs.soton.ac.uk [mailto:eprints-tech-bounces@ecs.soton.ac.uk] On Behalf Of Tomasz Neugebauer

I have been going through the installation of the DROID and Preservation Toolkit plugins over the last few days. It was difficult to figure out, so I thought I would share a summary of what I learned about these plugins, and how I got them to work:

DROID

Bazaar: http://bazaar.eprints.org/143/
GitHub: https://github.com/eprintsug/droid
Prerequisites: Java 1.6 or higher

What it does / how I got it to work:

On activation, it is supposed to download the DROID 4 tar file from here:
http://freefr.dl.sourceforge.net/project/droid/droid/4.0.0/droid-4.0.0-linux.tar.gz
and then untar it into /lib/bin/DROID. All of this failed without any error message on my EPrints 3.3.12. The Bazaar package said it installed OK, but it did not report that it was unable to complete the required steps. There is a message on the list about File::Move vs File::Copy::Recursive::rmove (http://www.eprints.org/tech.php/thread-16264.html), but I couldn’t get this to work. Instead, I manually downloaded the tar file and untarred it (using the command line) into the /lib/bin/DROID/ folder.
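
For anyone who would rather script the manual untar step than type it by hand, here is a minimal sketch in Python. It assumes the tar file has already been downloaded to a local path. The function name extract_droid and the example paths are only illustrations and are not part of the DROID plugin.

```python
import tarfile
from pathlib import Path


def extract_droid(tarball: str, dest_dir: str) -> list[str]:
    """Extract a .tar.gz archive into dest_dir and return the member names.

    Hypothetical helper for illustration; requires Python 3.9 or later.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    root = dest.resolve()
    with tarfile.open(tarball, "r:gz") as archive:
        names = archive.getnames()
        # Refuse members that would land outside dest_dir (absolute or ".." paths).
        for name in names:
            if not (root / name).resolve().is_relative_to(root):
                raise ValueError(f"unsafe path in archive: {name}")
        archive.extractall(root)
    return names


# Example call (paths are assumptions, adjust them to your own system):
# extract_droid("droid-4.0.0-linux.tar.gz", "/lib/bin/DROID")
```

Listing the returned names afterwards is a quick way to confirm that droid.jar ended up where the plugin expects it.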

The plugin also adds some cron events for updating the DROID_SignatureFile.xml and running the scan; I think this part is working. I was also able to update the signature file using the command line:

java -jar /lib/bin/DROID/droid.jar -d /lib/bin/DROID/DROID_SignatureFile.xml

=======================

PRESERVATION Toolkit

Bazaar: http://bazaar.eprints.org/142/
GitHub: https://github.com/eprintsug/preservation_toolkit
Prerequisites: DROID
Some documentation: http://www.eprints.org/software/training/3.2/admin/filerisks_tutorial.php

What it does / how I got it to work:

It is supposed to provide Editors with a Format/Risks button that lists the format types in their repository and the number of documents of each type.

After plugin install, the button didn’t show up on my repository, because the can_be_viewed permission on line 45 of FormatsRisks.pm didn’t exist in my EPrints, so I changed line 45 of FormatsRisks.pm to:

return $self->allow( "config/view" );

That got me a button. Clicking on it, at first, it said that I had no objects in the repository, and a new button appeared: “Request File Type Recount”. Either by pushing this button, or on plugin activation (I’m not sure), a cron event is added which went through the repository and produced a results table with two categories:

1) High Risk Objects: these are all the UNKNOWN (DROID found no classification match)
2) Format Breakdown: list of format types and how many there are of each.

It would be great if it provided a button to list which documents are high risk. There is mention of this in the docs (a “plus” button), but I didn’t see this working. Has anyone figured out how to get the “plus” button or something like it, so that I can quickly find out which documents belong to the “high risk” category?

In the documentation and the code, there is mention of classification into “low”, “medium” and “high” risk, but this is not working. There are a number of reasons for that.

First, the “update_risk_scores()” call on line 23 of Update_Pronom_File_Counts is commented out (as is the whole function). This is the function that uses SOAP::Lite to query PRONOM at the National Archives for risk scores associated with each format.
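
To check on your own installation whether that call is commented out, something like the following minimal Python sketch may help. It only looks for Perl line comments (a leading '#') and does not understand POD blocks or trailing comments. The function name find_calls and the example path are hypothetical.

```python
import re
from pathlib import Path


def find_calls(source_path: str, function_name: str) -> list[tuple[int, bool, str]]:
    """Return (line_number, is_commented_out, text) for each line naming function_name."""
    pattern = re.compile(r"\b" + re.escape(function_name) + r"\b")
    results = []
    lines = Path(source_path).read_text(errors="replace").splitlines()
    for number, line in enumerate(lines, start=1):
        if pattern.search(line):
            # Assumption: a line whose first non-blank character is '#' is commented out.
            commented = line.lstrip().startswith("#")
            results.append((number, commented, line.strip()))
    return results


# Example call (the path is an assumption, point it at your copy of the plugin file):
# for number, commented, text in find_calls("path/to/Update_Pronom_File_Counts", "update_risk_scores"):
#     print(number, "commented out" if commented else "active", text)
```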
Since this is actually commented out in the plugin, I see no reason to install SOAP::Lite.

Second, and this part I found most confusing: it looks to me like PRONOM still doesn’t have any risk scores associated with any format types in its database (is that correct?), so it may be pointless to try to activate this part of the plugin. PRONOM allows you to query for risk scores (see: http://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=new), but if you search, you will see that all formats have a blank risk score. The documentation for the plugin talks about an “unstable” risk score retrieval set up for testing at EPrints, and used to generate screenshots for the docs/presentations.
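
With the risk scores blank, the only risk category left in practice is UNKNOWN. The two-part results table described above, plus the per-document listing I was hoping for, can be sketched as follows. This is a rough Python illustration, not the plugin's code: it assumes you can somehow export (document id, format label) pairs from the repository, and the function name and sample values are made up.

```python
from collections import Counter

# Label assumed here for "DROID found no classification match".
UNKNOWN = "UNKNOWN"


def summarise_formats(records):
    """Split (document_id, format_label) pairs into the two table categories.

    Returns (high_risk_ids, breakdown): the ids of documents with an UNKNOWN
    format, and a Counter of how many documents there are of each format.
    """
    records = list(records)
    breakdown = Counter(label for _, label in records)
    high_risk_ids = [doc_id for doc_id, label in records if label == UNKNOWN]
    return high_risk_ids, breakdown


# Sample values only; the ids and PRONOM-style labels are invented for illustration.
sample = [(101, "fmt/18"), (102, "UNKNOWN"), (103, "fmt/18"), (104, "fmt/40")]
high_risk_ids, breakdown = summarise_formats(sample)
```

Keeping the document ids next to the counts is the piece the current results table does not seem to offer.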

My apologies for the long message; if you have read all of this, and want to correct something or add some information, it would be very much appreciated.

Best wishes,
Tomasz
________________________________________________
Tomasz Neugebauer 
Tel. / Tél. 514-848-2424 ext. / poste 7738 
Mailing address / adresse postale: 1455 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8 
library.concordia.ca 

From: eprints-tech-bounces@ecs.soton.ac.uk [mailto:eprints-tech-bounces@ecs.soton.ac.uk] On Behalf Of martin.braendle@id.uzh.ch

Hi Tomasz,

- References:
- Re: [EP-tech] Digital Preservation in EPrints
- From: Adam Field <Adam.Field@jisc.ac.uk>
 
- [EP-tech] Antwort: Re:  Digital Preservation in EPrints
- From: martin.braendle@id.uzh.ch
 
- Re: [EP-tech] Antwort: Re:  Digital Preservation in EPrints
- From: Tomasz Neugebauer <Tomasz.Neugebauer@concordia.ca>
 
 
- Re: [EP-tech] Digital Preservation in EPrints