EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #06774


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

[EP-tech] format migrations/conversions in EPrints


 

I am working on a digital preservation policy document, and I wanted to summarize the format migrations that take place by default in EPrints.

 

I am aware of these migrations, derivative files are generated as access copies for audio and video documents:

 

·         Audio documents ->

o    Audio.ogg (Format: Vorbis | Bitrate: 96K | Sampling Rate:44.1 kH | Container: OGG)

o   Audio.mp4 (Format: AAC (Advanced Audio Codec) | Bitrate: 96K | Sampling Rate: 44.1 kH | Container: MPEG-4)

·         Video documents ->

o   Video.ogg  (Video Format: Theora | Audio Format: Vorbis| Video Bitrate: 500 KB/s | Video Size: 640*360 (16:9) | Video Frame Rate: 10 FPS | Audio Sampling Rate: 44.1 kHz | Audio Bitrate: 500 kb/s | Container: OGG )

o   Video.mp4 (Video Format:  AVC (Advanced Video Codec ) | Audio Format: AAC (Advanced Audio Codec)  | Video Bitrate: 500 KB/s | Video Size: 640*360 (16:9) | Video Frame Rate: 10 FPS | Audio Sampling Rate: 44.1 kHz | Audio Bitrate: 500 kb/s | Container: MPEG-4 )

(see: https://github.com/eprints/eprints/blob/392474eec1b8125a66ed2d3e12b02aeb67dc07c4/perl_lib/EPrints/Plugin/Convert/Thumbnails.pm  )

 

I am also aware of this:

·         DOC files -> indexcodes.txt (see: https://github.com/eprints/eprints/blob/392474eec1b8125a66ed2d3e12b02aeb67dc07c4/perl_lib/EPrints/Plugin/Convert/IndexCodes.pm)

It extracts indexing terms from DOC files and places these in a TEXT file.  So this isn’t a migration, but a derivative text file which gets stored. It is similar to the thumbnail “preview.jpg” image for Video and PDF documents:

·         Video or PDF file -> preview.jpg

 

Are there more migrations/format conversions/normalizations that take place in EPrints by default?

For example, does EPrints ever convert .DOC files to PDF?, using this: https://github.com/eprints/eprints/blob/392474eec1b8125a66ed2d3e12b02aeb67dc07c4/perl_lib/EPrints/Plugin/Convert/DocPDF.pm ?

 

Best wishes,

Tomasz