EPrints Technical Mailing List Archive
See the EPrints wiki for instructions on how to join this mailing list and related information.
Message: #10115
RE: [EP-tech] Thousands of dataobj.xml files
- To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>
- Subject: RE: [EP-tech] Thousands of dataobj.xml files
- From: John Salter <J.Salter@leeds.ac.uk>
- Date: Thu, 8 May 2025 21:10:28 +0000
CAUTION: This e-mail originated outside the University of Southampton.
Hi Fernando,

They shouldn’t cause any problems on your system.

If you want to get from the data you have to the specific file, look up the file record in your database, using the number from /id/file/[number] (136799 in your example).

From the results, the datasetid should be ‘history’, and there should be a number in the ‘objectid’ field, e.g. 12345. Selecting that ‘objectid’ from the history table gives you details of what caused the revision to the data:

    SELECT * FROM history WHERE historyid = 12345;

The columns of the history table should include the datasetid (hopefully ‘eprint’), the objectid (the eprint ID; 6789 in the example below) and the revision number, e.g. 10. This revision number is the filename on disk, within the item’s document folder:

    archives/ARCHIVEID/documents/disk0/00/00/67/89/revisions/10.xml

Each eprint should have a ‘revisions’ folder with numbered XML files in it. I think very old versions of EPrints (v2, maybe v3.1?) stored these revision files differently, so if the repository existed before v3.3.12, early eprints may have other styles of revision files.

Cheers,
John

From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> On Behalf Of David Newman
Hi,

dataobj.xml is the placeholder name for history revision files, which appear in the revisions subdirectory of the individual EPrints record's document subdirectory. There they appear as 1.xml, 2.xml, etc. rather than dataobj.xml, where the number is the revision number of the history record for that EPrints record. History revision files are a snapshot in time of the metadata of that EPrints record.

Regards
David Newman

Sent from Outlook for Android

From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> on behalf of kralizeck@gmail.com <kralizeck@gmail.com>

Hi.

I get 77414 files when I go to "Manage records->Files" and filter by name "dataobj.xml" (out of a total of 119968 files without filters). Modification dates run from 2010 (the first EPrints installation, by other people) until now (I took over a few weeks ago to upgrade from 3.3.12 to 3.4.6).

I've searched for information, but haven't found anything. All the .xml files have the same content when I export them with Atom (URL edited):

    <?xml version="1.0" encoding="utf-8" ?>

There is no dataobj.xml in the filesystem, so I assume they are in the database.

I would appreciate any help or recommendations for investigating this issue and clearing up my doubts.
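The mapping from an eprint ID and revision number to the on-disk revision file, as John describes it above, can be sketched in a few lines of Python. This is only an illustration: the helper name `revision_path` is hypothetical, and the rule that the ID is zero-padded to eight digits and split into two-digit directories is an assumption inferred from the single example in the thread (6789 becoming 00/00/67/89). The `disk0` component is likewise taken from that example and may differ on a given installation.

```python
from pathlib import PurePosixPath


def revision_path(archive_id: str, eprint_id: int, revision: int,
                  disk: str = "disk0") -> PurePosixPath:
    """Build the expected path of a history revision file.

    Assumption (inferred from the example in the thread): the eprint ID is
    zero-padded to eight digits and split into four two-digit directories.
    """
    if eprint_id < 0 or eprint_id > 99_999_999:
        raise ValueError("eprint_id must fit in eight digits")
    padded = f"{eprint_id:08d}"  # e.g. 6789 -> "00006789"
    parts = [padded[i:i + 2] for i in range(0, 8, 2)]  # ["00", "00", "67", "89"]
    return PurePosixPath("archives", archive_id, "documents", disk,
                         *parts, "revisions", f"{revision}.xml")


# Values from the example in the thread: eprint 6789, revision 10.
print(revision_path("ARCHIVEID", 6789, 10))
```

With the values from the thread this reproduces the path John gives; for a real repository, ARCHIVEID would be replaced by the actual archive ID, and the result should be checked against the filesystem rather than trusted.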