EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #02696


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

[EP-tech] Re: harvester (question)


yes Seb!

I am interested by your importer! And yes, why not a github project for french tools...

jm


Le 03/03/2014 11:29, Sebastien Francois a écrit :
Hi JM,

I have an importer for TEF (for STAR, via OAI), d'ya want to try that?

Perhaps we could put some French related tools under a common github project?

Seb.


On 03/03/14 08:18, Jean-Marie Le Bechec wrote:
hi Seb,

I need to harvest an OAI server in a format other than Dublin Core (TEF format). I can not get specific metadata with the same name.

For example :
...
<tef:thesisAdmin>
                    <tef:auteur>
                      <tef:nom>nom1</tef:nom>
...

and
...
<tef:directeurThese>
                      <tef:nom>nom2</tef:nom>
                      <tef:prenom>Carine</tef:prenom>
                      <tef:autoriteInterne>MADS_DIRECTEUR_DE_THESE_1</tef:autoriteInterne>
                      <tef:autoriteExterne autoriteSource="Sudoc">073367826</tef:autoriteExterne>
                    </tef:directeurThese>
                    <tef:directeurThese>
                      <tef:nom>nom3</tef:nom>
                      <tef:prenom>Louise</tef:prenom>
                      <tef:autoriteInterne>MADS_DIRECTEUR_DE_THESE_2</tef:autoriteInterne>
                      <tef:autoriteExterne autoriteSource="Sudoc">035036672</tef:autoriteExterne>
                    </tef:directeurThese>
...
in the same record !

I need to extract all this data.

I tried things like :

my $nom;
foreach my $node ($xml->findnodes( "//auteur/nom/*" ))
       {
               $nom = $node->textContent;     
       }

but it does not work (no result)

any idea ?


Thanks !

Jean-Marie

-- 

***********************************************
Jean Marie Le Bechec
Service Commun de la Documentation
Responsable ingenierie documentaire
&
Direction du Systeme d'Information
Referent Etudes

Institut National Polytechnique de Toulouse
6 allee Emile Monso - bp 34038 -
31029 Toulouse cedex 4
Tel : 05 34 32 31 16
Mail : lebechec@inp-toulouse.fr
*********************************************** 


*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/



*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/

-- 

***********************************************
Jean Marie Le Bechec
Service Commun de la Documentation
Responsable ingenierie documentaire
&
Direction du Systeme d'Information
Referent Etudes

Institut National Polytechnique de Toulouse
6 allee Emile Monso - bp 34038 -
31029 Toulouse cedex 4
Tel : 05 34 32 31 16
Mail : lebechec@inp-toulouse.fr
***********************************************