sdx-users
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[sdx-users] Pb de moissonneur OAI


From: Andre . Davignon
Subject: [sdx-users] Pb de moissonneur OAI
Date: Thu, 26 Aug 2004 11:41:15 +0200

Bonjour,

Je rencontre un problème avec un moissonneur OAI. Visiblement la moisson se
fait correctement puisque les documents se placent dans un sous répertoire
"work" de Tomcat
(C:\Tomcat\jakarta-tomcat-4.1.30\work\Standalone\localhost\sdx\cocoon-files\
upload-dir\ceddre_oaiHarvests\notices\harvest-2004...). Les fichiers .sdx
ont l'air correct.

Par contre les documents ne sont pas importés dans le sdxRepository (et pas
indexés...) et un fichier sdxError.log est généré. Pour chaque tentative
d'importation des documents moissonnés, j'ai l'erreur suivante :

ERROR   (2004-08-26) 10:56.56:380   [sdx.framework.ceddre.notices]
(Unknown-URI) Unknown-thread/SDXException: SDX - Document - XML : erreur
dans le document à
file://C%3A/Tomcat/jakarta-tomcat-4.1.30/work/Standalone/localhost/sdx/cocoo
n-files/upload-dir/ceddre_oaiHarvests/notices/harvest-2004-08-26T08%253A56%2
53A49Z/sdx%253A172.16.41.30%253A8080%253Avillesnouvelles%252Fnotices%252F621
7.sdx : C%3A
java.net.UnknownHostException: C%3A
        at
fr.gouv.culture.sdx.exception.SDXException.log(SDXException.java:115)
        at
fr.gouv.culture.sdx.exception.SDXException.<init>(SDXException.java:103)
        at
fr.gouv.culture.sdx.document.XMLDocument.parse(XMLDocument.java:208)
        at
fr.gouv.culture.sdx.document.XMLDocument.startIndexing(XMLDocument.java:174)
        at
fr.gouv.culture.sdx.documentbase.SDXDocumentBase.index(SDXDocumentBase.java:
1159)
        at
fr.gouv.culture.sdx.documentbase.SDXDocumentBase.index(SDXDocumentBase.java:
1032)
        at
fr.gouv.culture.sdx.oai.AbstractDocumentBaseOAIHarvester.storeHarvestedData(
AbstractDocumentBaseOAIHarvester.java:643)
        at
fr.gouv.culture.oai.AbstractOAIHarvester.endElement(AbstractOAIHarvester.jav
a:310)
        at
fr.gouv.culture.sdx.oai.AbstractDocumentBaseOAIHarvester.endElement(Abstract
DocumentBaseOAIHarvester.java:866)
        at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown
Source)
        at
org.apache.xerces.impl.XMLNamespaceBinder.handleEndElement(Unknown Source)
        at org.apache.xerces.impl.XMLNamespaceBinder.endElement(Unknown
Source)
        at
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanEndElement(Unknown
Source)
        at
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatc
her.dispatch(Unknown Source)
        at
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown
Source)
        at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source)
        at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source)
        at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
        at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
        at
org.apache.avalon.excalibur.xml.JaxpParser.parse(JaxpParser.java:264)
        at
org.apache.avalon.excalibur.xml.JaxpParser.parse(JaxpParser.java:215)
        at
org.apache.cocoon.components.source.AbstractStreamSource.toSAX(AbstractStrea
mSource.java:206)
        at
fr.gouv.culture.oai.AbstractOAIHarvester.receiveRequest(AbstractOAIHarvester
.java:502)
        at
fr.gouv.culture.oai.AbstractOAIHarvester.receiveSynchronizedRequest(Abstract
OAIHarvester.java:473)
        at
fr.gouv.culture.sdx.oai.AbstractDocumentBaseOAIHarvester.targetTriggered(Abs
tractDocumentBaseOAIHarvester.java:827)
        at
fr.gouv.culture.util.apache.avalon.cornerstone.services.scheduler.SimpleTime
Scheduler$1.run(SimpleTimeScheduler.java:104)

Ci dessous les extraits de mon fichier application.xconf du moissonneur :


    <sdx:documentBases>
         <sdx:documentBase id="notices" type="lucene" default="true"
keepOriginalDocuments="true">
            <sdx:queryParser
class="fr.gouv.culture.sdx.search.lucene.queryparser.DefaultQueryParser"/>
             <sdx:repositories>
                <sdx:repository id="notices" type="FS"
baseDirectory="repos/notices" depth="2" extent="100" default="true"/>
                <sdx:repository id="url" type="URL"/>
            </sdx:repositories>
                .....
        <sdx:oai-harvester adminEmail="address@hidden">
           <sdx:oai-data-providers>
              <sdx:oai-repository
url="http://172.16.41.30:8080/sdx/sdx/oai/villesnouvelles/notices";
sdxRepository="notices">
               <sdx:update type="periodic">
                  <sdx:offset>60000</sdx:offset>
                  <sdx:period>3600000</sdx:period>
                </sdx:update>
                  <sdx:oai-verb name="ListRecords" metadataPrefix="oai_dc"/>
        </sdx:oai-repository>
        <sdx:pipeline>
           <sdx:transformation id="index-oai" type="XSLT"
src="index-oai.xsl"/>
        </sdx:pipeline> 
          </sdx:oai-data-providers>
         </sdx:oai-harvester>
  </sdx:documentBase>
</sdx:documentBases>

André Davignon





reply via email to

[Prev in Thread] Current Thread [Next in Thread]