Uploaded image for project: 'S2Robot'
  1. S2Robot
  2. ROBOT-44

XML を表現する .vm テンプレートファイルが XML として処理される

XMLWordPrintable

    • Type: Icon: Bug Bug
    • Resolution: Duplicate
    • Priority: Icon: Major Major
    • 0.1.1
    • Affects Version/s: None
    • Component/s: S2Robot
    • None

      XML を表現する .vm ファイルのときに、XML として処理しようとして Exception になっている模様。

      Caused by: org.seasar.robot.extractor.ExtractException: Could not extract a content.
          at org.seasar.robot.extractor.impl.TikaExtractor.getText(TikaExtractor.java:65)
          at jp.sf.fess.transformer.FessFileTransformer.transform(FessFileTransformer.java:103)
          ... 4 more
      Caused by: org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.xml.DcXMLParser@4822e10
          at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:129)
          at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:105)
          at org.seasar.robot.extractor.impl.TikaExtractor.getText(TikaExtractor.java:63)
          ... 5 more
      Caused by: org.xml.sax.SAXParseException: XML document structures must start and end within the same entity.
          at org.apache.xerces.util.ErrorHandlerWrapper.createSAXParseException(Unknown Source)
          at org.apache.xerces.util.ErrorHandlerWrapper.fatalError(Unknown Source)
          at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source)
          at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source)
          at org.apache.xerces.impl.XMLScanner.reportFatalError(Unknown Source)
          at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.endEntity(Unknown Source)
          at org.apache.xerces.impl.XMLDocumentScannerImpl.endEntity(Unknown Source)
          at org.apache.xerces.impl.XMLEntityManager.endEntity(Unknown Source)
          at org.apache.xerces.impl.XMLEntityScanner.load(Unknown Source)
          at org.apache.xerces.impl.XMLEntityScanner.scanContent(Unknown Source)
          at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanContent(Unknown Source)
          at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
          at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
          at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
          at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
          at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
          at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
          at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
          at org.apache.xerces.jaxp.SAXParserImpl.parse(Unknown Source)
          at javax.xml.parsers.SAXParser.parse(SAXParser.java:198)
          at org.apache.tika.parser.xml.XMLParser.parse(XMLParser.java:59)
          at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:119)
          ... 7 more
      

      Tika の設定ファイルで対応できると思うが、修正内容が判明したら、Tika にもバグをあげて直すのが良い。

            Assignee:
            shinsuke shinsuke
            Reporter:
            shinsuke shinsuke
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

              Created:
              Updated:
              Resolved: