JWPL


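JWPL Bug:
org.xml.sax.SAXParseException JAXP00010004

The JDK raises this error when the accumulated size of XML entities exceeds the limit set by FEATURE_SECURE_PROCESSING (50,000,000 by default), which a full Wikipedia dump easily does.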

JWPL Debug:
Copy this code into App.java:


package maven.maven;

import javax.xml.XMLConstants;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;

import org.xml.sax.SAXNotRecognizedException;
import org.xml.sax.SAXNotSupportedException;

import de.tudarmstadt.ukp.wikipedia.datamachine.domain.JWPLDataMachine;

public class App
{
    public static void main( String[] args ) throws SAXNotRecognizedException, SAXNotSupportedException, ParserConfigurationException
    {
        // Raise the JDK limit on the total size of XML entities
        // before any parser is created.
        System.setProperty("jdk.xml.totalEntitySizeLimit", "500000000");

        // Turn off secure processing on this parser factory.
        SAXParserFactory spf = SAXParserFactory.newInstance();
        spf.setFeature(XMLConstants.FEATURE_SECURE_PROCESSING, false);

        // Arguments: language, main category, disambiguation category,
        // and the path where the wiki dump is stored.
        String[] arg = {"french", "Accueil", "Homonymie", "/home/qcrialt/ATQAM/WikipediaDumps/frwiki-20170920"};
        JWPLDataMachine.main(arg);
    }
}
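
The key step in the fix is setting the jdk.xml.totalEntitySizeLimit system property before the XML parser is created. As a minimal sketch of that step on its own, the class below sets the property and reads it back. The class name EntityLimitConfig and the method applyLimit are hypothetical names used only for illustration; they are not part of JWPL. According to the JDK documentation, a value of 0 or less means no limit.

```java
// Hypothetical helper (not part of JWPL): sets the JDK entity size limit
// that triggers JAXP00010004 when it is exceeded.
class EntityLimitConfig {
    static final String TOTAL_ENTITY_SIZE_LIMIT = "jdk.xml.totalEntitySizeLimit";

    // Sets the system property and returns the value now in effect.
    // Call this before any XML parser is created.
    static String applyLimit(long limit) {
        System.setProperty(TOTAL_ENTITY_SIZE_LIMIT, Long.toString(limit));
        return System.getProperty(TOTAL_ENTITY_SIZE_LIMIT);
    }

    public static void main(String[] args) {
        // Default to the value used in App.java; an optional first argument overrides it.
        long limit = args.length > 0 ? Long.parseLong(args[0]) : 500_000_000L;
        System.out.println(TOTAL_ENTITY_SIZE_LIMIT + " = " + applyLimit(limit));
    }
}
```

The same property can also be passed on the command line as -Djdk.xml.totalEntitySizeLimit=500000000 when starting the JVM, which avoids changing the code.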

Add this dependency to pom.xml (see https://dkpro.github.io/dkpro-jwpl/DeveloperSetup/):



<dependency>
  <groupId>de.tudarmstadt.ukp.wikipedia</groupId>
  <artifactId>de.tudarmstadt.ukp.wikipedia.datamachine</artifactId>
  <version>1.0.0</version>
</dependency>
 
  
Link to JWPL tables.sql
 
 
JWPL working time for English = 83,826,173 ms (about 23.3 hours)
JWPL working time for French = 8,245,844 ms (about 2.3 hours)
JWPL working time for Arabic = 2,004,813 ms (about 33 minutes)
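
Millisecond counts this large are hard to read at a glance, so here is a small sketch that converts them to hours, minutes, and seconds. It assumes the dots in the figures above are thousands separators. The class name WorkingTime and the method format are hypothetical and not part of JWPL.

```java
import java.time.Duration;

// Hypothetical helper (not part of JWPL): formats a run time given in milliseconds.
class WorkingTime {
    // Returns the duration as whole hours, minutes, and seconds
    // (fractions of a second are dropped).
    static String format(long millis) {
        Duration d = Duration.ofMillis(millis);
        long hours = d.toHours();
        long minutes = d.toMinutes() % 60;
        long seconds = d.getSeconds() % 60;
        return hours + " h " + minutes + " min " + seconds + " s";
    }

    public static void main(String[] args) {
        System.out.println("English: " + format(83_826_173L)); // 23 h 17 min 6 s
        System.out.println("French:  " + format(8_245_844L));  // 2 h 17 min 25 s
        System.out.println("Arabic:  " + format(2_004_813L));  // 0 h 33 min 24 s
    }
}
```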