JWPL
JWPL Bug:
Bug:org.xml.sax.saxparseexception JAXP00010004
JWPL Debug:
Copy this code in App.java
package maven.maven;
import javax.xml.XMLConstants;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.SAXNotRecognizedException;
import org.xml.sax.SAXNotSupportedException;
import de.tudarmstadt.ukp.wikipedia.datamachine.domain.JWPLDataMachine;
public class App
{
public static void main( String[] args ) throws SAXNotRecognizedException, SAXNotSupportedException, ParserConfigurationException
{
System.setProperty("jdk.xml.totalEntitySizeLimit", "500000000");
SAXParserFactory spf = SAXParserFactory.newInstance();
spf.setFeature(XMLConstants.FEATURE_SECURE_PROCESSING, false);
// Path where wiki dump is stored.
String[] arg = {"french", "Accueil", "Homonymie", "/home/qcrialt/ATQAM/WikipediaDumps/frwiki-20170920"};
JWPLDataMachine.main(arg);
// french Accueil Homonymie
// /home/qcrialt/ATQAM/WikipediaDumps/frwiki-20170920
}
}
Add this dependency in pom.xml
https://dkpro.github.io/dkpro-jwpl/DeveloperSetup/
<dependency>
<groupId>de.tudarmstadt.ukp.wikipedia</groupId>
<artifactId>de.tudarmstadt.ukp.wikipedia.datamachine</artifactId>
<version>1.0.0</version>
</dependency>
link to JWPL tables.sql
Seconn
JWPL Working time for English= 83.826.173 ms
JWPL Working time for French = 8.245.844 ms
JWPL Working time for Arabic= 2.004.813 ms
Comments
Post a Comment