Package com.qwazr.library.html
Class HtmlParser
java.lang.Object
com.qwazr.library.html.HtmlParser
- All Implemented Interfaces:
com.qwazr.extractor.ParserFactory,com.qwazr.extractor.ParserInterface
public class HtmlParser
extends java.lang.Object
implements com.qwazr.extractor.ParserFactory, com.qwazr.extractor.ParserInterface
-
Field Summary
-
Constructor Summary
Constructors Constructor Description HtmlParser() -
Method Summary
Modifier and Type Method Description com.qwazr.extractor.ParserInterfacecreateParser()com.qwazr.extractor.ParserResultextract(javax.ws.rs.core.MultivaluedMap<java.lang.String,java.lang.String> parameters, java.io.InputStream inputStream, javax.ws.rs.core.MediaType mediaType)com.qwazr.extractor.ParserResultextract(javax.ws.rs.core.MultivaluedMap<java.lang.String,java.lang.String> parameters, java.nio.file.Path filePath)java.util.Collection<com.qwazr.extractor.ParserField>getFields()java.lang.StringgetName()static org.apache.xerces.parsers.DOMParsergetNewDomParser()static org.cyberneko.html.HTMLConfigurationgetNewHtmlConfiguration()Create a new NekoHTML configurationjava.util.Collection<com.qwazr.extractor.ParserField>getParameters()java.util.Collection<java.lang.String>getSupportedFileExtensions()java.util.Collection<javax.ws.rs.core.MediaType>getSupportedMimeTypes()static org.apache.xerces.parsers.DOMParsergetThreadLocalDomParser()
-
Constructor Details
-
HtmlParser
public HtmlParser()
-
-
Method Details
-
getNewHtmlConfiguration
public static org.cyberneko.html.HTMLConfiguration getNewHtmlConfiguration()Create a new NekoHTML configuration- Returns:
-
getNewDomParser
public static org.apache.xerces.parsers.DOMParser getNewDomParser() -
getThreadLocalDomParser
public static org.apache.xerces.parsers.DOMParser getThreadLocalDomParser() -
getParameters
public java.util.Collection<com.qwazr.extractor.ParserField> getParameters()- Specified by:
getParametersin interfacecom.qwazr.extractor.ParserFactory
-
getFields
public java.util.Collection<com.qwazr.extractor.ParserField> getFields()- Specified by:
getFieldsin interfacecom.qwazr.extractor.ParserFactory
-
extract
public com.qwazr.extractor.ParserResult extract(javax.ws.rs.core.MultivaluedMap<java.lang.String,java.lang.String> parameters, java.io.InputStream inputStream, javax.ws.rs.core.MediaType mediaType) throws java.io.IOException- Specified by:
extractin interfacecom.qwazr.extractor.ParserInterface- Throws:
java.io.IOException
-
extract
public com.qwazr.extractor.ParserResult extract(javax.ws.rs.core.MultivaluedMap<java.lang.String,java.lang.String> parameters, java.nio.file.Path filePath) throws java.io.IOException- Specified by:
extractin interfacecom.qwazr.extractor.ParserInterface- Throws:
java.io.IOException
-
getName
public java.lang.String getName()- Specified by:
getNamein interfacecom.qwazr.extractor.ParserFactory
-
createParser
public com.qwazr.extractor.ParserInterface createParser()- Specified by:
createParserin interfacecom.qwazr.extractor.ParserFactory
-
getSupportedFileExtensions
public java.util.Collection<java.lang.String> getSupportedFileExtensions()- Specified by:
getSupportedFileExtensionsin interfacecom.qwazr.extractor.ParserFactory
-
getSupportedMimeTypes
public java.util.Collection<javax.ws.rs.core.MediaType> getSupportedMimeTypes()- Specified by:
getSupportedMimeTypesin interfacecom.qwazr.extractor.ParserFactory
-