Package 

Class MetadataIndexer

  • All Implemented Interfaces:
    ai.platon.pulsar.common.config.KConfigurable, ai.platon.pulsar.common.config.Parameterized, ai.platon.pulsar.crawl.common.JobInitialized, ai.platon.pulsar.crawl.index.IndexingFilter

    
    public final class MetadataIndexer
     implements IndexingFilter
                        

    An indexer that can be configured to extract metadata from the crawldb, parse metadata, or content metadata. Set the properties "index.db", "index.parse", or "index.content"; each value is a comma-delimited list of keys, for example <value>key1,key2,key3</value>.
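
The comma-delimited format described above can be illustrated with a minimal sketch. It is not the class's actual implementation: the class name `MetadataKeySketch` and the helper `parseKeys` are hypothetical, a plain `Map` stands in for `ImmutableConfig`, and trimming whitespace and skipping empty entries are assumptions about how the values might reasonably be handled.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not part of the documented API.
class MetadataKeySketch {

    // Splits a comma-delimited property value such as "key1,key2,key3"
    // into a list of trimmed, non-empty keys. A null value yields an empty list.
    static List<String> parseKeys(String value) {
        List<String> keys = new ArrayList<>();
        if (value == null) {
            return keys;
        }
        for (String part : value.split(",")) {
            String key = part.trim();
            if (!key.isEmpty()) {
                keys.add(key);
            }
        }
        return keys;
    }

    public static void main(String[] args) {
        // Assumption: a plain Map stands in for the real configuration object.
        Map<String, String> conf = Map.of(
                "index.db", "key1,key2,key3",
                "index.parse", "title, author");

        // "index.content" is deliberately left unset to show the empty case.
        for (String prop : List.of("index.db", "index.parse", "index.content")) {
            System.out.println(prop + " -> " + parseKeys(conf.get(prop)));
        }
    }
}
```

In this sketch, a property that is not set simply produces no keys, so the corresponding metadata source would contribute nothing to the index document.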

    • Field Summary

      Fields 
      Modifier and Type Field Description
      private ImmutableConfig conf
    • Constructor Summary

      Constructors 
      Constructor Description
      MetadataIndexer(ImmutableConfig conf)
    • Method Summary

      Modifier and Type Method Description
      ImmutableConfig getConf()
      Unit setConf(ImmutableConfig conf)
      Unit setup(ImmutableConfig conf)
      Params getParams()
      IndexDocument filter(IndexDocument doc, String url, WebPage page)
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Constructor Detail

      • MetadataIndexer

        MetadataIndexer(ImmutableConfig conf)