org.allenai.nlpstack.parse.poly.polyparser

GoldParseTrainingVectorSource

Related Doc: package polyparser

case class GoldParseTrainingVectorSource(goldParses: PolytreeParseSource, transitionSystemFactory: TransitionSystemFactory, baseCostFunctionFactory: Option[StateCostFunctionFactory] = None) extends FSMTrainingVectorSource with Product with Serializable

A GoldParseTrainingVectorSource reduces a gold parse tree to a set of feature vectors for classifier training.

Essentially, we derive the 2*n parser states that lead to the gold parse. Each of these states becomes a feature vector (using the apply method of the provided TransitionParserFeature), labeled with the transition executed from that state in the gold parse.

One of the constructor arguments is a TaskIdentifer. This will dispatch the feature vectors to train different classifiers. For instance, if taskIdentifier(state) != taskIdentifier(state2), then their respective feature vectors (i.e. feature(state) and feature(state2)) will be used to train different classifiers.

goldParses

the data source for the parse trees

transitionSystemFactory

the transition system factory to use (for generating states)

baseCostFunctionFactory

a trained cost function factory to adapt (optional)

Linear Supertypes
Serializable, Serializable, Product, Equals, FSMTrainingVectorSource, AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. GoldParseTrainingVectorSource
  2. Serializable
  3. Serializable
  4. Product
  5. Equals
  6. FSMTrainingVectorSource
  7. AnyRef
  8. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new GoldParseTrainingVectorSource(goldParses: PolytreeParseSource, transitionSystemFactory: TransitionSystemFactory, baseCostFunctionFactory: Option[StateCostFunctionFactory] = None)

    goldParses

    the data source for the parse trees

    transitionSystemFactory

    the transition system factory to use (for generating states)

    baseCostFunctionFactory

    a trained cost function factory to adapt (optional)

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  5. val baseCostFunctionFactory: Option[StateCostFunctionFactory]

    a trained cost function factory to adapt (optional)

  6. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  8. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  9. def generateVectors(sculpture: Sculpture): List[FSMTrainingVector]

    This generates a list of labeled feature vectors from a gold parse tree (for training).

    This generates a list of labeled feature vectors from a gold parse tree (for training). The gold parse tree is reduced to its representation as a list of 2*n transitions, then a TrainingVector is produced for each transition (in order).

    Note that this function is implemented using tail-recursion.

    sculpture

    the sculpture to generate feature vectors from

    returns

    a list of training vectors

    Attributes
    protected
    Definition Classes
    FSMTrainingVectorSource
  10. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  11. def getVectorIterator: Iterator[FSMTrainingVector]

  12. val goldParses: PolytreeParseSource

    the data source for the parse trees

  13. def groupVectorIteratorsByTask: Iterator[(ClassificationTask, Iterator[FSMTrainingVector])]

    Definition Classes
    FSMTrainingVectorSource
  14. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  15. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  16. final def notify(): Unit

    Definition Classes
    AnyRef
  17. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  18. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  19. lazy val tasks: Set[ClassificationTask]

    Returns the set of tasks associated with the training vectors in this source.

    Returns the set of tasks associated with the training vectors in this source.

    In a perhaps over-careful attempt to avoid having all the non-uniqued tasks being stored in memory simultaneously, this was originally implemented as:

    format: OFF lazy val tasks: Iterable[ClassificationTask] = taskHelper(Set(), getVectorIterator) tailrec private def taskHelper( resultSoFar: Set[ClassificationTask], vectorIter: Iterator[FSMTrainingVector] ): Set[ClassificationTask] = {

    if (!vectorIter.hasNext) { resultSoFar } else { taskHelper(resultSoFar + vectorIter.next().task, vectorIter) } } format: ON

    Definition Classes
    FSMTrainingVectorSource
  20. val transitionSystemFactory: TransitionSystemFactory

    the transition system factory to use (for generating states)

  21. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  22. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  23. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from Serializable

Inherited from Serializable

Inherited from Product

Inherited from Equals

Inherited from FSMTrainingVectorSource

Inherited from AnyRef

Inherited from Any

Ungrouped