org.apache.spark.sql.rapids.execution

HashJoinIterator

class HashJoinIterator extends BaseHashJoinIterator

An iterator that does a hash join against a stream of batches.
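
As an illustration of the general technique only, the following is a minimal CPU-side sketch of a hash join against a stream of batches. It does not use the plugin's classes: `HashJoinSketch`, `Row`, `Batch`, and `hashJoin` are hypothetical names, a "batch" is assumed to be a plain `Vector` of key/value rows, and only an inner join is shown. The real iterator works on GPU columnar batches and its internals are not described on this page.

```scala
object HashJoinSketch {
  // Hypothetical stand-ins: a "batch" is a Vector of (key, value) rows.
  type Row = (Int, String)
  type Batch = Vector[Row]

  /** Inner hash join: build a lookup table once, then probe it per stream batch. */
  def hashJoin(built: Batch, stream: Iterator[Batch]): Iterator[Vector[(Int, String, String)]] = {
    // Build phase: group the build-side values by join key.
    val table: Map[Int, Vector[String]] =
      built.groupBy(_._1).map { case (k, rows) => k -> rows.map(_._2) }

    // Probe phase: at most one output batch per stream batch; empty results are skipped.
    stream
      .map { batch =>
        for {
          (key, streamValue) <- batch
          buildValue <- table.getOrElse(key, Vector.empty)
        } yield (key, buildValue, streamValue)
      }
      .filter(_.nonEmpty)
  }
}
```

The build side is materialized once while the stream side is consumed lazily, so the result is itself an iterator of batches, which matches the shape of the class described here.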

Inherited
  1. HashJoinIterator
  2. BaseHashJoinIterator
  3. SplittableJoinIterator
  4. Logging
  5. AbstractGpuJoinIterator
  6. TaskAutoCloseableResource
  7. AutoCloseable
  8. Iterator
  9. TraversableOnce
  10. GenTraversableOnce
  11. AnyRef
  12. Any

Instance Constructors

  1. new HashJoinIterator(built: LazySpillableColumnarBatch, boundBuiltKeys: Seq[GpuExpression], buildStatsOpt: Option[JoinBuildSideStats], stream: Iterator[LazySpillableColumnarBatch], boundStreamKeys: Seq[GpuExpression], streamAttributes: Seq[Attribute], targetSize: Long, joinType: JoinType, buildSide: GpuBuildSide, compareNullsEqual: Boolean, opTime: GpuMetric, joinTime: GpuMetric)

Type Members

  1. class GroupedIterator[B >: A] extends AbstractIterator[Seq[B]] with Iterator[Seq[B]]
    Definition Classes
    Iterator

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. def ++[B >: ColumnarBatch](that: ⇒ GenTraversableOnce[B]): Iterator[B]
    Definition Classes
    Iterator
  4. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  5. def addString(b: StringBuilder): StringBuilder
    Definition Classes
    TraversableOnce
  6. def addString(b: StringBuilder, sep: String): StringBuilder
    Definition Classes
    TraversableOnce
  7. def addString(b: StringBuilder, start: String, sep: String, end: String): StringBuilder
    Definition Classes
    TraversableOnce
  8. def aggregate[B](z: ⇒ B)(seqop: (B, ColumnarBatch) ⇒ B, combop: (B, B) ⇒ B): B
    Definition Classes
    TraversableOnce → GenTraversableOnce
  9. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  10. val boundBuiltKeys: Seq[GpuExpression]
  11. val boundStreamKeys: Seq[GpuExpression]
  12. def buffered: BufferedIterator[ColumnarBatch]
    Definition Classes
    Iterator
  13. val buildSide: GpuBuildSide
  14. lazy val buildStats: JoinBuildSideStats
    Attributes
    protected
    Definition Classes
    BaseHashJoinIterator
  15. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  16. def close(): Unit
    Definition Classes
    SplittableJoinIterator → AbstractGpuJoinIterator → TaskAutoCloseableResource → AutoCloseable
  17. val closed: Boolean
    Attributes
    protected
    Definition Classes
    TaskAutoCloseableResource
  18. def collect[B](pf: PartialFunction[ColumnarBatch, B]): Iterator[B]
    Definition Classes
    Iterator
    Annotations
    @migration
    Migration

    (Changed in version 2.8.0) collect has changed. The previous behavior can be reproduced with toSeq.

  19. def collectFirst[B](pf: PartialFunction[ColumnarBatch, B]): Option[B]
    Definition Classes
    TraversableOnce
  20. val compareNullsEqual: Boolean
  21. def computeNumJoinRows(cb: LazySpillableColumnarBatch): Long
  22. def contains(elem: Any): Boolean
    Definition Classes
    Iterator
  23. def copyToArray[B >: ColumnarBatch](xs: Array[B], start: Int, len: Int): Unit
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  24. def copyToArray[B >: ColumnarBatch](xs: Array[B]): Unit
    Definition Classes
    TraversableOnce → GenTraversableOnce
  25. def copyToArray[B >: ColumnarBatch](xs: Array[B], start: Int): Unit
    Definition Classes
    TraversableOnce → GenTraversableOnce
  26. def copyToBuffer[B >: ColumnarBatch](dest: Buffer[B]): Unit
    Definition Classes
    TraversableOnce
  27. def corresponds[B](that: GenTraversableOnce[B])(p: (ColumnarBatch, B) ⇒ Boolean): Boolean
    Definition Classes
    Iterator
  28. def count(p: (ColumnarBatch) ⇒ Boolean): Int
    Definition Classes
    TraversableOnce → GenTraversableOnce
  29. def createGatherer(cb: LazySpillableColumnarBatch, numJoinRows: Option[Long]): Option[JoinGatherer]

    Create a join gatherer.

    cb

    next column batch from the streaming side of the join

    numJoinRows

    if present, the number of join output rows computed for this batch

    returns

    some gatherer to use next, or None if there is no next gatherer or the loop should try to build the gatherer again (e.g., to skip a degenerate join result batch)

    Definition Classes
    BaseHashJoinIterator → SplittableJoinIterator
  30. def drop(n: Int): Iterator[ColumnarBatch]
    Definition Classes
    Iterator
  31. def dropWhile(p: (ColumnarBatch) ⇒ Boolean): Iterator[ColumnarBatch]
    Definition Classes
    Iterator
  32. def duplicate: (Iterator[ColumnarBatch], Iterator[ColumnarBatch])
    Definition Classes
    Iterator
  33. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  34. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  35. def exists(p: (ColumnarBatch) ⇒ Boolean): Boolean
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  36. def filter(p: (ColumnarBatch) ⇒ Boolean): Iterator[ColumnarBatch]
    Definition Classes
    Iterator
  37. def filterNot(p: (ColumnarBatch) ⇒ Boolean): Iterator[ColumnarBatch]
    Definition Classes
    Iterator
  38. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  39. def find(p: (ColumnarBatch) ⇒ Boolean): Option[ColumnarBatch]
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  40. def flatMap[B](f: (ColumnarBatch) ⇒ GenTraversableOnce[B]): Iterator[B]
    Definition Classes
    Iterator
  41. def fold[A1 >: ColumnarBatch](z: A1)(op: (A1, A1) ⇒ A1): A1
    Definition Classes
    TraversableOnce → GenTraversableOnce
  42. def foldLeft[B](z: B)(op: (B, ColumnarBatch) ⇒ B): B
    Definition Classes
    TraversableOnce → GenTraversableOnce
  43. def foldRight[B](z: B)(op: (ColumnarBatch, B) ⇒ B): B
    Definition Classes
    TraversableOnce → GenTraversableOnce
  44. def forall(p: (ColumnarBatch) ⇒ Boolean): Boolean
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  45. def foreach[U](f: (ColumnarBatch) ⇒ U): Unit
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  46. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  47. def grouped[B >: ColumnarBatch](size: Int): GroupedIterator[B]
    Definition Classes
    Iterator
  48. def hasDefiniteSize: Boolean
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  49. def hasNext: Boolean
    Definition Classes
    AbstractGpuJoinIterator → Iterator
  50. def hasNextStreamBatch: Boolean

    Returns whether there are any more batches on the stream side of the join

    Definition Classes
    SplittableJoinIterator → AbstractGpuJoinIterator
  51. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  52. def indexOf[B >: ColumnarBatch](elem: B, from: Int): Int
    Definition Classes
    Iterator
  53. def indexOf[B >: ColumnarBatch](elem: B): Int
    Definition Classes
    Iterator
  54. def indexWhere(p: (ColumnarBatch) ⇒ Boolean, from: Int): Int
    Definition Classes
    Iterator
  55. def indexWhere(p: (ColumnarBatch) ⇒ Boolean): Int
    Definition Classes
    Iterator
  56. def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean): Boolean
    Attributes
    protected
    Definition Classes
    Logging
  57. def initializeLogIfNecessary(isInterpreter: Boolean): Unit
    Attributes
    protected
    Definition Classes
    Logging
  58. def isEmpty: Boolean
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  59. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  60. def isTraceEnabled(): Boolean
    Attributes
    protected
    Definition Classes
    Logging
  61. def isTraversableAgain: Boolean
    Definition Classes
    Iterator → GenTraversableOnce
  62. def joinGathererLeftRight(leftKeys: Table, leftData: LazySpillableColumnarBatch, rightKeys: Table, rightData: LazySpillableColumnarBatch): Option[JoinGatherer]

    Perform a hash join, returning a gatherer if there is a join result.

    leftKeys

    table of join keys from the left table

    leftData

    batch containing the full data from the left table

    rightKeys

    table of join keys from the right table

    rightData

    batch containing the full data from the right table

    returns

    join gatherer if there are join results

    Attributes
    protected
    Definition Classes
    HashJoinIterator → BaseHashJoinIterator
  63. val joinType: JoinType
  64. def length: Int
    Definition Classes
    Iterator
  65. def log: Logger
    Attributes
    protected
    Definition Classes
    Logging
  66. def logDebug(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  67. def logDebug(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  68. def logError(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  69. def logError(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  70. def logInfo(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  71. def logInfo(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  72. def logName: String
    Attributes
    protected
    Definition Classes
    Logging
  73. def logTrace(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  74. def logTrace(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  75. def logWarning(msg: ⇒ String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  76. def logWarning(msg: ⇒ String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  77. def makeGatherer(maps: Array[GatherMap], leftData: LazySpillableColumnarBatch, rightData: LazySpillableColumnarBatch, joinType: JoinType): Option[JoinGatherer]

    Create a join gatherer from gather maps.

    maps

    gather maps produced from a cudf join

    leftData

    batch corresponding to the left table in the join

    rightData

    batch corresponding to the right table in the join

    returns

    some gatherer, or None if there are no rows to gather in this join batch

    Attributes
    protected
    Definition Classes
    SplittableJoinIterator
  78. def map[B](f: (ColumnarBatch) ⇒ B): Iterator[B]
    Definition Classes
    Iterator
  79. def max[B >: ColumnarBatch](implicit cmp: Ordering[B]): ColumnarBatch
    Definition Classes
    TraversableOnce → GenTraversableOnce
  80. def maxBy[B](f: (ColumnarBatch) ⇒ B)(implicit cmp: Ordering[B]): ColumnarBatch
    Definition Classes
    TraversableOnce → GenTraversableOnce
  81. def min[B >: ColumnarBatch](implicit cmp: Ordering[B]): ColumnarBatch
    Definition Classes
    TraversableOnce → GenTraversableOnce
  82. def minBy[B](f: (ColumnarBatch) ⇒ B)(implicit cmp: Ordering[B]): ColumnarBatch
    Definition Classes
    TraversableOnce → GenTraversableOnce
  83. def mkString: String
    Definition Classes
    TraversableOnce → GenTraversableOnce
  84. def mkString(sep: String): String
    Definition Classes
    TraversableOnce → GenTraversableOnce
  85. def mkString(start: String, sep: String, end: String): String
    Definition Classes
    TraversableOnce → GenTraversableOnce
  86. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  87. def next(): ColumnarBatch
    Definition Classes
    AbstractGpuJoinIterator → Iterator
  88. def nonEmpty: Boolean
    Definition Classes
    TraversableOnce → GenTraversableOnce
  89. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  90. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  91. val opTime: GpuMetric
    Definition Classes
    AbstractGpuJoinIterator
  92. def padTo[A1 >: ColumnarBatch](len: Int, elem: A1): Iterator[A1]
    Definition Classes
    Iterator
  93. def partition(p: (ColumnarBatch) ⇒ Boolean): (Iterator[ColumnarBatch], Iterator[ColumnarBatch])
    Definition Classes
    Iterator
  94. def patch[B >: ColumnarBatch](from: Int, patchElems: Iterator[B], replaced: Int): Iterator[B]
    Definition Classes
    Iterator
  95. def product[B >: ColumnarBatch](implicit num: Numeric[B]): B
    Definition Classes
    TraversableOnce → GenTraversableOnce
  96. def reduce[A1 >: ColumnarBatch](op: (A1, A1) ⇒ A1): A1
    Definition Classes
    TraversableOnce → GenTraversableOnce
  97. def reduceLeft[B >: ColumnarBatch](op: (B, ColumnarBatch) ⇒ B): B
    Definition Classes
    TraversableOnce
  98. def reduceLeftOption[B >: ColumnarBatch](op: (B, ColumnarBatch) ⇒ B): Option[B]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  99. def reduceOption[A1 >: ColumnarBatch](op: (A1, A1) ⇒ A1): Option[A1]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  100. def reduceRight[B >: ColumnarBatch](op: (ColumnarBatch, B) ⇒ B): B
    Definition Classes
    TraversableOnce → GenTraversableOnce
  101. def reduceRightOption[B >: ColumnarBatch](op: (ColumnarBatch, B) ⇒ B): Option[B]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  102. def reversed: List[ColumnarBatch]
    Attributes
    protected[this]
    Definition Classes
    TraversableOnce
  103. def sameElements(that: Iterator[_]): Boolean
    Definition Classes
    Iterator
  104. def scanLeft[B](z: B)(op: (B, ColumnarBatch) ⇒ B): Iterator[B]
    Definition Classes
    Iterator
  105. def scanRight[B](z: B)(op: (ColumnarBatch, B) ⇒ B): Iterator[B]
    Definition Classes
    Iterator
  106. def seq: Iterator[ColumnarBatch]
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  107. def setupNextGatherer(): Option[JoinGatherer]

    Called to set up the next join gatherer instance when the previous instance is done or there is no previous instance. Because this is likely to call next or hasNext on the stream side, all implementations must track their own opTime metrics.

    returns

    some gatherer to use next, or None if there is no next gatherer or the loop should try to build the gatherer again (e.g., to skip a degenerate join result batch)

    Definition Classes
    SplittableJoinIterator → AbstractGpuJoinIterator
  108. val shouldAutoCloseOnExhaust: Boolean

    Whether to automatically call close() on this iterator when it is exhausted.

    Attributes
    protected
    Definition Classes
    AbstractGpuJoinIterator
  109. def size: Int
    Definition Classes
    TraversableOnce → GenTraversableOnce
  110. def sizeHintIfCheap: Int
    Attributes
    protected[collection]
    Definition Classes
    GenTraversableOnce
  111. def slice(from: Int, until: Int): Iterator[ColumnarBatch]
    Definition Classes
    Iterator
  112. def sliceIterator(from: Int, until: Int): Iterator[ColumnarBatch]
    Attributes
    protected
    Definition Classes
    Iterator
  113. def sliding[B >: ColumnarBatch](size: Int, step: Int): GroupedIterator[B]
    Definition Classes
    Iterator
  114. def span(p: (ColumnarBatch) ⇒ Boolean): (Iterator[ColumnarBatch], Iterator[ColumnarBatch])
    Definition Classes
    Iterator
  115. def splitAndSave(cb: ColumnarBatch, numBatches: Int, oom: Option[Throwable] = None): Unit

    Split a stream-side input batch, making all splits spillable, and replacing this batch with the splits in the stream-side input

    cb

    stream-side input batch to split

    numBatches

    number of splits to produce with approximately the same number of rows each

    oom

    a prior OOM exception that this will try to recover from by splitting

    Attributes
    protected
    Definition Classes
    SplittableJoinIterator
  116. val streamAttributes: Seq[Attribute]
  117. def sum[B >: ColumnarBatch](implicit num: Numeric[B]): B
    Definition Classes
    TraversableOnce → GenTraversableOnce
  118. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  119. def take(n: Int): Iterator[ColumnarBatch]
    Definition Classes
    Iterator
  120. def takeWhile(p: (ColumnarBatch) ⇒ Boolean): Iterator[ColumnarBatch]
    Definition Classes
    Iterator
  121. val targetSize: Long
  122. def to[Col[_]](implicit cbf: CanBuildFrom[Nothing, ColumnarBatch, Col[ColumnarBatch]]): Col[ColumnarBatch]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  123. def toArray[B >: ColumnarBatch](implicit arg0: ClassTag[B]): Array[B]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  124. def toBuffer[B >: ColumnarBatch]: Buffer[B]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  125. def toIndexedSeq: IndexedSeq[ColumnarBatch]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  126. def toIterable: Iterable[ColumnarBatch]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  127. def toIterator: Iterator[ColumnarBatch]
    Definition Classes
    Iterator → GenTraversableOnce
  128. def toList: List[ColumnarBatch]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  129. def toMap[T, U](implicit ev: <:<[ColumnarBatch, (T, U)]): Map[T, U]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  130. def toSeq: Seq[ColumnarBatch]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  131. def toSet[B >: ColumnarBatch]: Set[B]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  132. def toStream: Stream[ColumnarBatch]
    Definition Classes
    Iterator → GenTraversableOnce
  133. def toString(): String
    Definition Classes
    Iterator → AnyRef → Any
  134. def toTraversable: Traversable[ColumnarBatch]
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  135. def toVector: Vector[ColumnarBatch]
    Definition Classes
    TraversableOnce → GenTraversableOnce
  136. def use[T <: AutoCloseable](ac: T): T
    Definition Classes
    TaskAutoCloseableResource
  137. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  138. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  139. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  140. def withFilter(p: (ColumnarBatch) ⇒ Boolean): Iterator[ColumnarBatch]
    Definition Classes
    Iterator
  141. def zip[B](that: Iterator[B]): Iterator[(ColumnarBatch, B)]
    Definition Classes
    Iterator
  142. def zipAll[B, A1 >: ColumnarBatch, B1 >: B](that: Iterator[B], thisElem: A1, thatElem: B1): Iterator[(A1, B1)]
    Definition Classes
    Iterator
  143. def zipWithIndex: Iterator[(ColumnarBatch, Int)]
    Definition Classes
    Iterator
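
The entries for joinGathererLeftRight and makeGatherer above refer to gather maps and to returning None when there are no rows to gather. The sketch below shows that idea on plain Scala collections, as an assumption-laden illustration rather than the plugin's implementation: `GatherMapSketch`, `innerJoinMaps`, and `gather` are hypothetical names, keys are plain `Int` values, and only an inner join is covered. The actual `GatherMap` and `JoinGatherer` types operate on GPU data and are not reproduced here.

```scala
object GatherMapSketch {
  /** Inner-join gather maps: parallel vectors of matching left and right row indices. */
  def innerJoinMaps(leftKeys: Vector[Int], rightKeys: Vector[Int]): (Vector[Int], Vector[Int]) = {
    // Map each right-side key to the row indices where it occurs.
    val rightIndex: Map[Int, Vector[Int]] =
      rightKeys.zipWithIndex.groupBy(_._1).map { case (k, hits) => k -> hits.map(_._2) }

    // Emit one (leftRow, rightRow) pair per match, in left-row order.
    val pairs = for {
      (key, l) <- leftKeys.zipWithIndex
      r <- rightIndex.getOrElse(key, Vector.empty)
    } yield (l, r)
    pairs.unzip
  }

  /** Apply the maps to the full data; None mirrors "no rows to gather in this join batch". */
  def gather[A, B](
      maps: (Vector[Int], Vector[Int]),
      leftData: Vector[A],
      rightData: Vector[B]): Option[Vector[(A, B)]] = {
    val (leftMap, rightMap) = maps
    if (leftMap.isEmpty) None
    else Some(leftMap.zip(rightMap).map { case (l, r) => (leftData(l), rightData(r)) })
  }
}
```

Separating the key comparison (which yields only row indices) from the gathering of full rows means the potentially large data columns are touched only for rows that actually appear in the join output.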

Deprecated Value Members

  1. def /:[B](z: B)(op: (B, ColumnarBatch) ⇒ B): B
    Definition Classes
    TraversableOnce → GenTraversableOnce
    Annotations
    @deprecated
    Deprecated

    (Since version 2.12.10) Use foldLeft instead of /:

  2. def :\[B](z: B)(op: (ColumnarBatch, B) ⇒ B): B
    Definition Classes
    TraversableOnce → GenTraversableOnce
    Annotations
    @deprecated
    Deprecated

    (Since version 2.12.10) Use foldRight instead of :\
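
The deprecation notes above point to foldLeft and foldRight as replacements. A minimal sketch of the migration follows, using an `Iterator[Int]` in place of `Iterator[ColumnarBatch]` (an assumption made so the example runs without the plugin); `FoldSketch`, `totalRows`, and `describe` are hypothetical names.

```scala
object FoldSketch {
  // Plain integers stand in for per-batch row counts (illustrative assumption).
  def totalRows(rowCounts: Iterator[Int]): Long =
    rowCounts.foldLeft(0L)((acc, n) => acc + n) // replaces (0L /: rowCounts)(_ + _)

  def describe(rowCounts: Iterator[Int]): String =
    rowCounts.foldRight("end")((n, acc) => s"$n -> $acc") // replaces (rowCounts :\ "end")(...)
}
```

Both calls consume the iterator, as any fold over an `Iterator` does.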
