trait JoinGatherer extends LazySpillable
Generic trait for all join gather instances. A JoinGatherer takes the gather maps that are the result of a cudf join call along with the data batches that need to be gathered and allow someone to materialize the join in batches. It also provides APIs to help decide on how many rows to gather.
This is a LazySpillable instance so the life cycle follows that too.
- Alphabetic
- By Inheritance
- JoinGatherer
- LazySpillable
- Retryable
- AutoCloseable
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Abstract Value Members
-
abstract
def
allowSpilling(): Unit
Indicate that we are done using the data for now and it can be spilled.
Indicate that we are done using the data for now and it can be spilled.
This method should not have issues with being called multiple times without the data being accessed.
- Definition Classes
- LazySpillable
-
abstract
def
checkpoint(): Unit
- Definition Classes
- Retryable
-
abstract
def
close(): Unit
- Definition Classes
- AutoCloseable
- Annotations
- @throws( classOf[java.lang.Exception] )
-
abstract
def
gatherNext(n: Int): ColumnarBatch
Gather the next n rows from the join gather maps.
Gather the next n rows from the join gather maps.
- n
how many rows to gather
- returns
the gathered data as a ColumnarBatch
-
abstract
def
getBitSizeMap(n: Int): ColumnView
Get the bit count size map for the next n rows to be gathered.
Get the bit count size map for the next n rows to be gathered. It returns a column of INT64 values. One for each of the next n rows requested. This is a bit count to deal with validity bits, etc. This is an INT64 to allow a prefix sum (running total) to be done on it without overflowing so we can compute an accurate cuttoff point for a batch size limit.
-
abstract
def
getFixedWidthBitSize: Option[Int]
If the data is all fixed width return the size of each row, otherwise return None.
-
abstract
def
isDone: Boolean
Is all of the data gathered so far.
-
abstract
def
numRowsLeft: Long
Number of rows left to gather
-
abstract
def
realCheapPerRowSizeEstimate: Double
A really fast and dirty way to estimate the size of each row in the join output measured as in bytes.
-
abstract
def
restore(): Unit
- Definition Classes
- Retryable
Concrete Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
def
equals(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
def
finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( classOf[java.lang.Throwable] )
-
def
gatherRowEstimate(targetSize: Long): Int
Do a complete/expensive job to get the number of rows that can be gathered to get close to the targetSize for the final output.
Do a complete/expensive job to get the number of rows that can be gathered to get close to the targetSize for the final output.
- targetSize
The target size in bytes for the final output batch.
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
def
hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
def
toString(): String
- Definition Classes
- AnyRef → Any
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()