package files
- Alphabetic
- Public
- Protected
Type Members
- class CdcAddFileIndex extends TahoeBatchFileIndex
A TahoeFileIndex for scanning a sequence of added files as CDC.
A TahoeFileIndex for scanning a sequence of added files as CDC. Similar to TahoeBatchFileIndex, with a bit of special handling to attach the log version and CDC type on a per-file basis.
- class DelayedCommitProtocol extends FileCommitProtocol with Serializable with Logging
Writes out the files to
pathand returns a list of them inaddedStatuses.Writes out the files to
pathand returns a list of them inaddedStatuses. Includes special handling for partitioning on CDC_PARTITION_COL for compatibility between enabled and disabled CDC; partitions with a value of false in this column produce no corresponding partitioning directory. - class DeltaSourceSnapshot extends SnapshotIterator with StateCache
Converts a
Snapshotinto the initial set of files read when starting a new streaming query.Converts a
Snapshotinto the initial set of files read when starting a new streaming query. The list of files that represent the table at the time the query starts are selected by: - Addingversionandindexto each file to enable splitting of the initial state into multiple batches. - Filtering files that don't match partition predicates, while preserving the aforementioned indexing. - case class PinnedTahoeFileIndex(spark: SparkSession, deltaLog: DeltaLog, path: Path, snapshot: Snapshot) extends TahoeFileIndex with Product with Serializable
A TahoeFileIndex that generates the list of files from the given Snapshot.
- trait SQLMetricsReporting extends AnyRef
This trait is used to register SQL metrics for a Delta Operation.
This trait is used to register SQL metrics for a Delta Operation. Registering will allow the metrics to be instrumented via the CommitInfo and is accessible via DescribeHistory
- trait SnapshotIterator extends AnyRef
- class TahoeBatchFileIndex extends TahoeFileIndex
A TahoeFileIndex that generates the list of files from a given list of files that are within a version range of DeltaLog.
- class TahoeChangeFileIndex extends TahoeFileIndex
A TahoeFileIndex for scanning a sequence of CDC files.
A TahoeFileIndex for scanning a sequence of CDC files. Similar to TahoeBatchFileIndex, the equivalent for reading AddFile actions.
- abstract class TahoeFileIndex extends FileIndex
A FileIndex that generates the list of files managed by the Tahoe protocol.
- case class TahoeLogFileIndex(spark: SparkSession, deltaLog: DeltaLog, path: Path, snapshotAtAnalysis: Snapshot, partitionFilters: Seq[Expression] = Nil, isTimeTravelQuery: Boolean = false) extends TahoeFileIndex with Product with Serializable
A TahoeFileIndex that generates the list of files from DeltaLog with given partition filters.
- class TahoeRemoveFileIndex extends TahoeFileIndex
A TahoeFileIndex for scanning a sequence of removed files as CDC.
A TahoeFileIndex for scanning a sequence of removed files as CDC. Similar to TahoeBatchFileIndex, the equivalent for reading AddFile actions.
- trait TransactionalWrite extends DeltaLogging
Adds the ability to write files out as part of a transaction.
Adds the ability to write files out as part of a transaction. Checks are performed to ensure that the data being written matches either the current metadata or the new metadata being set by this transaction.
Value Members
- object TahoeLogFileIndex extends Serializable