Creates a table in the physical storage layer.
table metadata
Scans all of the specified tables in a single Spark job and infers stats about each region of the listed tables.
if true, then hot regions will be included in the scan. Defaults to true. Setting it to false is useful when reading production data from dev environments: compactions happen outside office hours, so excluding hot regions helps to avoid reading data in an inconsistent state.
Generic function that generates sequential region ids, left-padded with zeros to 20 characters.
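A minimal sketch of such a generator, to illustrate the padding described above. The names `RegionIds`, `padRegionId` and `sequentialIds` are hypothetical and not taken from the library; the real function may have a different signature. One consequence of fixed-width padding is that the ids sort lexicographically in the same order as their numeric values.

```scala
object RegionIds {
  // Hypothetical helper: renders a non-negative sequence number as a
  // 20-character string, padded on the left with zeros.
  def padRegionId(n: Long): String = {
    require(n >= 0, "region sequence number must be non-negative")
    val digits = n.toString
    // A non-negative Long has at most 19 digits, so the pad length is always positive.
    ("0" * (20 - digits.length)) + digits
  }

  // Hypothetical helper: returns `count` sequential ids starting at `start`.
  def sequentialIds(start: Long, count: Int): Seq[String] =
    (0 until count).map(i => padRegionId(start + i))
}
```

For example, `RegionIds.sequentialIds(9, 2)` yields two 20-character ids, and the id for 9 sorts before the id for 10 when compared as strings, which would not hold for the unpadded forms "9" and "10".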
Reads the state of multiple Audit Tables. It scans the region state of all specified tables in one go.
object that actually interacts with the physical storage
parent folder which contains folders with table names
list of tables to open
include hot regions in the table
function that generates region ids
(Map[TABLE NAME, AuditTable], Seq[MISSING TABLES]) - audit table objects for the tables that exist, and the names of the tables that were not found under the basePath
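The shape of the return value above can be sketched as follows. This is only an illustration of splitting the requested names into found and missing: `OpenTables`, `TableStub` and `splitFoundAndMissing` are hypothetical names, `TableStub` stands in for the real audit table type, and the in-memory `existing` map stands in for whatever the storage layer finds under the basePath.

```scala
object OpenTables {
  // Hypothetical stand-in for the real audit table type.
  final case class TableStub(name: String)

  // Splits the requested table names into the tables that exist and the
  // names that have no match, mirroring the (Map, Seq) result shape.
  def splitFoundAndMissing(
      requested: Seq[String],
      existing: Map[String, TableStub]
  ): (Map[String, TableStub], Seq[String]) = {
    val (found, missing) = requested.partition(existing.contains)
    (found.map(name => name -> existing(name)).toMap, missing)
  }
}
```

Returning the missing names alongside the opened tables lets the caller decide whether a missing table is an error or a table that still needs to be created.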
Creates a copy of the table with a new list of regions.
- Audit table with old regions
- Complete set of current regions
- Optional list of regions that have been appended. If given, only the new regions are written to the cache. If None, the current cached region information is completely rewritten. If any regions have been deleted, this should be None.
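The append-versus-rewrite behaviour described above can be modelled with a small in-memory sketch. It is not the library's implementation: `RegionCache`, `Region` and `updateCache` are hypothetical names, and a plain `Seq` stands in for the persisted region cache.

```scala
object RegionCache {
  // Hypothetical, simplified region record.
  final case class Region(id: String, count: Long)

  // Returns the new cache content.
  //   appended = Some(rs): only the new regions are added to what is cached.
  //   appended = None:     the cache is rebuilt from the complete region set,
  //                        which is the only safe option after a region delete.
  def updateCache(
      cached: Seq[Region],
      allRegions: Seq[Region],
      appended: Option[Seq[Region]]
  ): Seq[Region] =
    appended match {
      case Some(newRegions) => cached ++ newRegions
      case None             => allRegions
    }
}
```

The sketch also shows why deletes require None: appending can only grow the cached list, so a deleted region would stay in the cache unless the whole list is rewritten.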