schema
Syntax
schema(table|dbHandle)
Arguments
table | dbHandle can be a table object or a database handle.
Details
Display information about the schema of a table or a database. The function returns an unordered dictionary containing the following information (in alphabetical order):
- atomic: the level at which the atomicity is guaranteed for a write transaction. It can be TRANS or CHUNK.
- chunkGranularity: the chunk granularity which determines whether to allow concurrent writes to different tables in the same chunk. It can be TABLE or DATABASE.
- clusterReplicationEnabled: whether asynchronous replication has been enabled.
- colDefs: information about each column in the table:
- name: column name
- typeString: column type
- typeInt: the type ID
- extra: the scale of a DECIMAL value
- comment: comment on the column
- compressMethods: the compression methods used for specified columns of a DFS
table.
- name: Column names
- compressMethods: Compression algorithm applied, including "lz4", "delta", or "zstd".
- databaseDir: the directory where the database is stored.
- databaseOwner: the database creator.
- engineType: the storage engine type. It can be OLAP or TSDB.
- keepDuplicates: how to deal with records with duplicate sort key values.
- partitionColumnIndex: the index that indicates the positions of partitioning columns. It returns -1 for a dimension table.
- partitionColumnName: the partitioning column names.
- partitionColumnType: the data type ID (which can be checked at Data Types and Data Forms) of the partitioning column.
- partitionSchema: how the partitions are organized
- partitionSites (optional): If the parameter locations is specified for function
database
, it displays the ip:port information of the specified node. - partitionTypeName / partitionType: the partitioning scheme and the corresponding ID, including VALUE(1), RANGE(2), LIST(3), COMPO(4), HASH(5).
- sortColumns: the sort columns for a table.
- softDelete: whether soft deletion of the table has been enabled. This field will only be returned for tables in TSDB databases. Return true if soft deletion has been enabled, false otherwise.
- sortKeyMappingFunction: the mapping functions applied to sort keys.
- tableComment: the comment for DFS tables.
- tableOwner: the table creator.
- partitionFunction: the function applied to a column for data
partitioning. It is a STRING vector with each element as a function signature.
If an element is "asis", meaning no function is applied to that column. For
example,
partitionFunction->["myPartitionFunc{, 6, 8}","asis"]
.
Examples
OLAP database:
n=1000000
ID=rand(10, n)
x=rand(1.0, n)
t=table(ID, x)
db=database("dfs://rangedb101", RANGE, 0 5 10)
pt = db.createPartitionedTable(t, `pt, `ID)
pt.append!(t)
pt=loadTable(db,`pt);
schema(db);
/* output:
databaseDir->dfs://rangedb101
partitionSchema->[0,5,10]
partitionSites->
atomic->TRANS
chunkGranularity->TABLE
partitionType->2
partitionTypeName->RANGE
partitionColumnType->4
clusterReplicationEnabled->1
databaseOwner->admin
*/
OLAP table:
schema(pt);
/* output:
chunkGranularity->TABLE
tableOwner->admin
compressMethods->
name compressMethods
---- ---------------
ID lz4
x lz4
colDefs->
name typeString typeInt comment
---- ---------- ------- -------
ID INT 4
x DOUBLE 16
chunkPath->
partitionColumnIndex->0
partitionColumnName->ID
partitionColumnType->4
partitionType->2
partitionTypeName->RANGE
partitionSchema->[0,5,10]
partitionSites->
*/
TSDB database:
n = 10000
SecurityID = rand(`st0001`st0002`st0003`st0004`st0005, n)
sym = rand(`A`B, n)
TradeDate = 2022.01.01 + rand(100,n)
TotalVolumeTrade = rand(1000..3000, n)
TotalValueTrade = rand(100.0, n)
schemaTable_snap = table(SecurityID, TradeDate, TotalVolumeTrade, TotalValueTrade).sortBy!(`SecurityID`TradeDate)
dbPath = "dfs://TSDB_STOCK"
if (existsDatabase(dbPath)){dropDatabase(dbPath)}
db_snap = database(dbPath, VALUE, 2022.01.01..2022.01.05, engine='TSDB')
schema(db_snap)
/* output:
databaseDir->dfs://TSDB_STOCK
partitionSchema->[2022.01.01,2022.01.02,2022.01.03,2022.01.04,2022.01.05]
partitionSites->
engineType->TSDB
atomic->TRANS
chunkGranularity->TABLE
partitionType->1
partitionTypeName->VALUE
partitionColumnType->6
clusterReplicationEnabled->1
databaseOwner->admin
*/
TSDB table:
def myHashFunc(x){
return hashBucket(x, 10)
}
snap=createPartitionedTable(dbHandle=db_snap, table=schemaTable_snap, tableName="snap", partitionColumns=`TradeDate, sortColumns=`SecurityID`TradeDate, keepDuplicates=ALL, sortKeyMappingFunction=[myHashFunc])
schema(snap)
/* output:
engineType->TSDB
keepDuplicates->ALL
chunkGranularity->TABLE
sortColumns->["SecurityID","TradeDate"]
sortKeyMappingFunction->["def myHashFunc(x){
return hashBucket(x, 10)
}"]
softDelete->0
tableOwner->admin
compressMethods->
name compressMethods
---------------- ---------------
SecurityID lz4
TradeDate lz4
TotalVolumeTrade lz4
TotalValueTrade lz4
colDefs->
name typeString typeInt extra comment
---------------- ---------- ------- ----- -------
SecurityID SYMBOL 17
TradeDate DATE 6
TotalVolumeTrade INT 4
TotalValueTrade DOUBLE 16
chunkPath->
partitionColumnIndex->1
partitionColumnName->TradeDate
partitionColumnType->6
partitionType->1
partitionTypeName->VALUE
partitionSchema->[2022.01.01,2022.01.02,2022.01.03,2022.01.04,2022.01.05]
partitionSites->
*/