schema

Syntax

schema(table|dbHandle)

Arguments

table | dbHandle can be a table object or a database handle.

Details

Display information about the schema of a table or a database. The function returns an unordered dictionary containing the following information (in alphabetical order):

  • atomic: the level at which the atomicity is guaranteed for a write transaction. It can be TRANS or CHUNK.
  • chunkGranularity: the chunk granularity which determines whether to allow concurrent writes to different tables in the same chunk. It can be TABLE or DATABASE.
  • clusterReplicationEnabled: whether asynchronous replication has been enabled.
  • colDefs: information about each column in the table:
    • name: column name
    • typeString: column type
    • typeInt: the type ID
    • extra: the scale of a DECIMAL value
    • comment: comment on the column
  • compressMethods: the compression methods used for specified columns of a DFS table.
    • name: Column names
    • compressMethods: Compression algorithm applied, including "lz4", "delta", or "zstd".
  • databaseDir: the directory where the database is stored.
  • databaseOwner: the database creator.
  • engineType: the storage engine type. It can be OLAP or TSDB.
  • keepDuplicates: how to deal with records with duplicate sort key values.
  • partitionColumnIndex: the index that indicates the positions of partitioning columns. It returns -1 for a dimension table.
  • partitionColumnName: the partitioning column names.
  • partitionColumnType: the data type ID (which can be checked at Data Types and Data Forms) of the partitioning column.
  • partitionSchema: how the partitions are organized
  • partitionSites (optional): If the parameter locations is specified for function database, it displays the ip:port information of the specified node.
  • partitionTypeName / partitionType: the partitioning scheme and the corresponding ID, including VALUE(1), RANGE(2), LIST(3), COMPO(4), HASH(5).
  • sortColumns: the sort columns for a table.
  • softDelete: whether soft deletion of the table has been enabled. This field will only be returned for tables in TSDB databases. Return true if soft deletion has been enabled, false otherwise.
  • sortKeyMappingFunction: the mapping functions applied to sort keys.
  • tableComment: the comment for DFS tables.
  • tableOwner: the table creator.
  • partitionFunction: the function applied to a column for data partitioning. It is a STRING vector with each element as a function signature. If an element is "asis", meaning no function is applied to that column. For example, partitionFunction->["myPartitionFunc{, 6, 8}","asis"].

Examples

OLAP database:

n=1000000
ID=rand(10, n)
x=rand(1.0, n)
t=table(ID, x)
db=database("dfs://rangedb101", RANGE, 0 5 10)
pt = db.createPartitionedTable(t, `pt, `ID)
pt.append!(t)
pt=loadTable(db,`pt);

schema(db);
/* output:
databaseDir->dfs://rangedb101
partitionSchema->[0,5,10]
partitionSites->
atomic->TRANS
chunkGranularity->TABLE
partitionType->2
partitionTypeName->RANGE
partitionColumnType->4
clusterReplicationEnabled->1
databaseOwner->admin
*/

OLAP table:

schema(pt);
/* output:
chunkGranularity->TABLE
tableOwner->admin
compressMethods->
name compressMethods
---- ---------------
ID   lz4            
x    lz4            

colDefs->
name typeString typeInt comment
---- ---------- ------- -------
ID   INT        4              
x    DOUBLE     16             

chunkPath->
partitionColumnIndex->0
partitionColumnName->ID
partitionColumnType->4
partitionType->2
partitionTypeName->RANGE
partitionSchema->[0,5,10]
partitionSites->
*/

TSDB database:

n = 10000
SecurityID = rand(`st0001`st0002`st0003`st0004`st0005, n)
sym = rand(`A`B, n)
TradeDate = 2022.01.01 + rand(100,n)
TotalVolumeTrade = rand(1000..3000, n)
TotalValueTrade = rand(100.0, n)
schemaTable_snap = table(SecurityID, TradeDate, TotalVolumeTrade, TotalValueTrade).sortBy!(`SecurityID`TradeDate)

dbPath = "dfs://TSDB_STOCK"
if (existsDatabase(dbPath)){dropDatabase(dbPath)}
db_snap = database(dbPath, VALUE, 2022.01.01..2022.01.05, engine='TSDB')

schema(db_snap)
/* output:
databaseDir->dfs://TSDB_STOCK
partitionSchema->[2022.01.01,2022.01.02,2022.01.03,2022.01.04,2022.01.05]
partitionSites->
engineType->TSDB
atomic->TRANS
chunkGranularity->TABLE
partitionType->1
partitionTypeName->VALUE
partitionColumnType->6
clusterReplicationEnabled->1
databaseOwner->admin
*/

TSDB table:

def myHashFunc(x){
  return hashBucket(x, 10)
}
snap=createPartitionedTable(dbHandle=db_snap, table=schemaTable_snap, tableName="snap", partitionColumns=`TradeDate, sortColumns=`SecurityID`TradeDate, keepDuplicates=ALL, sortKeyMappingFunction=[myHashFunc])

schema(snap)
/* output:
engineType->TSDB
keepDuplicates->ALL
chunkGranularity->TABLE
sortColumns->["SecurityID","TradeDate"]
sortKeyMappingFunction->["def myHashFunc(x){
  return hashBucket(x, 10)
}"]
softDelete->0
tableOwner->admin
compressMethods->
name             compressMethods
---------------- ---------------
SecurityID       lz4            
TradeDate        lz4            
TotalVolumeTrade lz4            
TotalValueTrade  lz4            

colDefs->
name             typeString typeInt extra comment
---------------- ---------- ------- ----- -------
SecurityID       SYMBOL     17                   
TradeDate        DATE       6                    
TotalVolumeTrade INT        4                    
TotalValueTrade  DOUBLE     16                   

chunkPath->
partitionColumnIndex->1
partitionColumnName->TradeDate
partitionColumnType->6
partitionType->1
partitionTypeName->VALUE
partitionSchema->[2022.01.01,2022.01.02,2022.01.03,2022.01.04,2022.01.05]
partitionSites->
*/