When HetuEngine accesses a Hive data source, it queries the Hive metastore to obtain metadata. HetuEngine provides a metadata cache function. The first time a database or table of the Hive data source is accessed, its metadata (database name, table name, table fields, partition information, and permission information) is cached, so the Hive metastore does not need to be accessed again during subsequent access. If the table data of the Hive data source does not change frequently, the cache can improve query performance to some extent.
Parameter | Description | Default Value | Parameter File |
---|---|---|---|
hive.metastore-cache-ttl | Cache duration of the metadata of the co-deployed Hive data source. | 0s | hive.properties |
hive.metastore-cache-maximum-size | Maximum cache size of the metadata of the co-deployed Hive data source. | 10000 | hive.properties |
hive.metastore-refresh-interval | Interval for refreshing the metadata of the co-deployed Hive data source. | 1s | hive.properties |
hive.per-transaction-metastore-cache-maximum-size | Maximum cache size of the metadata for each transaction of the co-deployed Hive data source. | 1000 | hive.properties |
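
As a rough illustration of how these parameters might be combined, the fragment below sketches a `hive.properties` configuration for a Hive data source whose tables change infrequently. The `600s` and `60s` values are assumptions chosen only for this example, not recommended settings; the two size parameters are left at the defaults listed in the table. The sketch also assumes that a nonzero `hive.metastore-cache-ttl` is needed for cached metadata to be retained, as the `0s` default suggests.

```properties
# Illustrative values only; tune them to how often your Hive metadata changes.

# Keep cached metadata for 600 seconds (assumed value; the default is 0s).
hive.metastore-cache-ttl=600s

# Maximum cache size of the metadata (default value).
hive.metastore-cache-maximum-size=10000

# Refresh cached metadata every 60 seconds (assumed value; the default is 1s).
hive.metastore-refresh-interval=60s

# Maximum cache size of the metadata for each transaction (default value).
hive.per-transaction-metastore-cache-maximum-size=1000
```

A longer cache duration reduces metastore accesses but increases the time during which queries may see stale metadata, so shorter values are likely safer for tables whose schema, partitions, or permissions change often.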