When writing data, Hudi generates a Hudi table based on attributes such as the storage path, table name, and partition structure.
Hudi table data files can be stored in the OS file system or distributed file system such as HDFS. To ensure analysis performance and data reliability, HDFS is generally used for storage. Using HDFS as an example, Hudi table storage files are classified into two types.
Hudi data files of are stored in Parquet base files and Avro log files.