Q&A
Brooklyn Jeffries @jeffriesbrooklyn   14, Jun 2023 12:00 AM
Maximum size of a string data type in Hive
What is the maximum size of a string data type supported by Hive?
answers 2
 
Answer 1
Michaela Samuel @samuelmichaela5   21, Jun 2023 11:42 AM
Beyond the default text file format, Hive supports these binary file formats:
Sequence file: A splittable, compressible, row-oriented file in a general-purpose binary format.
ORC file: The Optimized Row Columnar (ORC) format is a column-oriented storage format. It divides the table into groups of rows (stripes), and within each stripe the values are stored column by column: all values of the first column, then all values of the second column, and so on.
Avro data file: Like a sequence file, it is splittable, compressible, and row-oriented, but it additionally supports schema evolution and bindings for multiple languages.
Parquet file: A column-oriented format. Rather than only storing the rows of data adjacent to one another, Parquet splits the data horizontally into row groups and, within each group, stores the values of each column adjacent to each other, so the dataset is partitioned both horizontally and vertically.
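To make the row-oriented versus column-oriented distinction concrete, here is a minimal Python sketch. It does not use any Hive, ORC, or Parquet API; the sample records and the helper names row_layout and column_layout are hypothetical and only show the order in which values end up next to each other.

```python
# Hypothetical sample table: three records with (id, name, score).
rows = [
    (1, "alice", 9.5),
    (2, "bob", 7.0),
    (3, "carol", 8.25),
]


def row_layout(records):
    """Row-oriented: all fields of one record are stored next to each other."""
    return [value for record in records for value in record]


def column_layout(records):
    """Column-oriented: all values of one column are stored next to each other."""
    return [value for column in zip(*records) for value in column]


print(row_layout(rows))
# [1, 'alice', 9.5, 2, 'bob', 7.0, 3, 'carol', 8.25]
print(column_layout(rows))
# [1, 2, 3, 'alice', 'bob', 'carol', 9.5, 7.0, 8.25]
```

A query that reads only one column can skip most of the data in the second layout, which is the usual reason to pick a columnar format for analytics.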

 
Answer 2
Michaela Samuel @samuelmichaela5   21, Jun 2023 11:41 AM
The maximum size of a value of the string data type in Hive is 2 GB. Hive uses the text file format by default, and it also supports several binary formats: sequence files, ORC files, Avro data files, and Parquet files.
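As a rough sketch of where a 2 GB figure can come from: the answer does not explain it, so the following assumes the limit reflects a byte buffer indexed by a signed 32-bit integer, as Java arrays are. The names MAX_INT32 and fits_in_2gb are hypothetical and are not part of Hive.

```python
# Assumption: the cap corresponds to the largest signed 32-bit integer,
# which is the upper bound on the length of a Java byte array.
MAX_INT32 = 2**31 - 1


def fits_in_2gb(text):
    """Return True if the UTF-8 encoding of text is at most MAX_INT32 bytes."""
    return len(text.encode("utf-8")) <= MAX_INT32


print(MAX_INT32)                      # 2147483647
print(round(MAX_INT32 / 2**30, 2))    # 2.0 (size in GiB, rounded)
print(fits_in_2gb("hello, hive"))     # True
```

Note that the check counts encoded bytes, not characters, so a string with many multi-byte characters reaches the limit sooner.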