Skip to main content

parquet2csv

Synopsis

starlake parquet2csv [options]

Description

Convert Parquet files to CSV format for inspection or downstream processing.

Parameters

ParameterCardinalityDescription
--input_dir <value>RequiredFull Path to input directory
--output_dir <value>OptionalFull Path to output directory, if not specified, input_dir is used as output dir
--domain <value>OptionalDomain name to convert. All schemas in this domain are converted. If not specified, all schemas of all domains are converted
--schema <value>OptionalSchema name to convert. If not specified, all schemas are converted.
--delete_source <value>OptionalShould we delete source parquet files after conversion ?
--write_mode <value>OptionalOne of Set(OVERWRITE, APPEND)
--options k1=v1,k2=v2...OptionalAny Spark option to use (sep, delimiter, quote, quoteAll, escape, header ...)
--partitions <value>OptionalHow many output partitions