Schema([('name', String),
        ('manufacturer', String),
        ('calories', Int64),
        ('rating', Float64)])
Get schema of parquet file without reading data in polars
read_parquet_schema
100DaysOfPolars
You have a large Parquet dataset and want to determine its columns and their data types. However, you do not want to read the entire dataset just to retrieve this information. The path to the Parquet file is shown below.
data = 'file_data.parquet'
Get schema on read
To get the schema of the file without reading the dataset contents, use the Polars function read_parquet_schema, like this:
pl.read_parquet_schema(data)