Spark Partition Discovery

event 2021-12-22 visibility 541 comment 0

more_vert

Spark supports partition discovery. All built in file sources (Text/CSV/JSON/ORC/Parquet) supports partition discovery and partition information inference.

This data shows a example data set that is stored by two partition levels: month and country.

The following code snippet will read all the underlying parquet files:

df = spark.read.option("basePath","/data").parquet("/data")

info Last modified by Raymond 4 years ago copyright This page is subject to Site terms.

comment Comments

No comments yet.

Please log in or register to comment.

account_circle Log in person_add Register

Log in with external accounts

tag Tags

spark

info Info

Image URL

SVG URL

Solution Diagrams

Log in with external accounts

Spark Partition Discovery

Log in with external accounts