Raymond Raymond / Code Snippets & Tips

Spark DataFrame: Show Full Column Content without Truncation

event 2021-03-08 visibility 2,818 comment 0 insights toc
insights Stats
toc Table of contents

When calling Spark show function to display the content of a DataFrame, it will not print out the full content of a column by default.  For example, the following output prints out truncated column content:

| ID|               Value|
|  1|Adddddddddddddddd...|
|  1|                   A|
|  3|                   C|

To show the full content of the column, we just need to specify the truncate parameter to False:

:param truncate: If set to ``True``, truncate strings longer than 20 chars by default.
            If set to a number greater than one, truncates long strings to length ``truncate``
            and align cells right.

Code snippet

from pyspark.sql import SparkSession
from pyspark.sql.types import IntegerType, StringType, StructField, StructType

appName = "PySpark Example - Show Column Full Content"
master = "local"

# Create Spark session
spark = SparkSession.builder \
    .appName(appName) \
    .master(master) \

# Sample data
data = [(1, 'Addddddddddddddddddddddddddddddddddddddddddd'),
        (1, 'A'), (3, 'C')]

# schema
schema = StructType([StructField("ID", IntegerType(), True),
                     StructField("Value", StringType(), True)])

# Create Spark DaraFrame from pandas DataFrame
df = spark.createDataFrame(data, schema)



|ID |Value                                       |
|1  |Addddddddddddddddddddddddddddddddddddddddddd|
|1  |A                                           |
|3  |C                                           |
More from Kontext
comment Comments
No comments yet.

Please log in or register to comment.

account_circle Log in person_add Register

Log in with external accounts