DataFrame.foreach
can be used to iterate/loop through each row (pyspark.sql.types.Row
) in a Spark DataFrame object and apply a function to all the rows. This method is a shorthand for DataFrame.rdd.foreach
.
Note: Please be cautious when using this method especially if your DataFrame is big.
Output:
+-----+--------+ | col1| col2| +-----+--------+ |Hello| Kontext| |Hello|Big Data| +-----+--------+ col1=Hello, col2=Kontext col1=Hello, col2=Big Data