Sunday, November 18, 2018

PySpark: references to variable number of columns in UDF


Problem statement:

  Suppose that you want to create a column in a DataFrame based on many existing columns, but you don't know in advance how many columns there will be, perhaps because the number is supplied by the user or by another program.

This is how you can do it:
import numpy as np
import pyspark.sql.functions as f
import pyspark.sql.types as t
# ...
data_frame = data_frame.withColumn('columnB', data_frame['columnA'])
data_frame = data_frame.withColumn('columnC', data_frame['columnA'])
attrs = ['columnA', 'columnB', 'columnC']
# Concatenate the given columns. Each column holds a SparseVector in this case.
def udf_concat_vec(*a):
    # a is a tuple of size 1: f.array() packs all the columns into a single argument
    var1 = a[0]
    # var1 is a list of size 3, one SparseVector per column in attrs
    result = []
    for var2 in var1:
        result = np.concatenate((result, var2.toArray()))
    return result.tolist()
my_udf_concat_vec = f.UserDefinedFunction(udf_concat_vec, t.ArrayType(t.FloatType()))
data_frame = data_frame.withColumn("together", my_udf_concat_vec(f.array(attrs)))
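To see what the UDF body does without a Spark session, here is a plain-Python sketch of the same concatenation logic. Ordinary NumPy arrays stand in for the SparseVector values (a simplification: in the real UDF each element is a pyspark.ml.linalg.SparseVector and you call .toArray() on it first); the list argument mimics what f.array(attrs) delivers for one row:

```python
import numpy as np

def concat_vectors(vectors):
    # Mimics the UDF body: flatten a list of per-column vectors
    # into one Python list for the output column.
    result = np.array([])
    for vec in vectors:
        result = np.concatenate((result, vec))
    return result.tolist()

# Three "columns" for one row, as f.array(attrs) would pack them.
row_values = [np.array([1.0, 0.0]), np.array([0.0, 2.0]), np.array([3.0, 0.0])]
print(concat_vectors(row_values))  # [1.0, 0.0, 0.0, 2.0, 3.0, 0.0]
```

Because the UDF only sees the list it is handed, the same code works for any number of columns in attrs.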