FB_init

Sunday, November 18, 2018

PySpark: references to variable number of columns in UDF


Problem statement:

  Suppose that you want to create a column in a DataFrame based on many existing columns, but you don't know how many columns, possibly because that will be given by the user or another software.

This is how you can do it:

No comments: