Expose one-hot encoding to H2OFrame operations
Requesting a method that takes an H2ODataframe with one categorical column and outputs a multi-column dataframe: one column for each unique category, holding a 0/1 value for each row.
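To make the requested output concrete, here is a minimal sketch in plain Python. It does not use the H2O API: the function name `one_hot`, the `prefix` and `categories` parameters, and the `prefix.category` column naming are all hypothetical, chosen only for illustration. It assumes the output columns are ordered by sorted category value unless an explicit order is passed in.

```python
from typing import Dict, List, Optional, Sequence


def one_hot(
    values: Sequence[str],
    prefix: str = "col",
    categories: Optional[Sequence[str]] = None,
) -> Dict[str, List[int]]:
    """Expand one categorical column into one 0/1 column per category.

    Hypothetical helper for illustration only; not part of H2O.
    """
    if categories is None:
        # Assumption: default column order is the sorted unique categories.
        categories = sorted(set(values))
    # One output column per category; 1 where the row matches, else 0.
    return {
        f"{prefix}.{cat}": [1 if v == cat else 0 for v in values]
        for cat in categories
    }


colors = ["red", "green", "red", "blue"]
encoded = one_hot(colors, prefix="color")
for name, column in encoded.items():
    print(name, column)
```

Passing `categories` explicitly fixes the column order, which matters when the expanded frame must line up with the column order a particular algorithm expects.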
This function is GLM-specific: it might produce a different column order than the one XGBoost sees for the frame, which might be important in some cases.
It was just pointed out that R has a (private, non-exported) function which already does this, FYI: .getExpanded
Let’s revisit this. Lots of interest on Stack Overflow, especially by people using interpretability methods.
This issue will be closed when the change hits master.
It's in branch vlad_PUBDEV_3955, waiting for Michal's approval.