, , this was fix in H2O. There is no fix needed on SW side. I assigned the ticket to you because I think Parquet import should be automatically tested in end-to-end tests in Docker.
When we have the Hadoop-docker tests import parquet can be one of the tests to ensure it works across different Hadoop versions
Especially different Spark versions, Hadoop version can be important too but at this point Spark is more important because Spark bundles Parquet libraries.
I see. So creating a simple unit test for parquet import would help here. We cherry-pick the changes to all release branches so that means the test would be running on all supported Spark versions.
I would decouple this from the dockerization of Sparkling Water and , close this guy and implement the test in different PR - - would do you think? As the core issue in this PR is actually solved and we just need to write a test
Feel free to reopen this if you think it is still necessary to keep this open. I will work on as soon as possible to ensure we test the parquet import at least on basic levels