Release 2.3.0-k8 · splicemachine/pysplice

What's New?

The External Native Spark Datasource API is now available (@jpanko1 )
Added functions to splicemachine.notebook to access the Spark UI and the Mlflow UI (@Ben-Epstein )
Python Dependency fixes for the October 2020 pip changes (@Ben-Epstein )
More graceful [errors] for unsupported models (#74) (@Ben-Epstein )
Better checking for spark datatypes (@Ben-Epstein, @ZachC16 )
Deployment support for non-pipeline models (@Ben-Epstein, @ZachC16 )
Support for Linear Support Vector Machine Spark Model (@Ben-Epstein, @ZachC16 )
Better unit testing (@Ben-Epstein @ZachC16)
New warning passed on Keras and Spark models when the number of label columns passed in doesn't match model (@Ben-Epstein, @domclassen )
Database Deployment Migrated to Server side running on Bobby pod (@abaveja313, @Ben-Epstein )
Initial K8s deployment code available - known bug with init container hanging, expected to be working in next release (@abaveja313 )
Models are now logged as MLModels instead of the raw model binary (@abaveja313 )
Model caching for database deployment (@Ben-Epstein @sergioferragut )
Fix for artifacts downloading without file extension (@Ben-Epstein )
Model deployment metadata managed by Bobby (@abaveja313 )

The models table no longer exists. The deployment model is instead stored in a new column of the Artifacts table called database_binary. You must run the migration scripts to alter the artifacts table, otherwise existing deployments won't work
Models currently saved in the database with log_model will not be deployable as we have changed the model saving format from model to MLModel. You must read in the model binary, deserialize it, and re-log the model under a new run.

This release is in tandem with the ml-workflow release.

Upgrade scripts from 2.2.0 are available here