| At Logical Clocks, we build a horizontally scalable ML pipeline framework on the open-source Hopsworks platform, based around its feature store and Airflow for orchestration: * https://hopsworks.readthedocs.io/en/latest/hopsml/hopsML.htm... * https://www.logicalclocks.com/feature-store/ The choice for the DataPrep stage basically comes down to Spark or Apache Beam, and we currently support Spark, but plan to soon add support for Beam, because of some of the goodies in TFX (TensorFlow Extended). For distributed hyperparam opt and distributed training, we leverage Apache Spark and our own version of YARN that supports GPUs - * https://www.youtube.com/watch?v=tx6HyoUYGL0 For model serving, we support Kubernetes: * https://hopsworks.readthedocs.io/en/0.9/hopsml/hopsML.html#s... Our platform supports TLS/SSL certificates everywhere and is open-source. Download it and try it, and it runs in several large enterprises in Europe. We have a cluster with >1000 users in Sweden here: * https://www.hops.site (Edited for formatting) |