Logical Clocks, the enterprise vendor for Hopsworks – a data platform for scale-out data science and AI, today announced the release of the first Enterprise Feature Store for Machine Learning. The Feature Store solves the problem of ad-hoc and siloed machine learning pipelines, where features, the training data for such pipelines, tend to become disorganized, disjointed, and duplicated, leading to correctness problems and redundant work.
Today, Logical Clocks AB is announcing the release of a Feature Store as part of Hopsworks version 0.8.0. The Feature Store is a central vault for documented, curated, and access-controlled features. In-house Feature Stores are already successfully in production at companies such as Uber, LinkedIn, Airbnb, and Comcast. Now, for the first time, a Feature Store is available as open source in an Enterprise Data platform, Hopsworks.
With the increasing adoption of machine learning in the Enterprise, organizations are looking to reduce the cost of developing and deploying AI by increasing the productivity of their Data Scientists. According to Uber, “dealing with data access, integration, feature management, and pipelines can often waste a huge amount of a data scientist’s time”. The Feature Store solves the data access and feature management problem for Data Science by removing the need for Data Scientists to constantly re-implement feature pipelines for collecting and transforming data to feed their machine learning models. Instead, Data Scientists can select features from the Feature Store to generate clean training data that can then be consumed directly by machine learning models. Hopsworks’ Feature Store builds on Apache Spark and Apache Hive to enable it to scale to massive data volumes.
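To make the idea concrete, the following is a minimal sketch of the register-once, select-many workflow described above. It is a toy in-memory illustration only, not the Hopsworks Feature Store API: the class `ToyFeatureStore`, its methods, and the example feature names are all hypothetical, and a production feature store would sit on scalable storage and compute rather than Python dictionaries.

```python
from dataclasses import dataclass, field


@dataclass
class ToyFeatureStore:
    """Hypothetical in-memory stand-in for a feature store (not the Hopsworks API)."""

    # feature name -> {entity key -> feature value}
    _features: dict = field(default_factory=dict)
    # feature name -> human-readable documentation
    _docs: dict = field(default_factory=dict)

    def register(self, name, values, description):
        """Publish a documented feature once so other pipelines can reuse it."""
        if name in self._features:
            raise ValueError(f"feature {name!r} is already registered")
        self._features[name] = dict(values)
        self._docs[name] = description

    def describe(self, name):
        """Return the documentation stored with a feature."""
        return self._docs[name]

    def training_rows(self, names):
        """Join the selected features on the entity keys they all share."""
        if not names:
            raise ValueError("select at least one feature")
        missing = [n for n in names if n not in self._features]
        if missing:
            raise KeyError(f"unknown features: {missing}")
        shared_keys = set.intersection(*(set(self._features[n]) for n in names))
        return [
            {"key": key, **{n: self._features[n][key] for n in names}}
            for key in sorted(shared_keys)
        ]


# Hypothetical features, computed once by whichever pipeline owns them.
store = ToyFeatureStore()
store.register(
    "avg_order_value",
    {"c1": 42.0, "c2": 17.5},
    "Mean order value per customer",
)
store.register(
    "orders_last_30d",
    {"c1": 3, "c2": 1, "c3": 7},
    "Number of orders per customer in the last 30 days",
)

# A data scientist selects features by name instead of rebuilding the pipelines.
rows = store.training_rows(["avg_order_value", "orders_last_30d"])
for row in rows:
    print(row)
```

Only customers present in every selected feature appear in the result, so the hypothetical customer `c3` is dropped. Rejecting a second registration under the same name is one simple way to discourage the duplicated features the announcement describes.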
“As part of the Hopsworks platform, the Feature Store also gives Enterprises full Machine Learning Governance – the exercise of authority and control (access, monitoring, auditing, and provenance) over the management of machine learning assets. Repeatable experiments, features, and models are now all governed and managed by Hopsworks,” said Dr. Jim Dowling, CEO of Logical Clocks.
About Logical Clocks AB
Logical Clocks was founded by the team that created and continues to drive Hops, the world’s most scalable and advanced Hadoop platform, and Hopsworks, the Data and AI platform for Hops. Logical Clocks’ vision is to simplify the process of refining data into intelligence at scale. Logical Clocks has offices in Stockholm, London, and Palo Alto. For more information, visit https://www.logicalclocks.com.
The Hopsworks 1.x series brings many new features and improvements. These include services such as the Feature Store and Experiments, enhanced support for distributed stream processing and analytics with Apache Flink and Apache Beam, Deep Learning pipelines built with TensorFlow Extended (TFX), code versioning for Jupyter notebooks with Git, and all-new provenance/lineage of data across all steps of a data engineering and data science pipeline. We are also excited that Hopsworks 1.x is the backbone of the all-new Managed Hopsworks platform for AWS, Hopsworks.ai (https://www.hopsworks.ai/).
On September 5th, 2019, Logical Clocks won the European DatSci award for “Data Science Technology Innovation of the Year”. Hopsworks is a data-intensive platform for data science and AI that includes the first Enterprise Feature Store for Machine Learning.
Hopsworks 0.10 brings the latest features, improvements, and bug fixes. It is the biggest release so far, comprising 191 JIRA issues, including many new features. This version is also the last of the 0.x series, as Hopsworks is gearing up for its 1.x series, starting with 1.0 at the end of Q3 2019.
Hopsworks 0.9.0 brings the latest features, improvements, and bug fixes. It introduces Apache Airflow as a service, which means users can now create their own workflows from within the familiar environment of a Hopsworks project. You can get started with Airflow in Hopsworks by visiting the user guide.
Announcing the release of the first Enterprise Feature Store for Machine Learning. The Feature Store solves the problem of ad-hoc and siloed machine learning pipelines, where features, the training data for such pipelines, tend to become disorganized, disjointed, and duplicated, leading to correctness problems and redundant work.
Hopsworks 0.8.0 brings the latest features, improvements, and bug fixes. It comes a short while after version 0.7.0 and brings the world’s first open-source feature store, a revamped REST API for managing jobs in Hopsworks, and improved visualization for Python notebooks.