当前位置：首页 > 软件库 > 神经网络/人工智能 > 机器学习/深度学习 >

kale

Kubeflow’s superfood for Data Scientists

授权协议 Apache-2.0 License

开发语言 Python

所属分类神经网络/人工智能、机器学习/深度学习

软件类型开源软件

地区不详

投递者利永年

操作系统跨平台

开源组织无

适用人群未知

软件概览

KALE (Kubeflow Automated pipeLines Engine) is a project that aims at simplifyingthe Data Science experience of deploying Kubeflow Pipelines workflows.

Kubeflow is a great platform for orchestrating complex workflows on topKubernetes and Kubeflow Pipeline provides the mean to create reusable componentsthat can be executed as part of workflows. The self-service nature of Kubeflowmake it extremely appealing for Data Science use, at it provides an easy accessto advanced distributed jobs orchestration, re-usability of components, JupyterNotebooks, rich UIs and more. Still, developing and maintaining Kubeflowworkflows can be hard for data scientists, who may not be experts in workingorchestration platforms and related SDKs. Additionally, data science ofteninvolve processes of data exploration, iterative modelling and interactiveenvironments (mostly Jupyter notebook).

Kale bridges this gap by providing a simple UI to define Kubeflow Pipelinesworkflows directly from you JupyterLab interface, without the need to change asingle line of code.

Read more about Kale and how it works in this Medium post:Automating Jupyter Notebook Deployments to Kubeflow Pipelines with Kale

Getting started

Install the Kale backend from PyPI and the JupyterLab extension. You can find aset of curated Notebooks in theexamples repository

# install kale
pip install kubeflow-kale

# install jupyter lab
pip install "jupyterlab>=2.0.0,<3.0.0"

# install the extension
jupyter labextension install kubeflow-kale-labextension
# verify extension status
jupyter labextension list

# run
jupyter lab

To build images to be used as a NotebookServer in Kubeflow, refer to theDockerfile in the docker folder.

FAQ

Head over to FAQ to read about some known issues and some of thelimitations imposed by the Kale data marshalling model.

Resources

Kale introduction blog post
Codelabs showcasing Kale working in MiniKF with Arrikto's Rok:
- From Notebook to Kubeflow Pipelines
- From Notebook to Kubeflow Pipelines with HP Tuning
KubeCon NA Tutorial 2019: From Notebook to Kubeflow Pipelines: An End-to-End Data Science Workflow/ video
CNCF Webinar 2020: From Notebook to Kubeflow Pipelines with MiniKF & Kale/ video
KubeCon EU Tutorial 2020: From Notebook to Kubeflow Pipelines with HP Tuning: A Data Science Journey/ video

Contribute

Backend

Create a new Python virtual environment with Python >= 3.6. Then:

cd backend/
pip install -e .[dev]

# run tests
pytest -x -vv

Labextension

The JupyterLab Python package comes with its own yarn wrapper, called jlpm.While using the previously installed venv, install JupyterLab by running:

pip install "jupyterlab>=2.0.0,<3.0.0"

You can then run the following to install the Kale extension:

cd labextension/

# install dependencies from package.lock
jlpm install
# build extension
jlpm run build

# list installed jp extensions
jlpm labextension list
# install Kale extension
jlpm labextension install .

# for development:
# build and watch
jlpm run watch

# in another shell, run JupyterLab in watch mode
jupyter lab --no-browser --watch

Git Hooks

This repository useshuskyto set up git hooks.

For husky to function properly, you need to have yarn installed and in yourPATH. The reason that is required is that husky is installed viajlpm install and jlpm is a yarn wrapper. (Similarly, if it was installedusing the npm package manager, then npm would have to be in PATH.)

Currently installed git hooks:

pre-commit: Run a prettier check on staged files, usingpretty-quick

kale

Getting started

FAQ

Resources

Contribute

Backend

Labextension

Git Hooks

同类工具

相关阅读

相关文章

相关问答

相关文档