1、部署环境:ubantu
2、部署docker:docker版本20.10.7
启动racher时,建议启动命令:
docker run -d --privileged --restart=unless-stopped -p 443:443 -p 80:80 --privileged --name=myrancher -e AUDIT_LEVEL=3 rancher/rancher:v2.5.2
3、单机部署参考:cube-studio/install/kubernetes at master · tencentmusic/cube-studio · GitHub
cube-studio 开源一站式云原生机器学习平台 单机部署视频_哔哩哔哩_bilibili
注意:注意 :安装自己的集群时,一定要移动命名空间
4、执行start.sh时,redis包可能报错,问题1:Error response from daemon: Head https://ccr.ccs.tencentyun.com/v2/cube-studio/bitnami-redis/manifests/latest: error parsing HTTP 404 response body: invalid character 'i' looking f or beginning of value: "image repo not found"
可以pull_image_kubeflow.sh替换redis,使用如下命令替换:sudo docker pull ranchercharts/bitnami-redis && sudo docker tag ranchercharts/bitnami-redis bitnami/redis &
5、部署过程中与视频出入: 缺少两个组件:
knative-serving
pre-service
5、部署最后,知道原密码重命名:https://IP:443/update-password
不知道原密码重命名: sudo docker exec -it containerid reset-password
6、部署完成后几个重要的网址:
https://192.168.9.37:443/ 集群管理
http://192.168.9.37:8080 cube_studio 对应的app页面,此处视频描述不对
7、部署失败时,环境清空:
docker stop $(docker ps -aq)
docker system prune -f
docker volume rm -f$(docker volume ls -q)#如果镜像版本没问题,注释掉小面一行命令
docker image rm -f $(docker image ls -q)#删除pods磁盘挂载
umount $(df -HT | grep '/var/lib/kubelet/pods' | awk '{print $7}')
rm -rf /etc/ceph \
/etc/cni \
/etc/kubernetes \
/opt/cni \
/opt/rke \
/run/secrets/kubernetes.io \
/run/calico \
/run/flannel \
/var/lib/calico \
/var/lib/etcd \
/var/lib/cni \
/var/lib/kubelet \
/var/lib/rancher/rke/log \
/var/log/containers \
/var/log/pods \
/var/run/calico