暗能星系

    • 登录
    • 搜索

    tmp

    张渌
    2
    262
    1755
    正在加载更多帖子
    • 从旧到新
    • 从新到旧
    • 最多赞同
    回复
    • 在新帖中回复
    登录后回复
    此主题已被删除。只有拥有主题管理权限的用户可以查看。
    • Z
      zhanglu 最后由 编辑

      https://www.cnblogs.com/hukey/p/16600243.html

      1 条回复 最后回复 回复 引用 0
      • Z
        zhanglu 最后由 编辑

        wget https://raw.githubusercontent.com/NVIDIA/k8s-device-plugin/v0.14.1/nvidia-device-plugin.yml

        1 条回复 最后回复 回复 引用 0
        • Z
          zhanglu 最后由 编辑

          KUBE_LOGTOSTDERR="--logtostderr=true"KUBE_LOGTOSTDERR="--logtostderr=true"
          KUBE_LOG_LEVEL="--v=2"
          KUBELET_ADDRESS="--node-ip=192.168.10.11"
          KUBELET_HOSTNAME="--hostname-override=node2"

          KUBELET_ARGS="--bootstrap-kubeconfig=/etc/kubernetes/bootstrap-kubelet.conf
          --config=/etc/kubernetes/kubelet-config.yaml
          --kubeconfig=/etc/kubernetes/kubelet.conf
          --pod-infra-container-image=k8s.gcr.io/pause:3.3
          --runtime-cgroups=/systemd/system.slice
          "
          KUBELET_NETWORK_PLUGIN="--network-plugin=cni --cni-conf-dir=/etc/cni/net.d --cni-bin-dir=/opt/cni/bin"

          KUBE_LOG_LEVEL="--v=2"
          KUBELET_ADDRESS="--node-ip=192.168.10.11"
          KUBELET_HOSTNAME="--hostname-override=node2"

          KUBELET_ARGS="--bootstrap-kubeconfig=/etc/kubernetes/bootstrap-kubelet.conf
          --config=/etc/kubernetes/kubelet-config.yaml
          --kubeconfig=/etc/kubernetes/kubelet.conf
          --pod-infra-container-image=k8s.gcr.io/pause:3.3
          --runtime-cgroups=/systemd/system.slice
          "
          KUBELET_NETWORK_PLUGIN="--network-plugin=cni --cni-conf-dir=/etc/cni/net.d --cni-bin-dir=/opt/cni/bin"

          Z 1 条回复 最后回复 回复 引用 0
          • Z
            zhanglu @zhanglu 最后由 编辑

            @zhanglu --logtostderr=true
            --v=2
            --node-ip=192.168.10.11
            --hostname-override=node2
            --bootstrap-kubeconfig=/etc/kubernetes/bootstrap-kubelet.conf
            --config=/etc/kubernetes/kubelet-config.yaml
            --kubeconfig=/etc/kubernetes/kubelet.conf
            --pod-infra-container-image=k8s.gcr.io/pause:3.3
            --runtime-cgroups=/systemd/system.slice
            --network-plugin=cni --cni-conf-dir=/etc/cni/net.d --cni-bin-dir=/opt/cni/bin \

            1 条回复 最后回复 回复 引用 0
            • Z
              zhanglu 最后由 编辑

              https://raw.githubusercontent.com/NVIDIA/k8s-device-plugin/v0.17.0/deployments/static/nvidia-device-plugin.yml

              1 条回复 最后回复 回复 引用 0
              • Z
                zhanglu 最后由 编辑

                https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html

                1 条回复 最后回复 回复 引用 0
                • Z
                  zhanglu 最后由 编辑

                  curl -s -L https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo |
                  sudo tee /etc/yum.repos.d/nvidia-container-toolkit.repo

                  yum install -y nvidia-container-toolkit

                  1 条回复 最后回复 回复 引用 0
                  • Z
                    zhanglu 最后由 编辑

                    wget https://developer.download.nvidia.com/compute/cuda/10.2/Prod/local_installers/cuda-repo-rhel8-10-2-local-10.2.89-440.33.01-1.0-1.x86_64.rpm
                    sudo rpm -i cuda-repo-rhel8-10-2-local-10.2.89-440.33.01-1.0-1.x86_64.rpm
                    sudo dnf clean all
                    sudo dnf -y module install nvidia-driver:latest-dkms
                    sudo dnf -y install cuda

                    1 条回复 最后回复 回复 引用 0
                    • Z
                      zhanglu 最后由 编辑

                      https://raw.githubusercontent.com/NVIDIA/k8s-device-plugin/v0.12.0/nvidia-device-plugin.yml

                      1 条回复 最后回复 回复 引用 0
                      • Z
                        zhanglu 最后由 编辑

                        The request operation to custom alerting rule could not be done because thanos ruler is not enabled

                        1 条回复 最后回复 回复 引用 0
                        • Z
                          zhanglu 最后由 编辑

                          akka.http.server.request-timeout = 20000000s
                          akka.http.server.idle-timeout = 20000000s

                          1 条回复 最后回复 回复 引用 0
                          • Z
                            zhanglu 最后由 编辑

                            docker.io/library/nginx:1.25.2-alpine

                            1 条回复 最后回复 回复 引用 0
                            • Z
                              zhanglu 最后由 编辑

                              nerdctl pull dockerhub.genostack.com:8090/library/csi-attacher:v4.4.2
                              nerdctl pull dockerhub.genostack.com:8090/library/csi-resizer:v1.9.2
                              nerdctl pull dockerhub.genostack.com:8090/library/csi-provisioner:v3.6.2
                              nerdctl pull dockerhub.genostack.com:8090/library/csi-node-driver-registrar:v2.9.1
                              nerdctl pull dockerhub.genostack.com:8090/library/csi-resizer:v1.9.2
                              nerdctl pull dockerhub.genostack.com:8090/library/csi-attacher:v4.4.2
                              nerdctl pull dockerhub.genostack.com:8090/library/ceph:v18.2.0
                              nerdctl pull dockerhub.genostack.com:8090/library/cephcsi:v3.10.0
                              nerdctl pull dockerhub.genostack.com:8090/library/ceph:v1.13.0

                              nerdctl tag dockerhub.genostack.com:8090/library/csi-attacher:v4.4.2 registry.k8s.io/sig-storage/csi-attacher:v4.4.2
                              nerdctl tag dockerhub.genostack.com:8090/library/csi-resizer:v1.9.2 registry.k8s.io/sig-storage/csi-resizer:v1.9.2
                              nerdctl tag dockerhub.genostack.com:8090/library/csi-provisioner:v3.6.2 registry.k8s.io/sig-storage/csi-provisioner:v3.6.2
                              nerdctl tag dockerhub.genostack.com:8090/library/csi-node-driver-registrar:v2.9.1 registry.k8s.io/sig-storage/csi-node-driver-registrar:v2.9.1
                              nerdctl tag dockerhub.genostack.com:8090/library/csi-resizer:v1.9.2 registry.k8s.io/sig-storage/csi-resizer:v1.9.2
                              nerdctl tag dockerhub.genostack.com:8090/library/csi-attacher:v4.4.2 registry.k8s.io/sig-storage/csi-attacher:v4.4.2
                              nerdctl tag dockerhub.genostack.com:8090/library/ceph:v18.2.0 quay.io/ceph/ceph:v18.2.0
                              nerdctl tag dockerhub.genostack.com:8090/library/cephcsi:v3.10.0 quay.io/cephcsi/cephcsi:v3.10.0
                              nerdctl tag dockerhub.genostack.com:8090/library/ceph:v1.13.0 rook/ceph:v1.13.0

                              1 条回复 最后回复 回复 引用 0
                              • Z
                                zhanglu 最后由 编辑

                                nerdctl pull dockerhub.genostack.com:8090/library/csi-snapshotter:v6.3.2
                                nerdctl tag dockerhub.genostack.com:8090/library/csi-snapshotter:v6.3.2 registry.k8s.io/sig-storage/csi-snapshotter:v6.3.2

                                1 条回复 最后回复 回复 引用 0
                                • Z
                                  zhanglu 最后由 编辑

                                  kubectl -n rook-ceph get secret rook-ceph-dashboard-password -o jsonpath="{['data']['password']}" | base64 --decode && echo

                                  1 条回复 最后回复 回复 引用 0
                                  • Z
                                    zhanglu 最后由 编辑

                                    mount -t ceph 10.233.31.47:6789,10.233.36.144:6789,10.233.33.7:6789:/ /cephfs_data -o name=admin,secret=AQDw/2VjAJmTBxAAezkJzzXVJ4VCxiYtx49L+w==

                                    1 条回复 最后回复 回复 引用 0
                                    • Z
                                      zhanglu 最后由 编辑

                                      docker pull dockerhub.genostack.com:8090/library/web_socket:latest

                                      1 条回复 最后回复 回复 引用 0
                                      • Z
                                        zhanglu 最后由 编辑

                                        PostgreSQL Database directory appears to contain a database; Skipping initialization

                                        2025-04-02 13:24:07.154 UTC [1] LOG: listening on IPv4 address "0.0.0.0", port 5432
                                        2025-04-02 13:24:07.154 UTC [1] LOG: listening on IPv6 address "::", port 5432
                                        2025-04-02 13:24:07.157 UTC [1] LOG: listening on Unix socket "/var/run/postgresql/.s.PGSQL.5432"
                                        2025-04-02 13:24:07.411 UTC [26] LOG: database system was shut down at 2025-04-02 11:47:48 UTC
                                        2025-04-02 13:24:07.411 UTC [26] LOG: invalid record length at 8/DBE56CD8: wanted 24, got 0
                                        2025-04-02 13:24:07.411 UTC [26] LOG: invalid primary checkpoint record
                                        2025-04-02 13:24:07.411 UTC [26] LOG: invalid resource manager ID in secondary checkpoint record
                                        2025-04-02 13:24:07.411 UTC [26] PANIC: could not locate a valid checkpoint record
                                        2025-04-02 13:24:08.323 UTC [1] LOG: startup process (PID 26) was terminated by signal 6: Aborted
                                        2025-04-02 13:24:08.323 UTC [1] LOG: aborting startup due to startup process failure
                                        2025-04-02 13:24:08.338 UTC [1] LOG: database system is shut down

                                        1 条回复 最后回复 回复 引用 0
                                        • Z
                                          zhanglu 最后由 编辑

                                          PostgreSQL Database directory appears to contain a database; Skipping initialization

                                          2025-04-07 04:24:18.838 UTC [1] FATAL: lock file "postmaster.pid" is empty
                                          2025-04-07 04:24:18.838 UTC [1] HINT: Either another server is starting, or the lock file is the remnant of a previous server startup crash.

                                          1 条回复 最后回复 回复 引用 0
                                          • Z
                                            zhanglu 最后由 编辑

                                            kill -9 lsof | grep delete | awk '{print $2}'

                                            1 条回复 最后回复 回复 引用 0
                                            • First post
                                              Last post
                                            Powered by 暗能星系