暗能星系

    tmp

    张渌
    • zhanglu

      https://cn.download.nvidia.com/XFree86/Linux-x86_64/580.95.05/NVIDIA-Linux-x86_64-580.95.05.run

      • zhanglu

        2025/10/13 07:23:31 Failed to initialize NVML: could not load NVML library.
        2025/10/13 07:23:31 If this is a GPU node, did you set the docker default runtime to nvidia?
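
The second log line points at Docker's default runtime. Below is a minimal sketch of the matching `daemon.json` change, assuming the NVIDIA Container Toolkit is already installed on the node and that Docker reads `/etc/docker/daemon.json`. The helper name `with_nvidia_default_runtime` is hypothetical; it only builds the configuration and does not write the file or restart Docker.

```python
import json


def with_nvidia_default_runtime(config):
    """Return a copy of a daemon.json dict with nvidia set as the default runtime."""
    updated = dict(config)
    runtimes = dict(updated.get("runtimes", {}))
    # Keep an existing "nvidia" runtime entry if present; otherwise add the usual one.
    runtimes.setdefault(
        "nvidia", {"path": "nvidia-container-runtime", "runtimeArgs": []}
    )
    updated["runtimes"] = runtimes
    updated["default-runtime"] = "nvidia"
    return updated


if __name__ == "__main__":
    # Starts from an empty config here; on a real node, load the existing file first.
    print(json.dumps(with_nvidia_default_runtime({}), indent=2))
```

Once the result is written to `daemon.json`, Docker has to be restarted before the new default runtime takes effect.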

        • zhanglu

          wget https://developer.download.nvidia.com/compute/cuda/13.0.2/local_installers/cuda_13.0.2_580.95.05_linux.run

          • zhanglu

            wget https://developer.download.nvidia.com/compute/cuda/11.6.2/local_installers/cuda_11.6.2_510.47.03_linux.run

            • zhanglu

              Failed to initialize NVML: could not load NVML library

              • zhanglu

                https://raw.githubusercontent.com/NVIDIA/k8s-device-plugin/v0.17.1/deployments/static/nvidia-device-plugin.yml

                • zhanglu

                  https://www.nvidia.cn/drivers/details/252785/

                  • zhanglu

                    curl -s -L https://nvidia.github.io/nvidia-docker/centos8/nvidia-docker.repo | sudo tee /etc/yum.repos.d/nvidia-docker.repo
                    sudo yum install -y nvidia-container-toolkit

                    • zhanglu

                      d?????????? ? ? ? ? ? cephfs_data

                      • zhanglu

                        root
                        Tzzs@2025*
                        220.185.228.106
                        30001

                        • zhanglu

                          http://192.168.30.202:31237/api/workflows/v1/8a6aa1f8-0ab1-4518-bc22-06390d1c7494/abort

                          • zhanglu

                            curl -X POST "http://192.168.30.202:31237/api/workflows/v1/8a6aa1f8-0ab1-4518-bc22-06390d1c7494/abort" -H "accept: application/json"

                            • zhanglu

                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/78cf44c7-8a10-45e7-b518-dda45a05c7ad/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/989090fd-143a-47b8-b967-389a427d75a9/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/42345c0e-542d-433d-971b-fd1398e5831f/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/d21d8ffc-526e-4776-9632-401438be25c0/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/c13bdcc8-76d3-48a9-acbc-feb1842f1b95/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/222d2b48-0974-4cb7-8629-a0daf49d1177/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/8a6aa1f8-0ab1-4518-bc22-06390d1c7494/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/06dedc34-5518-453f-8dea-d922f793d329/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/dbdcb701-f9a8-4e87-8b6f-17b96e612e4b/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/2b427cce-9d1f-4522-8482-f4e55e7cad3f/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/e18a9ee9-23cf-42e1-8383-a632908a5f23/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/e01a39d8-6140-4fae-85c8-547a15d2f99f/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/9d43ed37-0c46-49f6-8b5b-17a75a392ece/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/975b5959-6cce-4c2e-8237-432dcf0451a7/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/3f1fcf19-bf03-489a-a317-9f0452376af3/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/3e58f4b4-6f2e-45d0-bfb2-585acfd0529f/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/925fe273-25fb-43d4-93b3-56b752b40053/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/cad4787c-baa4-4520-ba74-e244aceccd29/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/b0b21bcc-61de-417d-8a49-29fffbd35f40/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/fd4b9685-6495-4cd3-b1ff-7af3bf14bba0/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/d5c5d280-692d-47d4-9db0-edbed55dc893/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/fc85a28b-63e7-4f3e-ac19-282da4ce74b7/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/164adb9e-9f72-43ab-a22d-90d5fa0469c5/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/f76b822a-85f7-47c0-b285-a06ba61f619c/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/0c7eee09-ab69-4d9f-9979-1f3d6ec0513c/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/cbd712e0-1861-4c15-b5d5-1bd793b5c5c3/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/dfaacb27-0b44-44c3-9664-108cdc11a076/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/14606e5c-9399-4c8a-9908-3898a9caca73/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/add7f4d1-2cb7-463a-bd4a-3dee1fb7dac6/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/f8f8eb9c-077a-440d-922d-82ae0ff8109d/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/920fdb9a-816e-4fcf-bed2-14b5ba333d7a/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/a4caf18d-1795-4093-aeae-2468a325f771/abort" -H "accept: application/json"
                              curl -X POST "http://192.168.30.202:31237/api/workflows/v1/f44a01dc-ac87-4a51-9598-91873643a3c0/abort" -H "accept: application/json"

                              • zhanglu

                                import argparse
                                import json
                                import sys

                                import requests
                                from requests.exceptions import RequestException  # base class for requests' network errors


                                def fetch_json_from_url(url, timeout=10):
                                    """Fetch JSON from an HTTP endpoint with a GET request and return the parsed data."""
                                    try:
                                        # Send the GET request with a timeout
                                        response = requests.get(url, timeout=timeout)

                                        # Raise HTTPError for 4xx/5xx status codes
                                        response.raise_for_status()

                                        # Parse the JSON response body
                                        return response.json()

                                    except requests.exceptions.Timeout:
                                        raise ValueError(f"Error: request to '{url}' timed out after {timeout} s; check the network or whether the endpoint is available")
                                    except requests.exceptions.ConnectionError:
                                        raise ValueError(f"Error: cannot connect to '{url}'; check that the URL is correct and the endpoint is online")
                                    except requests.exceptions.HTTPError as e:
                                        # Covers HTTP error status codes such as 404 and 500
                                        raise ValueError(f"Error: endpoint returned HTTP {e.response.status_code}, details: {e}")
                                    except json.JSONDecodeError:
                                        raise ValueError("Error: the endpoint did not return valid JSON; check the response")
                                    except RequestException as e:
                                        # Any other request failure
                                        raise ValueError(f"Error while requesting the endpoint: {e}")


                                def parse_running_task_ids(json_data):
                                    """Extract the IDs of tasks whose status is 'Running' and return them as a list."""
                                    if "results" not in json_data:
                                        return []  # no "results" field: nothing to report

                                    running_ids = []
                                    for task in json_data["results"]:
                                        # Collect only tasks whose status is "Running"
                                        if task.get("status") == "Running":
                                            running_ids.append(task.get("id", "N/A"))
                                    return running_ids


                                def generate_abort_commands(task_ids):
                                    """Print one curl abort command per task ID."""
                                    if not task_ids:
                                        print("No tasks with status 'Running' found; no abort commands to generate")
                                        return

                                    print("Generated abort commands:")
                                    for task_id in task_ids:
                                        if task_id != "N/A":
                                            print(f'curl -X POST "http://192.168.30.202:31237/api/workflows/v1/{task_id}/abort" -H "accept: application/json"')
                                        else:
                                            print("Warning: a task has status 'Running' but no valid ID; skipping")


                                def main():
                                    # Parse command-line arguments (the HTTP endpoint URL)
                                    parser = argparse.ArgumentParser(description="Fetch task information from an HTTP endpoint and generate abort commands for Running tasks")
                                    parser.add_argument("url", help="URL of the HTTP endpoint that returns the JSON data (e.g. http://example.com/tasks)")
                                    parser.add_argument("--timeout", type=int, default=10, help="request timeout in seconds (default: 10)")
                                    args = parser.parse_args()

                                    try:
                                        # Fetch and parse the JSON data from the endpoint
                                        json_data = fetch_json_from_url(args.url, timeout=args.timeout)
                                        # Extract the IDs of Running tasks
                                        running_task_ids = parse_running_task_ids(json_data)
                                        # Generate and print the abort commands
                                        generate_abort_commands(running_task_ids)
                                    except ValueError as e:
                                        print(f"Processing failed: {e}")
                                        sys.exit(1)


                                if __name__ == "__main__":
                                    main()
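
Instead of printing curl commands to paste into a shell, the same abort requests could be sent directly. This is a minimal sketch using only the standard library, assuming the server accepts an empty-body POST at the URL shape used by the curl commands in this thread. The helper names `build_abort_url`, `post_json`, and `abort_workflows` are hypothetical and not part of the script above.

```python
import urllib.error
import urllib.request


def build_abort_url(base_url, workflow_id):
    """Build the abort URL in the same shape as the curl commands in this thread."""
    return f"{base_url.rstrip('/')}/api/workflows/v1/{workflow_id}/abort"


def post_json(url, timeout=10):
    """POST with an empty body and return (HTTP status, response text)."""
    request = urllib.request.Request(
        url, data=b"", method="POST", headers={"accept": "application/json"}
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status, response.read().decode("utf-8", "replace")
    except urllib.error.HTTPError as e:
        # 4xx/5xx responses still carry a status code and body worth reporting.
        return e.code, e.read().decode("utf-8", "replace")


def abort_workflows(base_url, workflow_ids, send=post_json):
    """Abort each distinct workflow ID once; return {workflow_id: HTTP status}."""
    results = {}
    for workflow_id in dict.fromkeys(workflow_ids):  # de-duplicate, keep order
        status, _body = send(build_abort_url(base_url, workflow_id))
        results[workflow_id] = status
    return results
```

The `send` parameter exists so the loop can be exercised without a live server; connection failures (`urllib.error.URLError`) are deliberately left to propagate so a dead endpoint stops the run instead of being recorded as a status.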

                                • zhanglu

                                  -o Report/sample_info.xlsx`
                                  ################# retry : Some(9977) #################
                                  ################# retry : Some(9933) #################
                                  ################# retry : Some(9921) #################
                                  ################# retry : Some(9992) #################
                                  ################# retry : Some(9919) #################
                                  ################# retry : Some(9921) #################
                                  ################# retry : Some(9952) #################
                                  ################# retry : Some(9867) #################
                                  ################# retry : Some(9978) #################
                                  ################# retry : Some(9962) #################
                                  ################# retry : Some(9958) #################
                                  ################# retry : Some(9948) #################
                                  ################# retry : Some(9839) #################
                                  ################# retry : Some(9994) #################
                                  ################# retry : Some(9959) #################
                                  ################# retry : Some(9964) #################
                                  ################# retry : Some(9985) #################
                                  ################# retry : Some(9953) #################
                                  ################# retry : Some(9899) #################
                                  ################# retry : Some(9990) #################
                                  ################# re

                                  • zhanglu

                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/f6d52f2f-1b0e-4c30-a98b-2283369f8320/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/f09ea5d2-e6aa-4692-ae33-8ce56386a3c8/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/f06ec8db-a926-43e2-9cf0-449645ac379f/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/9104b0ad-ce9b-4709-82d8-2d2baa4f2ac9/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/d17e79d7-a42c-460e-a2fd-719d343f1d2e/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/cbd712e0-1861-4c15-b5d5-1bd793b5c5c3/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/dfaacb27-0b44-44c3-9664-108cdc11a076/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/14606e5c-9399-4c8a-9908-3898a9caca73/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/add7f4d1-2cb7-463a-bd4a-3dee1fb7dac6/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/f8f8eb9c-077a-440d-922d-82ae0ff8109d/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/920fdb9a-816e-4fcf-bed2-14b5ba333d7a/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/a4caf18d-1795-4093-aeae-2468a325f771/abort" -H "accept: application/json"
                                    curl -X POST "http://192.168.30.202:31237/api/workflows/v1/f44a01dc-ac87-4a51-9598-91873643a3c0/abort" -H "accept: application/json"

                                    • zhanglu

                                      docker network create \
                                        --driver bridge \
                                        --opt com.docker.network.bridge.enable_ip_masquerade=true \
                                        my-bridge

                                      Then use this network when starting containers:

                                      docker run -it --network my-bridge <image-name>
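
When the same network has to be created from a script, the command can be assembled as an argument list instead of a hand-wrapped shell line. This is a minimal sketch, assuming the `docker` CLI is on the PATH; the helper name `network_create_args` is hypothetical, and the example prints the command rather than running it.

```python
import shlex


def network_create_args(name, driver="bridge", options=None):
    """Build the argv list for `docker network create` with --driver and --opt flags."""
    args = ["docker", "network", "create", "--driver", driver]
    for key, value in (options or {}).items():
        args += ["--opt", f"{key}={value}"]
    args.append(name)
    return args


if __name__ == "__main__":
    argv = network_create_args(
        "my-bridge",
        options={"com.docker.network.bridge.enable_ip_masquerade": "true"},
    )
    # Print the command instead of running it; pass argv to subprocess.run to execute.
    print(shlex.join(argv))
```

Building a list avoids the missing-backslash problem entirely, since each flag is a separate element rather than a continuation line.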

                                      • zhanglu (replying to @zhanglu)

                                        @zhanglu
                                        docker network create \
                                          --driver bridge \
                                          --opt com.docker.network.bridge.default_bridge=true \
                                          --opt com.docker.network.bridge.enable_icc=true \
                                          --opt com.docker.network.bridge.enable_ip_masquerade=true \
                                          --opt com.docker.network.bridge.host_binding_ipv4=0.0.0.0 \
                                          --opt com.docker.network.bridge.name=docker0 \
                                          --opt com.docker.network.driver.mtu=1500 \
                                          my-bridge-network

                                        • zhanglu

                                          [docker-ce-stable]
                                          name=Docker CE Stable - $basearch
                                          baseurl=https://mirrors.aliyun.com/docker-ce/linux/centos/$releasever/$basearch/stable
                                          enabled=1
                                          gpgcheck=1
                                          gpgkey=https://mirrors.aliyun.com/docker-ce/linux/centos/gpg

                                          [docker-ce-stable-debuginfo]
                                          name=Docker CE Stable - Debuginfo $basearch
                                          baseurl=https://mirrors.aliyun.com/docker-ce/linux/centos/$releasever/debug-$basearch/stable
                                          enabled=0
                                          gpgcheck=1
                                          gpgkey=https://mirrors.aliyun.com/docker-ce/linux/centos/gpg

                                          [docker-ce-stable-source]
                                          name=Docker CE Stable - Sources
                                          baseurl=https://mirrors.aliyun.com/docker-ce/linux/centos/$releasever/source/stable
                                          enabled=0
                                          gpgcheck=1
                                          gpgkey=https://mirrors.aliyun.com/docker-ce/linux/centos/gpg

                                          [docker-ce-test]
                                          name=Docker CE Test - $basearch
                                          baseurl=https://mirrors.aliyun.com/docker-ce/linux/centos/$releasever/$basearch/test
                                          enabled=0
                                          gpgcheck=1
                                          gpgkey=https://mirrors.aliyun.com/docker-ce/linux/centos/gpg

                                          [docker-ce-test-debuginfo]
                                          name=Docker CE Test - Debuginfo $basearch
                                          baseurl=https://mirrors.aliyun.com/docker-ce/linux/centos/$releasever/debug-$basearch/test
                                          enabled=0
                                          gpgcheck=1
                                          gpgkey=https://mirrors.aliyun.com/docker-ce/linux/centos/gpg

                                          [docker-ce-test-source]
                                          name=Docker CE Test - Sources
                                          baseurl=https://mirrors.aliyun.com/docker-ce/linux/centos/$releasever/source/test
                                          enabled=0
                                          gpgcheck=1
                                          gpgkey=https://mirrors.aliyun.com/docker-ce/linux/centos/gpg

                                          [docker-ce-nightly]
                                          name=Docker CE Nightly - $basearch
                                          baseurl=https://mirrors.aliyun.com/docker-ce/linux/centos/$releasever/$basearch/nightly
                                          enabled=0
                                          gpgcheck=1
                                          gpgkey=https://mirrors.aliyun.com/docker-ce/linux/centos/gpg

                                          [docker-ce-nightly-debuginfo]
                                          name=Docker CE Nightly - Debuginfo $basearch
                                          baseurl=https://mirrors.aliyun.com/docker-ce/linux/centos/$releasever/debug-$basearch/nightly
                                          enabled=0
                                          gpgcheck=1
                                          gpgkey=https://mirrors.aliyun.com/docker-ce/linux/centos/gpg

                                          [docker-ce-nightly-source]
                                          name=Docker CE Nightly - Sources
                                          baseurl=https://mirrors.aliyun.com/docker-ce/linux/centos/$releasever/source/nightly
                                          enabled=0
                                          gpgcheck=1
                                          gpgkey=https://mirrors.aliyun.com/docker-ce/linux/centos/gpg

                                          • zhanglu

                                            ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABgQDW8bqVdBe8okVnEaWxn/GCBn2ngPWezwomczuS+tcj/T3odSjNPe6NQVRb+KkumnwMk8CHLq41v1J+C9BZp2x/w90pCKtSjotl94CbNzqnEFJK+tfrKG38twMlPnCeHfsGXEeagbT0Td4nqhvvM2oz1gEzYwoNsY3VV/hufLbj0qNBeoKOqltTUtdRBaNyuxDxIzh+uHy5LrJAWjftGkjOHbJDXp+HEgXsUDyhr13lnAm2s3WPT5jLLiZvPiTBnrEscLZEoLkfelUJMwGOOIfTakd7illHLUtyRO9UcG3cYvUOTc2Pfp/SDLcuxrHbfpsRSz9ZD4ZzsX50SjTxQR3O44RGk2b4SQK7JNhEIDIzbhXbGTRPQVK4yI+ox6NKKYgj/GkxxMCErcAWa/o9PGLYEHJSuyK8vaE/Dh0rHYgUtq0EIolQ0DLPHa4vzlhkxK9j5qUtu6oZkYcqIhevz+blwnQ2h8DIWmK+9AL/QmqvbfWysMyPzXj7/475Bmt5aZ0= admin@example.com
