go-computing-provider

module

v1.1.1 Latest Latest Go to latest Published: Feb 11, 2025 License: MIT

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/swanchain/go-computing-provider

README ¶

Computing Provider

A computing provider is an individual or organization that participates in the decentralized computing network by offering computational resources such as processing power (CPU and GPU), memory, storage, and bandwidth.

As a resource provider, you can run a ECP(Edge Computing Provider) and FCP(Fog Computing Provider) to contribute yourcomputing resource.

ECP (Edge Computing Provider) specializes in processing data at the source of data generation, using minimal latency setups ideal for real-time applications. This provider handles specific, localized tasks directly on devices at the network’s edge, such as IoT devices. At the current stage, ECP supports the generation of ZK-Snark proof of Filecoin network, and more ZK proof types will be gradually supported, such as Aleo, Scroll, starkNet, etc. Install Guideline
FCP (Fog Computing Provider) Offers a layered network that extends cloud capabilities to the edge of the network, providing services such as AI model training and deployment. This provider utilizes infrastructure like Kubernetes (K8S) to support scalable, distributed computing tasks. FCP will execute tasks assigned by Market Provider, like Orchestrator on the Swan chain.

Prerequisites

Before you install the Computing Provider, you need to know there are some resources required:

Possess a public IP
Have a domain name (*.example.com)
Have an SSL certificate
Go version must 1.21+, you can refer here:

wget -c https://golang.org/dl/go1.21.7.linux-amd64.tar.gz -O - | sudo tar -xz -C /usr/local

echo "export PATH=$PATH:/usr/local/go/bin" >> ~/.bashrc && source ~/.bashrc

Install the Kubernetes

The Kubernetes version should be v1.24.0+

Install Container Runtime Environment

If you plan to run a Kubernetes cluster, you need to install a container runtime into each node in the cluster so that Pods can run there, refer to here. And you just need to choose one option to install the Container Runtime Environment

Option 1: Install the `Docker` and `cri-dockerd` （Recommended）

To install the Docker Container Runtime and the cri-dockerd, follow the steps below:

Install the Docker:
- Please refer to the official documentation from here.
Install cri-dockerd:
- cri-dockerd is a CRI (Container Runtime Interface) implementation for Docker. You can install it refer to here.

Option 2: Install the `Docker` and the`Containerd`

Install the Docker:
- Please refer to the official documentation from here.
To install Containerd on your system:
- Containerd is an industry-standard container runtime that can be used as an alternative to Docker. To install containerd on your system, follow the instructions on getting started with containerd.

Optional-Setup a docker registry server

If you are using the docker and you have only one node, the step can be skipped.

If you have deployed a Kubernetes cluster with multiple nodes, it is recommended to set up a private Docker Registry to allow other nodes to quickly pull images within the intranet.

Create a directory /docker_repo on your docker server. It will be mounted on the registry container as persistent storage for our docker registry.

sudo mkdir /docker_repo
sudo chmod -R 777 /docker_repo

Launch the docker registry container:

sudo docker run --detach \
  --restart=always \
  --name registry \
  --volume /docker_repo:/docker_repo \
  --env REGISTRY_STORAGE_FILESYSTEM_ROOTDIRECTORY=/docker_repo \
  --publish 5000:5000 \
  registry:2

Add the registry server to the node
- If you have installed the Docker and cri-dockerd(Option 1), you can update every node's configuration:
```
sudo vi /etc/docker/daemon.json
```
```
## Add the following config
"insecure-registries": ["<Your_registry_server_IP>:5000"]
```
Then restart the docker service
```
sudo systemctl restart docker
```
- If you have installed the containerd(Option 2), you can update every node's configuration:

[plugins."io.containerd.grpc.v1.cri".registry]
  [plugins."io.containerd.grpc.v1.cri".registry.mirrors]
    [plugins."io.containerd.grpc.v1.cri".registry.mirrors."<Your_registry_server_IP>:5000"]
      endpoint = ["http://<Your_registry_server_IP>:5000"]

[plugins."io.containerd.grpc.v1.cri".registry.configs]
  [plugins."io.containerd.grpc.v1.cri".registry.configs."<Your_registry_server_IP>:5000".tls]
      insecure_skip_verify = true

Then restart containerd service

sudo systemctl restart containerd

<Your_registry_server_IP>: the intranet IP address of your registry server.

Finally, you can check the installation by the command:

docker system info

Create a Kubernetes Cluster

To create a Kubernetes cluster, you can use a container management tool like kubeadm. The below steps can be followed:

Install the Network Plugin

Calico is an open-source networking and network security solution for containers, virtual machines, and native host-based workloads. Calico supports a broad range of platforms including Kubernetes, OpenShift, Mirantis Kubernetes Engine (MKE), OpenStack, and bare metal services.

To install Calico, you can follow the below steps, more information can be found here.

step 1: Install the Tigera Calico operator and custom resource definitions

kubectl create -f https://raw.githubusercontent.com/projectcalico/calico/v3.25.1/manifests/tigera-operator.yaml

step 2: Install Calico by creating the necessary custom resource

kubectl create -f https://raw.githubusercontent.com/projectcalico/calico/v3.25.1/manifests/custom-resources.yaml

step 3: Confirm that all of the pods are running with the following command

watch kubectl get pods -n calico-system

step 4: Remove the taints on the control plane so that you can schedule pods on it.

kubectl taint nodes --all node-role.kubernetes.io/control-plane-
kubectl taint nodes --all node-role.kubernetes.io/master-

If you have installed it correctly, you can see the result shown in the figure by the command kubectl get po -A

Note:

If you are a single-host Kubernetes cluster, remember to remove the taint mark, otherwise, the task can not be scheduled to it.

kubectl taint node ${nodeName}  node-role.kubernetes.io/control-plane:NoSchedule-

Install the NVIDIA Plugin

If your computing provider wants to provide a GPU resource, the NVIDIA Plugin should be installed, please follow the steps:

Install NVIDIA Driver.

Recommend NVIDIA Linux drivers version should be 470.xx+

Install NVIDIA Device Plugin for Kubernetes.

If you have installed it correctly, you can see the result shown in the figure by the command kubectl get po -n kube-system

Install the Ingress-nginx Controller

The ingress-nginx is an ingress controller for Kubernetes using NGINX as a reverse proxy and load balancer. You can run the following command to install it:

kubectl apply -f https://raw.githubusercontent.com/kubernetes/ingress-nginx/controller-v1.7.1/deploy/static/provider/cloud/deploy.yaml

Note

If you want to support the deployment of jobs with IP whitelists, you need to change the configuration of the configmap of the Ingress-nginx Controller and apply it. First download the deploy.yaml file, modify the ConfigMap resource object in the configuration file, and add a line underdata:

use-forwarded-headers: "true"

If you have installed it correctly, you can see the result shown in the figure by the command:

Run kubectl get po -n ingress-nginx

Run kubectl get svc -n ingress-nginx

Install and config the Nginx

Install Nginx service to the Server

sudo apt update
sudo apt install nginx

Add a configuration for your Domain name Assume your domain name is *.example.com

vi /etc/nginx/conf.d/example.conf

map $http_upgrade $connection_upgrade {
    default upgrade;
    ''      close;
}

server {
        listen 80;
        listen [::]:80;
        server_name *.example.com;                                           # need to your domain
        return 301 https://$host$request_uri;
        #client_max_body_size 1G;
}
server {
        listen 443 ssl;
        listen [::]:443 ssl;
        ssl_certificate  /etc/letsencrypt/live/example.com/fullchain.pem;     # need to config SSL certificate
        ssl_certificate_key  /etc/letsencrypt/live/example.com/privkey.pem;   # need to config SSL certificate

        server_name *.example.com;                                            # need to config your domain
        location / {
          proxy_pass http://127.0.0.1:<port>;  	# Need to configure the Intranet port corresponding to ingress-nginx-controller service port 80
          proxy_set_header Upgrade $http_upgrade;
          proxy_set_header Connection $connection_upgrade;
          proxy_cookie_path / "/; HttpOnly; Secure; SameSite=None";
          proxy_set_header Host $host;
          proxy_set_header X-Real-IP $remote_addr;
          proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
          proxy_set_header X-Forwarded-Proto $scheme;
       }
}

Note:
- server_name: a generic domain name
- ssl_certificate and ssl_certificate_key: certificate for https.
- proxy_pass: The port should be the Intranet port corresponding to ingress-nginx-controller service port 80
Reload the Nginx config
```
sudo nginx -s reload
```
Map your "catch-all (wildcard) subdomain(*.example.com)" to a public IP address

Install the Hardware resource-exporter

The resource-exporter plugin is developed to collect the node resource constantly, computing provider will report the resource to the Lagrange Auction Engine to match the space requirement. To get the computing task, every node in the cluster must install the plugin. You just need to run the following command:

cat <<EOF | kubectl apply -f -
apiVersion: apps/v1
kind: DaemonSet
metadata:
  namespace: kube-system
  name: resource-exporter-ds
  labels:
    app: resource-exporter
spec:
  selector:
    matchLabels:
      app: resource-exporter
  template:
    metadata:
      labels:
        app: resource-exporter
    spec:
      containers:
        - name: resource-exporter
          image: filswan/resource-exporter:v12.0.0
          imagePullPolicy: IfNotPresent
          securityContext:
            privileged: true
          volumeMounts:
            - name: machine-id
              mountPath: /etc/machine-id
              readOnly: true
      volumes:
        - name: machine-id
          hostPath:
            path: /etc/machine-id
            type: File
EOF

If you have installed it correctly, you can see the result shown in the figure by the command: kubectl get po -n kube-system

Build and config the Computing Provider

Build the Computing Provider

Firstly, clone the code to your local:

git clone https://github.com/swanchain/go-computing-provider.git
cd go-computing-provider
git checkout releases

Then build the Computing provider on the Swan Mainnet by following the below steps:

make clean && make mainnet
make install

If you want to test the CP in the testnet, please build a testnet version:
make clean && make testnet
make install

Initialize CP repo and Update Configuration

Initialize repo
```
computing-provider init --multi-address=/ip4/<YOUR_PUBLIC_IP>/tcp/<YOUR_PORT> --node-name=<YOUR_NODE_NAME>
```
Note:
- By default, the CP's repo is ~/.swan/computing, you can configure it by export CP_PATH="<YOUR_CP_PATH>"
- The CP service port (8085 by default) must be mapped to the public IP address and port

Update config.toml

Edit the necessary configuration files according to your deployment requirements.

   [API]
   Port = 8085                                    # The port number that the web server listens on
   MultiAddress = "/ip4/<public_ip>/tcp/<port>"   # The multiAddress for libp2p
   Domain = ""                                    # The domain name
   NodeName = ""                                  # The computing-provider node name
   WalletWhiteList = ""                           # CP only accepts user addresses from this whitelist for space deployment
   WalletBlackList = ""                           # CP reject user addresses from this blacklist for space deployment
   Pricing = "true"                               # default True, indicating acceptance of smart pricing orders, which may include orders priced lower than self-determined pricing.
   AutoDeleteImage = false                        # Default false, automatically delete unused images
   ClearLogDuration = 24                          # The interval for automatically clearing the log, the unit is hours
   PortRange= ["40000-40050","40070"]             # Externally exposed port number for deploying ECP image tasks

   [UBI]
   UbiEnginePk = "0xB5aeb540B4895cd024c1625E146684940A849ED9"              # UBI Engine's public key, CP only accept the task from this UBI engine
   EnableSequencer = true                                                  # Submit the proof to Sequencer service(default: true)
   AutoChainProof = true                                                   # When Sequencer doesn't have enough funds or the service is unavailable, automatically submit proof to the Swan chain 
   SequencerUrl = "https://sequencer.swanchain.io"                         # Sequencer service's API address
   EdgeUrl = "https://edge-api.swanchain.io/v1"                            # Edge service's API address
   VerifySign = true                                                       # Verify that the task signature is from Engine

   [LOG]
   CrtFile = "/YOUR_DOMAIN_NAME_CRT_PATH/server.crt"                       # Your domain name SSL .crt file path
   KeyFile = "/YOUR_DOMAIN_NAME_KEY_PATH/server.key"                       # Your domain name SSL .key file path

   [HUB]
   BalanceThreshold= 10                                                    # The cp’s collateral balance threshold
   OrchestratorPk = "0xd875bD44158208fD0FDD46729Aab6709f62C7821"           # Orchestrator's public key, CP only accept the task from this Orchestrator
   VerifySign = true                                                       # Verify that the task signature is from Orchestrator

   [MCS]
   ApiKey = ""                                   # Acquired from "https://www.multichain.storage" -> setting -> Create API Key
   BucketName = ""                               # Acquired from "https://www.multichain.storage" -> bucket -> Add Bucket
   Network = "polygon.mainnet"                   # polygon.mainnet for mainnet, polygon.mumbai for testnet

   [Registry]
   ServerAddress = ""                            # The docker container image registry address, if only a single node, you can ignore
   UserName = ""                                 # The login username, if only a single node, you can ignore
   Password = ""                                 # The login password, if only a single node, you can ignore

   [RPC]
   SWAN_CHAIN_RPC = "https://mainnet-rpc-01.swanchain.org"     # Swan chain RPC

Note:

Example [api].WalletWhiteList hosted on GitHub can be found here.
Example [api].WalletBlackList hosted on GitHub can be found here.

Initialize a Wallet and Deposit `SwanETH`

Generate a new wallet address or import the previous wallet:

computing-provider wallet new

Example output:

0x7791f48931DB81668854921fA70bFf0eB85B8211

or import your wallet:

# Import wallet using the private key
computing-provider wallet import <YOUR_PRIVATE_KEY_FILE>

Note: <YOUR_PRIVATE_KEY_FILE> is a file that contains the private key

Deposit SwanETH to the wallet address:
```
computing-provider wallet send --from <YOUR_WALLET_ADDRESS> 0x7791f48931DB81668854921fA70bFf0eB85B8211 0.01
```
Note: If you don't have SwanETH and SWAN, please follow the guideline to bridge ETH to Swan Mainnet.

Initialization CP Account

Deploy a CP account contract:

computing-provider account create --ownerAddress <YOUR_OWNER_WALLET_ADDRESS> \
	--workerAddress <YOUR_WORKER_WALLET_ADDRESS> \
	--beneficiaryAddress <YOUR_BENEFICIARY_WALLET_ADDRESS>  \
	--task-types 3

Note: --task-types: Supports 5 task types (1: Fil-C2, 2: Mining, 3: AI, 4: Inference, 5: NodePort), separated by commas. For FCP, it needs to be set to 3.

Output:

Contract deployed! Address: 0x3091c9647Ea5248079273B52C3707c958a3f2658
Transaction hash: 0xb8fd9cc9bfac2b2890230b4f14999b9d449e050339b252273379ab11fac15926

Collateral `SWAN` for FCP

 computing-provider collateral add --fcp --from <YOUR_WALLET_ADDRESS>  <amount>

Note: Please deposit enough collaterals for the tasks

Withdraw `SWAN` from FCP

 computing-provider collateral withdraw --fcp --owner <YOUR_WALLET_ADDRESS> --account <YOUR_CP_ACCOUNT> <amount>

Note: If you want to withdraw the funds from FCP, you can run the above command

Start the Computing Provider

You can run computing-provider using the following command

export CP_PATH=<YOUR_CP_PATH>
nohup computing-provider run >> cp.log 2>&1 &

[OPTIONAL] Install AI Inference Dependency

It is necessary for the Computing Provider to deploy the AI inference endpoint. But if you do not want to support the feature, you can skip it.

export CP_PATH=<YOUR_CP_PATH>
./install.sh

[OPTIONAL] Install Node-Port Dependency

Install Resource Isolation service on the k8s cluster In order to view the actual available resources of the container, you need to install a resource isolation service on the cluster.

For Ubuntu 20.04:

kubectl apply -f https://raw.githubusercontent.com/swanchain/go-computing-provider/refs/heads/releases/resource-isolation-20.04.yaml

For Ubuntu 22.04 and higher.

Edit /etc/default/grub and modify it to the following content:

   GRUB_CMDLINE_LINUX_DEFAULT="quiet splash systemd.unified_cgroup_hierarchy=0"

Update grub configuration

update-grub

Reboot the system

   reboot now

Install resource-isolation service on k8s

  kubectl apply -f https://raw.githubusercontent.com/swanchain/go-computing-provider/refs/heads/releases/resource-isolation.yaml

Install network policies

Generate Network Policy (location at $CP_PATH/network-policy.yaml )

computing-provider network generate

Deploy Network Policy

kubectl apply -f $CP_PATH/network-policy.yaml

Confirm that all of the network policy are running with the following command.

# kubectl get gnp
NAME                    CREATED AT
global-01kls78xh7dk4n   2024-09-25T04:00:59Z
global-ao9kq72mjc0sl3   2024-09-25T04:00:59Z
global-e59cad59af9c65   2024-09-25T04:00:59Z
global-pd6sdo8cjd61yd   2024-09-25T04:00:59Z
global-pod1namespace1   2024-09-25T04:01:00Z
global-s92ms87dl3j6do   2024-09-25T04:01:00Z

# kubectl get globalnetworksets
NAME                    CREATED AT
netset-2300e518e9ad45   2024-09-25T04:00:59Z

Note: The nodes for deploying CP need to open ports in the range of 30000-32767

Change the tasktypes

computing-provider account changeTaskTypes --ownerAddress <YOUR_OWNER_WALLET_ADDRESS> 3,5

Note: --task-types Supports 5 task types:

1: FIL-C2

2: Mining

3: AI

4: Inference

5: NodePort

[OPTIONAL] Config and Receive ZK Tasks

This section mainly introduces how to enable the function of receiving ZK tasks on FCP, which is equivalent to running an ECP. This function is optional. Once enabled, FCP can earn double benefits simultaneously, but it will also consume certain resources.

Step 1: Prerequisites: Perform Filecoin Commit2 (fil-c2) ZK tasks.

Download parameters (specify the path with PARENT_PATH variable):

# At least 200G storage is needed
export PARENT_PATH="<V28_PARAMS_PATH>"

# 512MiB parameters
curl -fsSL https://raw.githubusercontent.com/swanchain/go-computing-provider/releases/ubi/fetch-param-512.sh | bash

# 32GiB parameters
curl -fsSL https://raw.githubusercontent.com/swanchain/go-computing-provider/releases/ubi/fetch-param-32.sh | bash

Configure environment variables in fil-c2.env under CP repo ($CP_PATH):

FIL_PROOFS_PARAMETER_CACHE=$PARENT_PATH
RUST_GPU_TOOLS_CUSTOM_GPU="GeForce RTX 3080:8704"

Adjust the value of RUST_GPU_TOOLS_CUSTOM_GPU based on the GPU used by the CP's Kubernetes cluster for fil-c2 tasks.
For more device choices, please refer to this page:https://github.com/filecoin-project/bellperson

Step 2: Collateral `SWAN` for ZK tasks

computing-provider collateral add --ecp --from <YOUR_WALLET_ADDRESS>  <amount>

If you want to withdraw the collateral SWAN:

computing-provider collateral withdraw --ecp --owner <YOUR_WALLET_ADDRESS> --account <YOUR_CP_ACCOUNT> <amount>

Step 3: Change the `tasktypes`

computing-provider account changeTaskTypes --ownerAddress <YOUR_OWNER_WALLET_ADDRESS> 1,2,3,4

Note: --task-types Supports 5 task types:

1: FIL-C2

2: Mining

3: AI

4: Inference

5: NodePort

If you need to run FCP and ECP at the same time, you need to set it to 1,2,3,4

Step 4: Deposit `SwanETH` for Sequencer Account

computing-provider sequencer add --from <YOUR_WALLET_ADDRESS>  <amount>

If you want to Withdraw SwanETH from Sequencer Account
computing-provider sequencer withdraw --owner <YOUR_OWNER_WALLET_ADDRESS>  <amount>

Step 5: Account Management

Use computing-provider account subcommands to update CP details:

computing-provider account -h
NAME:
   computing-provider account - Manage account info of CP

USAGE:
   computing-provider account command [command options] [arguments...]

COMMANDS:
   create                    Create a cp account to chain
   changeMultiAddress        Update MultiAddress of CP (/ip4/<public_ip>/tcp/<port>)
   changeOwnerAddress        Update OwnerAddress of CP
   changeWorkerAddress       Update workerAddress of CP
   changeBeneficiaryAddress  Update beneficiaryAddress of CP
   changeTaskTypes           Update taskTypes of CP (1:Fil-C2, 2:Mining, 3: AI, 4:Inference, 5:NodePort, 100:Exit), separated by commas
   help, h                   Show a list of commands or help for one command

OPTIONS:
   --help, -h  show help

Step 6: Check the Status of ZK task

To check the ZK task list, use the following command:

computing-provider ubi list --show-failed