cuda
此处容易踩坑,如果某些项目需要非容器化部署(在本机跑),需要提前搞清楚各包和cuda版本的兼容性问题,例如PyTorch
cuda版本查看 https://developer.nvidia.com/cuda-toolkit-archive
此处采用ubuntu24安装cuda12.4
1. 开启代理,此处采用v2ray
export http_proxy="http://127.0.0.1:11809"
export https_proxy="http://127.0.0.1:11809"
2. 安装cuda 12.4
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-ubuntu2204.pin
sudo mv cuda-ubuntu2204.pin /etc/apt/preferences.d/cuda-repository-pin-600
wget https://developer.download.nvidia.com/compute/cuda/12.4.0/local_installers/cuda-repo-ubuntu2204-12-4-local_12.4.0-550.54.14-1_amd64.deb
sudo dpkg -i cuda-repo-ubuntu2204-12-4-local_12.4.0-550.54.14-1_amd64.deb
sudo cp /var/cuda-repo-ubuntu2204-12-4-local/cuda-*-keyring.gpg /usr/share/keyrings/
sudo apt-get update
sudo apt-get -y install cuda-toolkit-12-4
3. 编辑配置文件
sudo vim ~/.bashrc
export PATH=/usr/local/cuda-12.4/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda-12.4/lib64:$LD_LIBRARY_PATH
source ~/.bashrc
一定要重启才能生效
sudo reboot
查看cuda版本
nvcc --version
4. 安装驱动
sudo apt-get install -y cuda-drivers
查看驱动信息
nvidia-smi
5. 实时GPU监控
watch -n 1 nvidia-smi
卸载步骤
sudo apt-get --purge remove "cuda*"
sudo apt-get --purge remove "nvidia*"
sudo apt-get autoremove
sudo apt-get autoclean
sudo rm -rf /usr/local/cuda-12.4