使用runner进行微调训练时,WSL提示两个错误,

RT,runner提示一个说“未找到基底模型,请尝试更好基底模型“,但我软件设置的基底模型是有的,不知道哪里出问题了。我理解WSL反馈的报错信息是下面这些,不知道有没有复制错误。

Preparing to unpack cuda-repo-wsl-ubuntu-12-2-local_12.2.0-1_amd64.deb ...
Unpacking cuda-repo-wsl-ubuntu-12-2-local (12.2.0-1) ...
dpkg-deb (subprocess): decompressing archive 'cuda-repo-wsl-ubuntu-12-2-local_12.2.0-1_amd64.deb' (size=611319782) member 'data.tar': lzma error: compressed data is corrupt
dpkg-deb: error: <decompress> subprocess returned error exit status 2
dpkg: error processing archive cuda-repo-wsl-ubuntu-12-2-local_12.2.0-1_amd64.deb (--install):
cannot copy extracted data for './var/cuda-repo-wsl-ubuntu-12-2-local/libcublas-12-2_12.2.1.16-1_amd64.deb' to '/var/cuda-repo-wsl-ubuntu-12-2-local/libcublas-12-2_12.2.1.16-1_amd64.deb.dpkg-new': unexpected end of file or stream
Errors were encountered while processing:
cuda-repo-wsl-ubuntu-12-2-local_12.2.0-1_amd64.deb
cp: cannot stat '/var/cuda-repo-wsl-ubuntu-12-2-local/cuda-*-keyring.gpg': No such file or directory

另一个提示错误是:“未安装匹配的CUDA。”,但我用 print(f"CUDA available: {torch.cuda.is_available()}") 命令返回 是CUDA available: True

RWKV_MY_TESTING
Traceback (most recent call last):
File "/mnt/d/rwkv/./finetune/lora/v5/train.py", line 308, in <module>
from src.trainer import train_callback, generate_init_weight
File "/mnt/d/rwkv/finetune/lora/v5/src/trainer.py", line 6, in <module>
from .model import LORA_CONFIG
File "/mnt/d/rwkv/finetune/lora/v5/src/model.py", line 56, in <module>
wkv5_cuda = load(
File "/usr/local/lib/python3.10/dist-packages/torch/utils/cpp_extension.py", line 1306, in load
return _jit_compile(
File "/usr/local/lib/python3.10/dist-packages/torch/utils/cpp_extension.py", line 1710, in _jit_compile
_write_ninja_file_and_build_library(
File "/usr/local/lib/python3.10/dist-packages/torch/utils/cpp_extension.py", line 1800, in _write_ninja_file_and_build_library
extra_ldflags = _prepare_ldflags(
File "/usr/local/lib/python3.10/dist-packages/torch/utils/cpp_extension.py", line 1893, in _prepare_ldflags
if (not os.path.exists(_join_cuda_home(extra_lib_dir)) and
File "/usr/local/lib/python3.10/dist-packages/torch/utils/cpp_extension.py", line 2407, in _join_cuda_home
raise OSError('CUDA_HOME environment variable is not set. '
OSError: CUDA_HOME environment variable is not set. Please set it to your CUDA install root.

用的啥显卡,建议将runner里的wsl重新安装一遍