"RuntimeError: std::exception"
The libmkldnn symbolic link in the PyTorch 1.0 image conflicts with the one used by native Torch. For details, see "conv1d fails in PyTorch 1.0".
import os
os.system("rm /home/work/anaconda3/lib/libmkldnn.so")
os.system("rm /home/work/anaconda3/lib/libmkldnn.so.0")
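The same cleanup can be sketched without shelling out to rm. This is a minimal sketch, not the official fix: the helper name remove_links and the MKLDNN_LINKS list are hypothetical, the two paths are copied from the snippet above, and it assumes the links must be removed before torch is imported. Unlike os.system("rm ..."), it skips paths that are already gone and reports which ones it deleted.

```python
import os

# Paths copied from the snippet above; adjust them if your image differs.
MKLDNN_LINKS = [
    "/home/work/anaconda3/lib/libmkldnn.so",
    "/home/work/anaconda3/lib/libmkldnn.so.0",
]


def remove_links(paths):
    """Delete each path that exists and return the list of deleted paths."""
    removed = []
    for path in paths:
        # lexists() is True for a symlink even if its target is missing.
        if not os.path.lexists(path):
            continue
        try:
            os.remove(path)  # removes the link itself, not its target
            removed.append(path)
        except OSError as exc:
            # For example: permission denied, or the path is a directory.
            print(f"could not remove {path}: {exc}")
    return removed


if __name__ == "__main__":
    # Assumption: run this at the top of the training script,
    # before any "import torch" statement.
    print("removed:", remove_links(MKLDNN_LINKS))
```

Calling remove_links(MKLDNN_LINKS) a second time returns an empty list, so the cleanup is safe to leave in a script that is rerun.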
Before creating a training job, debug the training code in the ModelArts development environment to catch as many code migration errors as possible.