-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"initialization error" on every operation #10
Comments
Based on the strace output it looks like CUDA is more than likely running as a subprocess of what you're trying to checkpoint. If you use the --get-restore-tid flag it should return a thread ID if the target pid is a CUDA process or an error otherwise. You can also do a similar check with --get-state it should return running|locked|checkpointed if CUDA is running or complain similarly otherwise. Full process tree support is currently being worked on but available as a PR on the main CRIU github if you'd like to try that : checkpoint-restore/criu#2416 |
I also encountered this error during restoration. Is there any BKM to debug it?
|
Can you provide a sample app and the calls to cuda-checkpoint for me to take a look at? During restore was cuda-checkpoint also run as root? |
I'm using Debian 12 with 2 1080ti.
When I want to toggle or make a checkpoint of Mathematica and Microsoft Edge it complains 'Could not checkpoint on process ID 17146: "initialization error"'
strace output
The text was updated successfully, but these errors were encountered: