Calculation reported as failed, but the job actually ran without errors

Hi all,

I got the error shown below. But when I went to the working directory, the job had completed fine, with no issues or errors. There is also a lost+found directory in the main working directory. I do not understand the source of the error, since the calculation ran successfully. The other workchain failed because of this. What is a possible cause?

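2024-04-12 00:00:42 [7921 | REPORT]:       [13400|PwBaseWorkChain|report_error_handled]: PwCalculation<13403> failed with exit status 321: The XML output file could not be parsed.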
2024-04-12 00:00:42 [7922 | REPORT]:       [13400|PwBaseWorkChain|report_error_handled]: Action taken: unrecoverable error, aborting...
2024-04-12 00:00:42 [7923 | REPORT]:       [13400|PwBaseWorkChain|inspect_process]: PwCalculation<13403> failed but a handler detected an unrecoverable problem, aborting
            ├── PwBaseWorkChain<13376> Finished [0] [3:results]
            │   └── PwCalculation<13379> Finished [0]
            ├── PwBaseWorkChain<13382> Finished [0] [3:results]
            │   └── PwCalculation<13385> Finished [0]
            ├── PwBaseWorkChain<13388> Finished [0] [3:results]
            │   └── PwCalculation<13391> Finished [0]
            ├── PwBaseWorkChain<13394> Finished [0] [3:results]
            │   └── PwCalculation<13397> Finished [0]
            ├── PwBaseWorkChain<13400> Finished [300] [2:while_(should_run_process)(2:inspect_process)]
            │   └── PwCalculation<13403> Finished [321]
            ├── PwBaseWorkChain<13406> Finished [0] [3:results]
            │   └── PwCalculation<13409> Finished [0]
            ├── PwBaseWorkChain<13412> Finished [0] [3:results]
            │   └── PwCalculation<13415> Finished [0]
            ├── PwBaseWorkChain<13418> Finished [0] [3:results]
            │   └── PwCalculation<13421> Finished [0]
            ├── PwBaseWorkChain<13424> Finished [0] [3:results]
            │   └── PwCalculation<13427> Finished [0]
            └── PwBaseWorkChain<13430> Finished [0] [3:results]
                └── PwCalculation<13433> Finished [0]
2024-04-11 23:57:30 [7917 | REPORT]:       [13406|PwBaseWorkChain|on_terminated]: remote folders will not be cleaned
2024-04-12 00:00:42 [7921 | REPORT]:       [13400|PwBaseWorkChain|report_error_handled]: PwCalculation<13403> failed with exit status 321: The XML output file could not be parsed.
2024-04-12 00:00:42 [7922 | REPORT]:       [13400|PwBaseWorkChain|report_error_handled]: Action taken: unrecoverable error, aborting...
2024-04-12 00:00:42 [7923 | REPORT]:       [13400|PwBaseWorkChain|inspect_process]: PwCalculation<13403> failed but a handler detected an unrecoverable problem, aborting
2024-04-12 00:00:42 [7924 | REPORT]:       [13400|PwBaseWorkChain|on_terminated]: remote folders will not be cleaned
2024-04-12 00:45:40 [7928 | REPORT]:       [13430|PwBaseWorkChain|results]: work chain completed after 1 iterations
2024-04-12 00:45:40 [7929 | REPORT]:       [13430|PwBaseWorkChain|on_terminated]: remote folders will not be cleaned
2024-04-12 01:46:32 [7933 | REPORT]:       [13424|PwBaseWorkChain|results]: work chain completed after 1 iterations

Hi @rkarkee :wave:

Sorry for the slow response. It’s a bit hard to say what went wrong without more info. Did the pw.x output print JOB DONE at the end? Perhaps the XML file got corrupted somehow?

Could you put it online somewhere and share the link? (We don’t allow file uploads here due to limited default storage.)

Best,
Marnik

Yes, I double-checked the pw.x output: there was no error, and the job ended with the familiar-looking steps and JOB DONE at the end.

I deleted everything and re-ran the calculation, and the issue did not come back.

Since I deleted the previous run, I no longer have that data, but after deleting the files and re-running, it worked fine.

Alright, good to hear. I can only assume that somehow the XML wasn’t written correctly, maybe some hiccup from the file system.
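
If it does happen again, a quick local check before deleting anything might help narrow it down. Below is a minimal sketch using only the Python standard library: it looks for JOB DONE in the pw.x output and tests whether the XML file is at least well-formed. The function name `check_pw_outputs` and the file paths in the usage comment are assumptions for illustration, so adjust them to whatever your run actually produced. Note that this only tests well-formedness; the plugin's own parser may reject a file for other reasons.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def check_pw_outputs(stdout_path, xml_path):
    """Report whether the pw.x stdout and XML output look complete.

    Hypothetical helper, not part of AiiDA or aiida-quantumespresso.
    """
    stdout_path = Path(stdout_path)
    xml_path = Path(xml_path)
    report = {
        "job_done": False,
        "xml_exists": xml_path.is_file(),
        "xml_well_formed": False,
        "xml_error": None,
    }

    # pw.x prints "JOB DONE." near the end of a run that terminated normally.
    if stdout_path.is_file():
        text = stdout_path.read_text(errors="replace")
        report["job_done"] = "JOB DONE" in text

    # A truncated or empty XML file raises ParseError here.
    if report["xml_exists"]:
        try:
            ET.parse(str(xml_path))
            report["xml_well_formed"] = True
        except ET.ParseError as exc:
            report["xml_error"] = str(exc)

    return report


# Example call (paths are assumptions; use the ones from your working directory):
# print(check_pw_outputs("aiida.out", "out/aiida.save/data-file-schema.xml"))
```

If `job_done` is true but `xml_well_formed` is false, that would point towards the XML having been cut off or corrupted while being written, and the `xml_error` message says where the parser gave up.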

If you run into the issue again, let us know!