Many issues can cause a model to crash. For example, running out of device memory can cause a crash, in which case an out-of-memory error message such as “RuntimeError: synDeviceMalloc failed to allocate 9932111872 bytes” may be seen. Users can try reducing the batch size to work around the crash. Kernel fusion can also cause a crash; in this case, users can try setting the environment variable RUN_TPC_FUSER=false to help isolate the problem. Users can also post their issues to the Forum, including all the details of the model run and any error messages.
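The two workarounds above can be sketched in Python as follows. This is a minimal illustration, not an official recipe: the helper names (`disable_tpc_fuser`, `is_device_oom`, `run_with_smaller_batches`) and the `train_step` callback are hypothetical, the out-of-memory check simply matches the error text quoted above, and it is assumed that RUN_TPC_FUSER must be set before the framework initializes the device (exporting it in the shell before launching the script should be equivalent).

```python
import os


def disable_tpc_fuser():
    """Set RUN_TPC_FUSER=false to help isolate kernel-fusion crashes.

    Assumption: call this before importing the deep learning framework,
    so the variable is already set when the device runtime starts.
    """
    os.environ["RUN_TPC_FUSER"] = "false"


def is_device_oom(err):
    """Heuristic check based on the allocation-failure text quoted above."""
    return isinstance(err, RuntimeError) and "failed to allocate" in str(err)


def run_with_smaller_batches(train_step, batch_size, min_batch_size=1):
    """Call train_step(batch_size), halving batch_size after each device OOM.

    train_step is a hypothetical user-supplied callable that runs one step
    (or one full run) of the model with the given batch size.
    Returns the batch size that worked and the result of train_step.
    """
    while True:
        try:
            return batch_size, train_step(batch_size)
        except RuntimeError as err:
            # Re-raise anything that is not an out-of-memory error, or when
            # the batch size cannot be reduced any further.
            if not is_device_oom(err) or batch_size <= min_batch_size:
                raise
            batch_size = max(min_batch_size, batch_size // 2)
            print(f"Device out of memory; retrying with batch_size={batch_size}")
```

If the crash disappears with the fuser disabled, that points toward kernel fusion as the cause and is a useful detail to include when reporting the issue.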