How do I ensure that all the operators in my model are running on the HPU, instead of the Host CPU?

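Not every operator can be placed on the HPU: some operators always run on the HPU and others always run on the Host CPU. To check where a given operator runs, see the Model Performance Optimization Guide, which explains how to dump logs that show whether an operator ran on the CPU or on the HPU.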
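
Once the logs are dumped, a small script can tally which operators landed on which device. The sketch below is only an illustration: the line format it parses (`op=<name> device=<hpu|cpu>`) and the helper name `summarize_placement` are assumptions made for this example, not the actual log format, which is defined in the Model Performance Optimization Guide. Adjust the pattern to match the real output.

```python
import re
from collections import defaultdict

# ASSUMPTION: hypothetical log line format, e.g. "op=aten::add device=hpu".
# The real format comes from the Model Performance Optimization Guide.
LINE_PATTERN = re.compile(
    r"op=(?P<op>\S+)\s+device=(?P<device>hpu|cpu)", re.IGNORECASE
)


def summarize_placement(log_text):
    """Group operator names by the device they were reported on."""
    placement = defaultdict(set)
    for line in log_text.splitlines():
        match = LINE_PATTERN.search(line)
        if match:  # lines that do not match the pattern are ignored
            placement[match.group("device").lower()].add(match.group("op"))
    # Sort the names so the summary is stable between runs.
    return {device: sorted(ops) for device, ops in placement.items()}


if __name__ == "__main__":
    # Made-up sample text in the hypothetical format above.
    sample = "op=aten::add device=hpu\nop=aten::digamma device=cpu\n"
    for device, ops in summarize_placement(sample).items():
        print(device, ops)
```

Any operators listed under `cpu` in the summary are the ones to investigate further.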