Training makes heavy use of floating-point operations for most applications, but for inference operations many network weights can be represented with considerably less precision than their original ...