INT8

8-bit Integer Quantization

Definition

INT8 quantization stores model weights and/or activations in 8-bit integers, halving memory usage compared to FP16 with minimal accuracy loss. It is supported by NVIDIA's TensorRT, bitsandbytes, and ONNX Runtime for efficient inference.

INT8 is often considered the safe default quantization level for production deployments where some accuracy tradeoff is acceptable.

Related Terms

INT4

4-bit Integer Quantization

Open Neural Network Exchange

← Back to Glossary

Ship secure code faster

Crash Override integrates security into the developer workflow. No context switching, no waiting on reviews.

Talk to a Human See the Product