Browse topics Hub · essay · articles · FAQ · glossary

Glossary · Foundations

Quantization

Reducing numeric precision in model weights to run inference cheaper and faster.

Quantization — Reducing numeric precision in model weights to run inference cheaper and faster..