Use Core ML Tools for machine learning model compression

Discover how to reduce the footprint of machine learning models in your app with Core ML Tools. Learn how to use techniques like palettization, pruning, and quantization to dramatically reduce model size while still achieving great accuracy. Explore comparisons between compression during the training stages and on fully trained models, and learn how compressed models can run even faster when your app takes full advantage of the Apple Neural Engine.

Watch Video (25 min)

✍️

No notes available yet

Be the hero who changes that. Watch the video, jot down what matters, and open a pull request – it's genuinely quick.

Learn how to contribute