This repository contains experiments with, and implementations of, various model compression techniques. It explores different approaches to reducing model size, inference latency, memory footprint, and carbon footprint while maintaining model performance.