Model Checkpoint Compression With OpenZL
Lightning Talk: Model Checkpoint Compression With OpenZL - Nick Terrell & Teja Rao, Meta Introducing OpenZL, an innovative open-source lossless compression framework optimized for structured data types. Traditional lossless compression methods are ineffective at compressing tensors because they operate at the byte level rather than at the tensor level. OpenZL leverages type information to achieve remarkable size reductions—33% for bfloat16, 21% for float32, and 15% for float16—with best-in-class performance. Deployed at scale within Meta Platforms Inc., we observed a 15% reduction in checkpoint overhead, saving thousands of GPUs, and a 17% reduction in storage, saving more than a hundred petabytes. Unlike lossy compression methods like quantization, which can affect model quality, OpenZL maintains model quality and is easy to deploy at scale. We are exploring new AI use cases with OpenZL, leveraging its powerful compression capabilities in areas such as logits and image and video embeddings.