OptiML

The open-source library for AI model compression

What is OptiML?

OptiML is an open-source project dedicated to maintaining and supporting state-of-the-art model compression techniques. It makes it simpler for AI developers to compress and evaluate models, either in one shot or during fine-tuning for a downstream task. It also gives researchers a platform for developing and experimenting with new compression techniques on any model.
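To make "compress and evaluate in one shot" concrete, here is a minimal sketch of the general idea using magnitude pruning on a toy linear model. It is illustrative only and does not use or represent OptiML's API; the function names (magnitude_prune, predict, mean_abs_error) and the toy data are assumptions made up for this example.

```python
# Illustrative sketch: generic one-shot magnitude pruning on a toy linear model.
# None of these names come from OptiML; they are hypothetical helpers.

def magnitude_prune(weights, sparsity):
    """Return a copy of `weights` with the smallest-magnitude entries set to 0."""
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be in [0, 1]")
    n_prune = int(len(weights) * sparsity)
    # Indices ordered by absolute weight value, smallest first.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


def predict(weights, features):
    """Dot product of a weight vector and a feature vector."""
    return sum(w * x for w, x in zip(weights, features))


def mean_abs_error(weights, samples):
    """Average absolute error of the linear model over (features, target) pairs."""
    return sum(abs(predict(weights, x) - y) for x, y in samples) / len(samples)


# Toy "dense" model and an evaluation set labeled by that same model.
dense = [0.5, -0.01, 2.0, 0.02]
inputs = [[1.0, 2.0, 3.0, 4.0], [0.5, 0.5, 0.5, 0.5], [2.0, 0.0, 1.0, 0.0]]
samples = [(x, predict(dense, x)) for x in inputs]

# Compress once (no retraining), then evaluate the compressed model.
compressed = magnitude_prune(dense, sparsity=0.5)
print("nonzero weights:", sum(1 for w in compressed if w != 0.0))
print("error after pruning:", mean_abs_error(compressed, samples))
```

The same compress-then-evaluate loop applies when compression happens during fine-tuning; the difference is that the weights keep training between compression steps so that accuracy can recover.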

Why should I use OptiML?

Improve Performance and Reduce Costs

  • Desired Accuracy with Lower Costs: Produce models that achieve the desired accuracy while significantly reducing inference costs.
  • Sustainable AI: Reduce energy consumption in support of sustainable AI practices.

Easily Integrate in MLOps Pipelines

  • Rapid Iteration: Quickly test and iterate on different compression strategies.
  • Optimized Fine-Tuning: Compress during fine-tuning to preserve task performance while lowering model complexity.

Support Any Hardware

  • Versatile Compatibility: Leverage compression gains on any hardware platform, from edge devices to cloud infrastructure.
  • Consistent Deployment: Deploy compressed models across various environments with consistent performance.

Research New Compression

  • Experiment with new approaches, evaluate them on any model, and benchmark them against the state of the art.