OptiML Blog

This blog is the official source for the updates, articles and posts from the OptiML team.

Evaluating the Impact of Compression Techniques on Performance in Large Language

July 18, 2024

By OptiML Team

Large language models (LLMs) like GPT-4, PaLM, and LLaMA have demonstrated remarkable capabilities across multitask language understanding.

Impact of Calibration Data on Compression Techniques for Large Language Models

August 13, 2024

By OptiML Team

The process of compressing large language models (LLMs) like LLaMA-2-7B involves more than just reducing the number of parameters. The choice of calibration data—used during the compression process—plays a critical role in determining how well the compressed model will perform on downstream tasks.