SiPS 2024

4 – 6 November 2024

Keynote Speaker Song Han

Efficient Multi-modal LLM 

This talk presents efficient multi-modal LLM innovations across the full stack.   

I’ll first present VILA, a visual language model pre-training recipe beyond visual instruction tuning, enabling multi-image reasoning and in-context learning capabilities. Followed by SmoothQuant and AWQ for LLM quantization, and the TinyChat inference library.  AWQ and TinyChat enable VILA 3B deployable on Jetson Orin Nano, bringing new capabilities for mobile vision applications. Second, I’ll present efficient representation learning, including EfficientViT for high-resolution vision, accelerating SAM by 48x without performance loss; and condition-aware neural networks for adding control to diffusion models. Third, I’ll present StreamingLLM, a KV cache optimization technique for long conversation and LongLoRA, using sparse, shifted attention for long-context LLM. Finally, I’ll present PockEngine for efficient LLM fine-tuning.

Song Han is an associate professor at MIT EECS. He received his PhD degree from Stanford University. He proposed the “Deep Compression” technique including pruning and quantization that is widely used for efficient AI computing, and “Efficient Inference Engine” that first brought weight sparsity to modern AI chips, which is a top-5 cited paper in 50 years of ISCA. He pioneered the TinyML research that brings deep learning to IoT devices. His team’s recent work on large language model quantization and acceleration (SmoothQuant, AWQ, StreamingLLM)  improved the efficiency of LLM inference, adopted by NVIDIA TensorRT-LLM. Song received best paper awards at ICLR and FPGA, faculty awards from Amazon, Facebook, NVIDIA, Samsung and SONY. Song was named “35 Innovators Under 35” by MIT Technology Review, NSF CAREER Award, and Sloan Research Fellowship. Song was the cofounder of DeePhi (now part of AMD), and cofounder of OmniML (now part of NVIDIA). Song developed the EfficientML.ai course to disseminate efficient ML research.

Join our mailing list and stayed informed of SiPS 2024 Updates!