2024 MIT R&D Conference: Track 5 - AI - Efficient Multi-modal LLM on the Edge

Video details

Efficient Multi-modal LLM on the Edge
Song Han
Associate Professor, MIT Electrical Engineering & Computer Science Department
This talk presents efficient multi-modal LLM innovations built on algorithm and system co-design. I’ll first present VILA, a visual language model deployable on the edge that is capable of visual in-context learning, multi-image reasoning, video captioning, and video QA, followed by SmoothQuant and AWQ for LLM quantization, which make VILA deployable on edge devices and bring new capabilities to mobile vision applications. Second, I’ll present StreamingLLM, a KV cache optimization technique for long conversations, and QUEST, which leverages sparsity for KV cache compression.
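To make the KV cache idea concrete, here is a minimal sketch of a cache retention policy in the spirit of StreamingLLM. It assumes the policy keeps a few initial "sink" tokens plus a sliding window of the most recent tokens; the abstract does not spell this out, and the function name `kept_positions` and the parameters `n_sink` and `window` are hypothetical, chosen only for illustration. It is not the speaker's implementation.

```python
from typing import List


def kept_positions(seq_len: int, n_sink: int = 4, window: int = 8) -> List[int]:
    """Return the token positions whose key/value entries stay in the cache.

    Assumed policy (illustrative only): keep the first `n_sink` tokens and
    the most recent `window` tokens, and evict everything in between.
    """
    if seq_len <= n_sink + window:
        # The cache budget is not exceeded yet, so nothing is evicted.
        return list(range(seq_len))
    sinks = list(range(n_sink))                      # earliest tokens
    recent = list(range(seq_len - window, seq_len))  # sliding window
    return sinks + recent


if __name__ == "__main__":
    # With 20 tokens seen, 2 sink tokens and a window of 4,
    # the cache holds 6 entries instead of 20.
    print(kept_positions(20, n_sink=2, window=4))
```

Under this assumed policy the cache size is bounded by `n_sink + window` no matter how long the conversation runs, which is the property that makes long conversations feasible on memory-limited edge devices.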

