MIT Corporate Relations
2024 MIT R&D Conference: Track 5 - AI - Efficient Multi-modal LLM on the Edge
Conference Video | Duration: 20:30
November 19, 2024
Efficient Multi-modal LLM on the Edge
Song Han
Associate Professor, MIT Electrical Engineering & Computer Science Department
This talk presents efficient multi-modal LLM innovations based on algorithm and system co-design. I’ll first present VILA, a visual language model deployable on the edge. It is capable of visual in-context learning, multi-image reasoning, video captioning, and video QA. I’ll then cover SmoothQuant and AWQ for LLM quantization, which make VILA deployable on edge devices and bring new capabilities to mobile vision applications. Second, I’ll present StreamingLLM, a KV cache optimization technique for long conversations, and QUEST, which leverages sparsity for KV cache compression.
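As a rough illustration of the kind of KV cache policy StreamingLLM is known for (retaining a few initial "attention sink" tokens plus a sliding window of the most recent tokens), here is a minimal sketch. The function name `streaming_keep_indices` and the default sizes are assumptions made for illustration; they are not taken from the talk, and this is not the authors' implementation.

```python
def streaming_keep_indices(seq_len: int, n_sink: int = 4, window: int = 8) -> list[int]:
    """Return the token positions whose key/value entries stay in the cache.

    Hypothetical helper: keeps the first `n_sink` positions (attention sinks)
    and the last `window` positions, evicting everything in between.
    """
    if seq_len <= n_sink + window:
        # Cache budget not exceeded yet, so nothing is evicted.
        return list(range(seq_len))
    sinks = list(range(n_sink))
    recent = list(range(seq_len - window, seq_len))
    return sinks + recent


if __name__ == "__main__":
    # With 2 sink tokens and a window of 3, a 20-token history keeps 5 entries.
    print(streaming_keep_indices(20, n_sink=2, window=3))
```

Under this policy the cache size stays bounded by `n_sink + window` no matter how long the conversation runs, which is the property that makes it attractive for memory-constrained edge devices.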
More Videos From This Event
November 2024 | Conference Video | 2024 MIT R&D Conference: Day 2 Welcome & Opening Remarks
November 2024 | Conference Video | 2024 MIT R&D Conference: Innovations at Fujitsu
November 2024 | Conference Video | 2024 MIT R&D Conference: MIT Breakthrough Tech AI Overview
November 2024 | Conference Video | 2024 MIT R&D Conference: MIT Industry Research Collaboration
November 2024 | Conference Video | 2024 MIT R&D Conference: MIT J-WAFS Overview
November 2024 | Conference Video | 2024 MIT R&D Conference: MIT Microsystems Technology Laboratories Overview
November 2024 | Conference Video | 2024 MIT R&D Conference: MIT Senseable City Lab Overview
November 2024 | Conference Video | 2024 MIT R&D Conference: Panel Discussion: MTL's Impact in the Next 40 Years
November 2024 | Conference Video | 2024 MIT R&D Conference: Panel Discussion: The Making of MTL 2
November 2024 | Conference Video | 2024 MIT R&D Conference: Semiconductors, Microsystems & Workforce Development for the Tech Revln
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - 2Pi
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - Advanced Silicon Group
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - Concerto Biosciences
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - Delineate
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - Introduction
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - Jaxon
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - Lunar Station Corporation
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - Mobi Systems
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - qBraid
November 2024 | Conference Video | 2024 MIT R&D Conference: Startup Exchange Lightning Talks - Wellsite Navigator
November 2024 | Conference Video | 2024 MIT R&D Conference: The Next Generation of MTL Leaders & Innovators - Part 1
November 2024 | Conference Video | 2024 MIT R&D Conference: The Next Generation of MTL Leaders & Innovators - Part 2
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 1 - Space - Whither the Space Enterprise - A View from the Lens of Technology and Policy
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 1 - Space - Automating the Identification of Chemical Mixture Components with Machine Learning
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 1 - Space - Earth to Orbit: An Update on the Global Launch Industry
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 1 - Space - Space Security Issues in Space, Traffic Management and Space Sustainability
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 2 - Mobility - Is EV Stalling US vs China Competition
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 2 - Mobility - Optimization of Electric Vehicle Charging Stations
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 2 - Mobility - Ten Key Trends in Surface Mobility 2024
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 3 - Innovations - Designing the X Transformational Powers of Design
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 3 - Innovations - Enabling Innovation In Industry and Academia Through Digital Transformation
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 3 - Innovations - Leveraging AI to Build a Culture of Innovation
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 3 - Innovations - Sourcing Innovation: Applications to AI
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 4 - Healthcare - Bioelectronics for Brain & Body
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 4 - Healthcare - Machine-Learning-Guided Quality Control of CAR-T Therapy Product Using Microfluidic Biophysical Cytometry
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 4 - Healthcare - Neural Computation Underlying Behavior
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 4 - Healthcare - Waves, Bits, & Molecules Lab at MIT
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 5 - AI - Analog Brain Inspired Computing
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 5 - AI - Is AI Ready to Transform Chemistry & Materials Science
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 5 - AI - The Road to Digital Twins in Semiconductor Manufacturing
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 6 - Quantum 2.0 - Compiling Machine Intelligence onto (Quantum) Optoelectronic Systems
November 2024 | Conference Video | 2024 MIT R&D Conference: Track 6 - Quantum 2.0 - Quantum Computing
November 2024 | Conference Video | 2024 MIT R&D Conference: Welcome & Introduction