All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
Faster LLMs: Accelerate Inference with Speculative Decoding
9 months ago
ibm.com
How to Quadruple LLM Decoding Performance with Speculative Dec
…
Aug 1, 2024
qualcomm.com
0:18
Introducing LM Studio 0.3.10 with 🔮 Speculative Decoding!It's an LLM i
…
10 views
Feb 19, 2025
linkedin.com
Speculative Decoding — Think Fast⚡, Then Think Right✅
10 months ago
substack.com
6:18
What is Speculative Sampling? | Boosting LLM inference speed
3.8K views
Nov 20, 2024
YouTube
AssemblyAI
14:37
Understanding Speculative Decoding: Boosting LLM Efficienc
…
374 views
11 months ago
YouTube
MLWorks
8:44
How to PROPERLY Use Speculative Decoding in LM Studio to DOUBL
…
2 views
3 weeks ago
YouTube
AsapGuide
11:34
Generate 10 Tokens At Once - Faster LLM INFERENCE - AdaSPE
…
464 views
4 months ago
YouTube
Vuk Rosić
29:48
Lossless LLM inference acceleration with Speculators
478 views
3 months ago
YouTube
Red Hat
0:46
Speculative Decoding Turbocharge Your LLM Inference! #ai, #llm, #inf
…
25 views
1 month ago
YouTube
The Code Architect
7:40
Speculative Decoding: 3× Faster LLM Inference with Zero Quality L
…
271 views
2 months ago
YouTube
Tales Of Tensors
2:30
SpecView: An Interactive Visualization System for Speculati
…
1 views
6 days ago
YouTube
nguyenlab
7:39
[LLM 原理] 高效推理 Speculative Decoding 投机探测采样
4K views
8 months ago
bilibili
五道口纳什
54:05
LLMs | Efficient LLM Decoding-I | Lec15.1
2.3K views
Oct 4, 2024
YouTube
LCS2
22:36
MASSIVELY speed up local AI models with Speculative Decodin
…
19.6K views
Mar 5, 2025
YouTube
GosuCoder
2:27:59
COLING 2025 Tutorial: Speculative Decoding for Efficient LLM Inference
390 views
Jan 23, 2025
bilibili
云安Ann
7:06
The Secret to Faster LLMs: How Speculative Decoding Works
7 views
2 months ago
YouTube
Zaharah
7:00
Speculative Decoding with OpenVINO | Intel Software
196.9K views
7 months ago
YouTube
Intel Software
1:06
This Trick Makes LLMs 2X Faster
504 views
1 week ago
YouTube
OpenCV University
24:17
Fast Inference from Transformers via Speculative Decoding
1.2K views
Sep 12, 2023
YouTube
Arxiv Papers
9:39
Faster LLMs: Accelerate Inference with Speculative Decoding
20.9K views
9 months ago
YouTube
IBM Technology
1:08:32
LLM推理加速新范式!推测解码(Speculative Decoding)最新综述
3.2K views
Mar 2, 2024
bilibili
NICE学术
5:56
NVIDIA TiDAR: 5.9x Faster LLM Inference! Diffusion Speed, AR Qu
…
225 views
3 months ago
YouTube
PaperLens
37:34
Speculative Decoding Explained
7.7K views
Dec 21, 2023
YouTube
Trelis Research
17:56
Behind the Stack, Ep 11 - Speculative Decoding
63 views
4 months ago
YouTube
Doubleword
12:46
Speculative Decoding: When Two LLMs are Faster than One
26.1K views
Oct 12, 2023
YouTube
Efficient NLP
12:42
Fast Inference from Transformers via Speculative Decoding
134 views
Nov 5, 2024
YouTube
AI Papers Podcast Daily
3:42
AdaSPEC: Selective KD for Faster LLM Spec Decoding
6 views
2 months ago
YouTube
AI Research Roundup
0:36
How AI Replies So Fast! ⚡ Speculative Decoding
130 views
2 months ago
YouTube
Mr. Doubty – Short. Smart. Techy
6:53
How Speculative Decoding Makes LLMs 2.5x Faster (The Secret to F
…
121 views
5 months ago
YouTube
FranksWorld of AI
See more videos
More like this
Feedback