Intro to Fine-Tuning on Modal: SFT, RL, and more
Modal
Intro to Fine-Tuning on Modal: SFT, RL, and more
57:05
Scaling Reinforcement Learning on Modal
Modal
Scaling Reinforcement Learning on Modal
34:32
Building AI agents from scratch using the OpenAI Agent SDK and Modal.
Modal
Building AI agents from scratch using the OpenAI Agent SDK and Modal.
55:16
Truly Serverless GPUs: A Deep Dive Inside Modal's Fast Cold Starts
Modal
Truly Serverless GPUs: A Deep Dive Inside Modal's Fast Cold Starts
1:17:20
Modal | Unstick your AI
Modal
Modal | Unstick your AI
1:16
Speculation is all you need: Intro to Speculative Decoding for High Performance Inference
Modal
Speculation is all you need: Intro to Speculative Decoding for High Performance Inference
40:19
Inside Modal Sandboxes: How Agents Code at Scale
Modal
Inside Modal Sandboxes: How Agents Code at Scale
54:28
How Ramp built a background coding agent that writes over half of its pull requests
Modal
How Ramp built a background coding agent that writes over half of its pull requests
21:00
High Performance LLM Inference in Production
Modal
High Performance LLM Inference in Production
1:09:32
Introducing: Modal Notebooks
Modal
Introducing: Modal Notebooks
1:19
GPU has fallen off the bus
Modal
GPU has fallen off the bus
0:49
Stockholm tech scene on fire 🔥 Modal x Lovable
Modal
Stockholm tech scene on fire 🔥 Modal x Lovable
0:25
⚡Blazing Fast LLaMA 3: Crush Latency with TensorRT LLM
Modal
⚡Blazing Fast LLaMA 3: Crush Latency with TensorRT LLM
6:51
Your Self-Hosted Chatbot Just Went Viral—Can It Handle the Traffic?
Modal
Your Self-Hosted Chatbot Just Went Viral—Can It Handle the Traffic?
6:06
Deploy DeepSeek R1 with Modal’s High-Performance AI Infrastructure
Modal
Deploy DeepSeek R1 with Modal’s High-Performance AI Infrastructure
4:15
Getting started with Modal
Modal
Getting started with Modal
11:18
How to run code on a GPU in less than 10 lines of code
Modal
How to run code on a GPU in less than 10 lines of code
4:31
Productionizing diffusion models with Modal: QArt Codes deep dive
Modal
Productionizing diffusion models with Modal: QArt Codes deep dive
53:59
Making GPUs go brrr on Modal
Modal
Making GPUs go brrr on Modal
55:12
MLOps on Modal
Modal
MLOps on Modal
36:22
Building a Stable Diffusion + LoRA image generation pipeline on Modal
Modal
Building a Stable Diffusion + LoRA image generation pipeline on Modal
13:17
Full stack web applications in pure Python with Modal & FastHTML
Modal
Full stack web applications in pure Python with Modal & FastHTML
43:55
Cloud Native Development on Modal
Modal
Cloud Native Development on Modal
38:42
Building End to End ML Applications on Modal
Modal
Building End to End ML Applications on Modal
51:09
Running a High Throughput OpenAI-Compatible vLLM Inference Server on Modal
Modal
Running a High Throughput OpenAI-Compatible vLLM Inference Server on Modal
44:31
Modal Theme Song
Modal
Modal Theme Song
0:22