AI Engineering with Modal

A hands-on guide to building and deploying scalable AI systems with Modal.
Udemy
platform
English
language
Other
category
instructor
AI Engineering with Modal
53
students
5.5 hours
content
Jul 2025
last update
$19.99
regular price

What you will learn

Build and deploy scalable AI infrastructure by defining custom container images, specifying GPU/CPU resources, and managing persistent data with Modal Volumes.

Develop a complete Automatic Speech Recognition (ASR) pipeline to transcribe long audio files in parallel using GPU-accelerated models on Modal.

Fine-tune a transformer encoder model for a text classification task on a custom dataset, leveraging Modal for GPU-powered training and experiment management.

Deploy trained machine learning models as scalable, live web APIs using Modal's built-in FastAPI endpoints for real-time inference.

Launch a high-throughput, OpenAI-compatible API for Large Language Model (LLM) inference using vLLM on Modal.

Build a High-Throughput Image Processing Pipeline and Build an Image Similarity Search Endpoint

Course Gallery

AI Engineering with Modal – Screenshot 1
Screenshot 1AI Engineering with Modal
AI Engineering with Modal – Screenshot 2
Screenshot 2AI Engineering with Modal
AI Engineering with Modal – Screenshot 3
Screenshot 3AI Engineering with Modal
AI Engineering with Modal – Screenshot 4
Screenshot 4AI Engineering with Modal

Loading charts...

6620225
udemy ID
17/05/2025
course created date
24/07/2025
course indexed date
Bot
course submited by