AI Engineering with Modal
A hands-on guide to building and deploying scalable AI systems with Modal.

53
students
5.5 hours
content
Jul 2025
last update
$19.99
regular price
What you will learn
Build and deploy scalable AI infrastructure by defining custom container images, specifying GPU/CPU resources, and managing persistent data with Modal Volumes.
Develop a complete Automatic Speech Recognition (ASR) pipeline to transcribe long audio files in parallel using GPU-accelerated models on Modal.
Fine-tune a transformer encoder model for a text classification task on a custom dataset, leveraging Modal for GPU-powered training and experiment management.
Deploy trained machine learning models as scalable, live web APIs using Modal's built-in FastAPI endpoints for real-time inference.
Launch a high-throughput, OpenAI-compatible API for Large Language Model (LLM) inference using vLLM on Modal.
Build a High-Throughput Image Processing Pipeline and Build an Image Similarity Search Endpoint
Course Gallery




Loading charts...
6620225
udemy ID
17/05/2025
course created date
24/07/2025
course indexed date
Bot
course submited by