Prompt Engineering for Vision Models | Product | 25/100 via “vision-task-decomposition-prompting”
A free DeepLearning.AI short course on how to prompt computer vision models with natural language, bounding boxes, segmentation masks, coordinate points, and other images.
Unique: Applies chain-of-thought and task-decomposition patterns from language-model reasoning to the vision domain, teaching how to structure visual analysis as a sequence of focused prompts instead of solving a complex task in a single pass.
vs others: Goes beyond single-prompt vision guidance to cover the emerging pattern of vision-based agents and workflows, with patterns for orchestrating multiple vision model calls on analyses that would be difficult or impossible in a single prompt.
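As a rough illustration of the decomposition pattern described above, the sketch below runs a list of focused prompts against one image and passes earlier answers into later prompts. It is not taken from the course: `decompose_and_run`, the `VisionModel` callable signature, and `fake_model` are hypothetical names introduced here, and a real vision model client would need to be wrapped to match that signature.

```python
from typing import Callable, Dict, List

# Assumed interface (hypothetical): (image_bytes, prompt) -> text answer.
VisionModel = Callable[[bytes, str], str]


def decompose_and_run(
    image: bytes, steps: List[str], model: VisionModel
) -> Dict[str, str]:
    """Run focused prompts in order, feeding earlier answers into later ones."""
    answers: Dict[str, str] = {}
    context = ""
    for step in steps:
        # The first prompt is sent as-is; later prompts carry prior findings.
        prompt = step if not context else f"{step}\n\nFindings so far:\n{context}"
        answer = model(image, prompt)
        answers[step] = answer
        context += f"- {step} -> {answer}\n"
    return answers


def fake_model(image: bytes, prompt: str) -> str:
    # Stand-in for a real vision model call, used only to make the sketch runnable.
    return f"answer to: {prompt.splitlines()[0]}"


if __name__ == "__main__":
    steps = [
        "List the objects visible in the image.",
        "Describe the spatial relationships between them.",
        "Summarize what is happening in the scene.",
    ]
    for step, answer in decompose_and_run(b"", steps, fake_model).items():
        print(step, "=>", answer)
```

Keeping the model behind a plain callable means the same loop can be pointed at any backend, and each step's prompt and answer remain inspectable when a later step goes wrong.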