What will I get if I subscribe to this Certificate?

When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Fine-Tuning and Evaluating Vision AI Models

This course is part of Eyes on AI - Computer Vision Engineering Professional Certificate

Instructor: Professionals from the Industry

Included with Learn more

Ask Coursera

13 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

13 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Apply transfer learning and learning-rate analysis to improve computer vision model accuracy
Evaluate model calibration, object detection metrics, and dataset annotation quality
Diagnose segmentation errors and refine model outputs using post-processing techniques

Skills you'll gain

Tools you'll learn

Model Deployment

Details to know

Shareable certificate

Add to your LinkedIn profile

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your Machine Learning expertise

This course is part of the Eyes on AI - Computer Vision Engineering Professional Certificate

When you enroll in this course, you'll also be enrolled in this Professional Certificate.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate from Coursera

There are 13 modules in this course

Building high-performing computer vision systems requires more than training a model—it requires careful evaluation, reliable predictions, and continuous refinement. In this course, you'll learn how to fine-tune and evaluate computer vision models used in real-world AI systems.

You'll begin by applying transfer learning techniques to improve model accuracy on domain-specific datasets and analyzing learning-rate schedules to understand training behavior. Next, you'll evaluate the calibration of classification models and apply post-hoc correction methods to improve prediction reliability. The course also explores data preparation and annotation practices for object detection. You'll analyze object-size distributions to configure anchor boxes and evaluate detector performance using standard metrics. Finally, you'll examine image segmentation models. You'll learn how to address class imbalance, analyze segmentation errors, and apply post-processing techniques to improve prediction quality. By the end of the course, you'll be able to evaluate, diagnose, and refine computer vision models across classification, detection, and segmentation tasks.

You’ll learn how to adapt a pre-trained ViT-B/16 model to a new domain using transfer learning. You’ll practice freezing and selectively unfreezing layers, explore how the model’s internal representations shift during fine-tuning, and document your choices in an experiment log. By the end, you’ll know how to unfreeze the final four transformer blocks, prepare your dataset effectively, and run a clean, reproducible training workflow that aligns with industry practice.

What's included

3 videos1 reading1 assignment

3 videosTotal 9 minutes

Introduction and Welcome3 minutes
Why Transfer Learning Accelerates Vision Training2 minutes
Walkthrough: Unfreezing the Final Four Transformer Blocks in Keras4 minutes

1 readingTotal 10 minutes

How ViT-B/16 Learns Features and Why Layer Unfreezing Matters10 minutes

1 assignmentTotal 15 minutes

Hands-On Activity: Fine-Tune ViT-B/16 for Retail Images and Log Experiment Decisions15 minutes

You’ll explore how learning-rate schedules shape the trajectory of model training. You’ll compare cosine decay and the one-cycle policy, analyze their signatures in training curves, and choose the schedule that maximizes validation accuracy while reducing training time. By the end, you’ll be able to interpret LR curves, diagnose plateaus or instability, and make informed decisions about training efficiency.

What's included

2 videos1 reading2 assignments

2 videosTotal 3 minutes

Why Learning-Rate Schedules Shape Convergence2 minutes
Visualizing LR Schedules & Training Curves in Keras2 minutes

1 readingTotal 10 minutes

Cosine versus One-Cycle Policies and Their Influence on Training10 minutes

2 assignmentsTotal 35 minutes

Graded Quiz: Optimize AI: Fine-Tune & Maximize Accuracy20 minutes
Hands-On Activity: Compare LR Schedules & Choose One That Improves Training Time15 minutes

You’ll assess how well a model’s predicted probabilities match real outcomes using ECE and reliability diagrams. By the end, you’ll compute calibration metrics, diagnose over/under-confidence, and apply temperature scaling to improve trust in predictions.

What's included

3 videos1 reading1 assignment

3 videosTotal 9 minutes

Introduction and Welcome3 minutes
Understanding Calibration: Metrics and Diagnostics4 minutes
Improving Calibration: Temperature Scaling in Practice3 minutes

1 readingTotal 10 minutes

How to Measure and Interpret Model Calibration10 minutes

1 assignmentTotal 15 minutes

Hands-On Activity: Calibrate a Classification Model Using ECE and Temperature Scaling15 minutes

You’ll design a serverless batch-inference workflow using AWS S3, Lambda, and DynamoDB. By the end, you will configure an end-to-end pipeline that runs a calibrated model, processes batch files, and stores predictions for analytics.

What's included

2 videos1 reading2 assignments

2 videosTotal 8 minutes

Why Serverless Pipelines Matter for Scalable AI4 minutes
Common Pitfalls in Deploying ML Pipelines4 minutes

1 readingTotal 10 minutes

Designing Batch-Inference Workflows with AWS Lambda10 minutes

2 assignmentsTotal 35 minutes

Graded Quiz: Assess Financial Deals & Manage Risk20 minutes
Hands-On Activity: Deploy a Calibrated Batch-Inference Pipeline with AWS Lambda15 minutes

You will walk through how annotation teams plan tasks, define rules, coach annotators, and measure dataset quality. You will practice reviewing examples, identifying inconsistencies, and applying a structured audit that produces a production-ready bounding-box dataset.

What's included

3 videos2 readings1 assignment

3 videosTotal 15 minutes

Why Quality Annotation Shapes Model Accuracy5 minutes
Quality-Controlled Annotation: Rules and Edge Cases5 minutes
How Teams Run a CVAT Labeling Sprint5 minutes

2 readingsTotal 20 minutes

Avoiding Common Bounding-Box Errors10 minutes
IoU Audits and Reviewer Checklists10 minutes

1 assignmentTotal 20 minutes

Hands-On Activity: Audit and Correct 20 Bounding Boxes in a Mini Sprint20 minutes

You will examine how bounding-box dimensions reveal object scales in a dataset. You will run clustering to generate three anchor sets and understand how these values shape model training and performance.

What's included

3 videos2 readings2 assignments

3 videosTotal 15 minutes

Why Anchor Boxes Matter for Detection4 minutes
Understanding Box Dimensions and Object Scale6 minutes
Generate and Insert Anchors into YOLOv5 Config5 minutes

2 readingsTotal 20 minutes

k-Means Clustering for Bounding-Box Dimensions10 minutes
Visualizing Anchor Fit and Diagnosing Mismatch10 minutes

2 assignmentsTotal 35 minutes

Graded Quiz: Bounding-Box Quality and Anchor Selection Check20 minutes
Hands-On Activity: Run k-Means and Propose Three Anchors15 minutes

You will explore why evaluation metrics matter, what mAP represents, and how metric breakdowns guide improvement decisions. You will connect evaluation to real deployment KPIs, such as accuracy targets and latency constraints.

What's included

3 videos2 readings1 assignment

3 videosTotal 8 minutes

Introduction and Welcome3 minutes
Why Evaluation Comes First in Real-Time Detection3 minutes
Interpreting mAP: What To Look For in Real Projects2 minutes

2 readingsTotal 20 minutes

Core Detection Metrics: mAP, APsmall, Precision, Recall10 minutes
Diagnosing Low AP on Small Objects10 minutes

1 assignmentTotal 20 minutes

Hands-On Activity: Compute mAP from Provided COCO-Format Predictions20 minutes

You will explore the components of real-time detection, including model selection, preprocessing, inference optimization, tracking, and system-level constraints. You will evaluate trade-offs such as accuracy vs. speed, batch size vs. latency, and resolution vs. FPS.

What's included

3 videos2 readings2 assignments

3 videosTotal 10 minutes

Choosing the Right Model for Real-Time Requirements3 minutes
Tracker Basics: DeepSORT, BYTETrack, OC-SORT2 minutes
Integrating YOLOv8 with DeepSORT in OpenCV5 minutes

2 readingsTotal 20 minutes

Where Latency Comes From: IO, Inference, NMS, and Tracking10 minutes
Benchmarking FPS and Latency on Embedded Device10 minutes

2 assignmentsTotal 40 minutes

Graded Quiz: Build & Evaluate Real-Time Object Detectors20 minutes
Hands-On Activity: Build a YOLOv8 + DeepSORT Pipeline Loop20 minutes

You will explore why class imbalance disrupts training and practice applying class-balancing strategies, including focal-dice hybrid loss, weighting, and sampling. You will work through a realistic low-foreground medical dataset scenario and monitor recall after 15 epochs.

What's included

3 videos1 reading1 assignment

You will quantify segmentation errors that arise in real deployments. Using skimage.measure, you will evaluate predicted masks and identify issues such as over-segmentation of elongated objects. You will write error logs that highlight recurring patterns.

What's included

2 videos1 reading2 assignments

You will learn how to evaluate segmentation results using metrics and visualizations. We explore IoU, Dice, class-wise breakdowns, and overlay inspections that reveal where and why your model struggles. You’ll practice generating and interpreting these outputs, just like teams diagnosing performance before deploying a model.

What's included

3 videos1 reading2 assignments

3 videosTotal 11 minutes

Welcome and Why Segmentation Evaluation Matters3 minutes
Understanding IoU, Dice, and Class-Wise Metrics4 minutes
Heat Maps in Action: Seeing Class Performance4 minutes

1 readingTotal 10 minutes

How to Read Segmentation Outputs Like a Practitioner10 minutes

2 assignmentsTotal 15 minutes

Hands-On Activity: Build Your First Class-Wise IoU Table and Heat Map10 minutes
Practice Quiz: Segmentation Metrics & Diagnostics5 minutes

You will design and test a lightweight refinement pipeline that improves segmentation quality. You will also explore CRFs, boundary smoothing, hole-filling, morphological filters, and noise cleanup. You will build a pipeline and measure before-and-after improvements.

What's included

3 videos1 reading3 assignments

3 videosTotal 13 minutes

Why Post-Processing Is a Key Part of CV Pipelines4 minutes
Smoothing, Filtering, and Boundary Refinement Techniques5 minutes
Building a Step-by-Step Refinement Workflow4 minutes

1 readingTotal 10 minutes

How CRFs Add Structure: A Simple Guide10 minutes

3 assignmentsTotal 50 minutes

Graded Quiz: Evaluate and Refine a Segmentation Model30 minutes
Hands-On Activity: Add a CRF Refiner and Measure Improvements15 minutes
Practice Quiz: Refinement & CRF Improvements5 minutes

Modern vision systems often combine multiple model components such as classification, object detection, and segmentation. Preparing these systems for production requires more than training individual models. Engineers must evaluate fine-tuning strategies, analyze model confidence behavior, assess detection performance against operational KPIs, and diagnose segmentation errors that may affect reliability. In this project, you will act as a computer vision engineer responsible for evaluating a multi-task vision system before deployment. You will analyze fine-tuning decisions, examine model calibration reliability, interpret detection metrics, diagnose segmentation weaknesses, and assess dataset quality before approving deployment readiness. The project integrates several core evaluation activities used in real-world vision engineering workflows. You will interpret training behavior to assess transfer learning strategies, analyze calibration metrics to improve prediction reliability, evaluate detection performance using task-specific KPIs, and diagnose segmentation errors through metric analysis and qualitative inspection. Rather than optimizing a single component, the project requires you to assess the entire vision pipeline and recommend coordinated improvements across tasks. Your final deliverable will be a Vision Model Evaluation & Refinement Report, a structured technical analysis that identifies weaknesses, prioritizes corrective actions, and justifies engineering decisions across classification, detection, and segmentation modules. This project mirrors real-world responsibilities of computer vision engineers who must evaluate multiple model components simultaneously and communicate a clear production-readiness recommendation to engineering and product stakeholders.

What's included

2 readings1 assignment

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Professionals from the Industry

513 Courses123,760 learners

Offered by

Coursera

Explore more from Machine Learning

Status: Free Trial
Coursera
Vision Models: Train and Evaluate
Course
Status: Free Trial
Coursera
Optimize AI: Fine-Tune & Maximize Accuracy
Course
Status: Free Trial
Coursera
Optimizing and Deploying Computer Vision Models
Course
Status: Free Trial
Coursera
Deploy & Evaluate Vision Models Effectively
Course

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Unlock access to 10,000+ courses with a subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 4,700 global companies that choose Coursera for Business

Frequently asked questions

This course is designed for learners with prior machine learning knowledge. Familiarity with neural networks and computer vision concepts will help you follow the evaluation and optimization techniques.

You'll learn how to fine-tune models, evaluate prediction reliability, analyze object detection performance, and diagnose segmentation errors to improve real-world vision systems.

The course focuses on evaluation and refinement tasks commonly performed by machine learning engineers, including model calibration, performance analysis, dataset quality checks, and system-level optimization.

To access course materials, assignments, and earn a Certificate, you'll need to purchase the Certificate experience when you enroll in a course. Eligible learners may also have the option to start with a Free Trial. Some courses may also offer a Full Course, No Certificate option. This lets you access course materials, submit required assessments, and receive a final grade, but you won't be able to earn or purchase a Certificate.

Fine-Tuning and Evaluating Vision AI Models

Fine-Tuning and Evaluating Vision AI Models

What you'll learn

Skills you'll gain

Tools you'll learn

Details to know

See how employees at top companies are mastering in-demand skills

Build your Machine Learning expertise

There are 13 modules in this course

Optimize AI: Fine-Tune & Maximize Accuracy: Fine-Tuning ViT-B/16 with Transfer Learning for Domain-Specific Datasets

What's included

Optimize AI: Fine-Tune & Maximize Accuracy: Optimizing Training with Cosine and One-Cycle Learning-Rate Schedules

What's included

Calibrate and Serve Confident AI Predictions: Evaluate and Improve Model Calibration

What's included

Calibrate and Serve Confident AI Predictions: Build and Deploy a Serverless Batch-Inference Pipeline

What's included

Annotate and Analyze Objects for Vision: Build a Clean Dataset: Quality-Controlled Bounding-Box Annotation

What's included

Annotate and Analyze Objects for Vision: Tune Detection Models: Anchor Boxes from Object-Size Clustering

What's included

Build & Evaluate Real-Time Object Detectors: Understanding Object Detection Metrics and KPIs

What's included

Build & Evaluate Real-Time Object Detectors: Designing and Integrating a Real-Time Detection Pipeline

What's included

Balance and Analyze Image Segmentation: Balancing Segmentation Data for Stable Model Training

What's included

Balance and Analyze Image Segmentation: Detecting Systematic Errors in Segmentation Masks

What's included

Refine Segmentation: Boost Your AI Vision: Measure What Matters: Evaluating Segmentation Quality

What's included

Refine Segmentation: Boost Your AI Vision: Refine and Improve: Building a Post-Processing Pipeline

What's included

Project: Vision Model Evaluation & Refinement Report

What's included

Earn a career certificate

Instructor

Offered by

Explore more from Machine Learning

Vision Models: Train and Evaluate

Optimize AI: Fine-Tune & Maximize Accuracy

Optimizing and Deploying Computer Vision Models

Deploy & Evaluate Vision Models Effectively

Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.

Unlock access to 10,000+ courses with a subscription

Advance your career with an online degree

Join over 4,700 global companies that choose Coursera for Business

Frequently asked questions

Is Fine-Tuning and Evaluating Vision AI Models suitable for beginners?

What practical skills will I gain in Fine-Tuning and Evaluating Vision AI Models?

How does Fine-Tuning and Evaluating Vision AI Models relate to real AI engineering work?

More questions