Google Cloud
AI Infrastructure: Networking Techniques

Enjoy unlimited growth with a year of Coursera Plus for $199 (regularly $399). Save now.

Google Cloud

AI Infrastructure: Networking Techniques

Included with Coursera Plus

Gain insight into a topic and learn the fundamentals.
Intermediate level
Some related experience required
3 hours to complete
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
Intermediate level
Some related experience required
3 hours to complete
Flexible schedule
Learn at your own pace

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 6 modules in this course

This module offers an overview of the course and outlines the learning objectives.

What's included

1 plugin

This module details the specialized networking requirements for AI workloads compared to traditional web applications. It covers the specific bandwidth and latency demands of each pipeline stage—from ingestion to inference—and analyzes the "rail-aligned" network architectures of Google Cloud's A3 and A4 GPU machine types designed to maximize "Goodput."

What's included

1 assignment3 plugins

This module details strategies for efficiently moving massive datasets into the cloud. It covers the use of the Cross-Cloud Network and Cloud Interconnect to establish high-bandwidth pipelines, and outlines configuration best practices—such as enabling Jumbo Frames (MTU)—to reduce protocol overhead and optimize throughput.

What's included

1 assignment2 plugins

This module details the critical role of low-latency networking in distributed model training. It covers the necessity of Remote Direct Memory Access (RDMA) for gradient synchronization, the benefits of Google's Titanium offload architecture in freeing up CPU resources, and the topology choices required to scale clusters without bottlenecks.

What's included

1 assignment3 plugins

This module details the networking challenges specific to Generative AI inference, such as bursty traffic and long-lived connections. It covers optimizing Time-to-First-Token using the GKE Inference Gateway and "Queue Depth" routing, while also addressing best practices for network reliability and Identity and Access Management (IAM).

What's included

1 assignment5 plugins

Student PDF links to all modules

What's included

1 reading

Instructor

Google Cloud Training
Google Cloud
2,020 Courses3,779,342 learners

Offered by

Google Cloud

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions