Welcome to Enterprise Data Wrangling with SQL and Python. This course will cover data wrangling principles and techniques for business. Key topics include data extraction, profiling, cleansing, integration, transformation, and automating data processes for business purposes. In the course, you will apply principles and techniques using data transformation tools, programming languages, and data process automation tools. The course offers you an opportunity to learn how to embed appropriate communication mechanisms for collaboration to identify and resolve real-world data challenges revealed in datasets and business processes, creating business value in today’s disparate computing and dynamic business environment.
In this module, you will learn about the structure of relational databases and how to use SQL queries for information retrieval, focusing on single-row and group functions. In the next module, you will build upon this foundation by exploring data manipulation and data joining techniques.
What's included
6 videos17 readings1 assignment
Show info about module content
6 videos•Total 17 minutes
Meet Your Faculty•2 minutes
Relational Database Data Structure•3 minutes
Unique Values •3 minutes
Constraints•5 minutes
Working with Text Values•2 minutes
TIMESTAMP •3 minutes
17 readings•Total 85 minutes
Course Introduction•2 minutes
Syllabus - Enterprise Data Wrangling with SQL and Python•10 minutes
Academic Integrity•1 minute
Data Wrangling Key Questions •2 minutes
Data Wrangling Steps•5 minutes
Relational Databases•2 minutes
Key SQL Concepts•10 minutes
Overview of Predefined Functions•2 minutes
Single-Row vs. Group Functions•10 minutes
Overview of Single-Row Functions•1 minute
Common Examples of Single-Row Functions•5 minutes
Uses of Single-Row Functions•4 minutes
Single-Row Function Example•10 minutes
Overview of Group Functions•1 minute
Common Examples of Group Functions•4 minutes
Uses of Group Functions•6 minutes
Group Function Example•10 minutes
1 assignment•Total 45 minutes
Assess Your Learning: Introduction to Data Wrangling and Relational Databases•45 minutes
Managing and Retrieving Data with SQL
Module 2•3 hours to complete
Module details
The module also highlights how effective data manipulation and joining contribute to the broader goals of data wrangling and preparation, ensuring that data is both well-organized and ready for analysis.
What's included
22 readings1 assignment
Show info about module content
22 readings•Total 146 minutes
Data Manipulation Overview•2 minutes
DML Core Operations•10 minutes
INSERT•5 minutes
Update•10 minutes
Delete•8 minutes
TCL Core Operations•1 minute
Commit•3 minutes
Rollback•5 minutes
Summary Table•10 minutes
Data Joining Overview•2 minutes
Summary Table with Notes•10 minutes
Data Joining Types Overview•10 minutes
Inner Join Overview•4 minutes
Inner Join Example•10 minutes
Join Overview•4 minutes
Left Join Example•10 minutes
Right Join Overview•4 minutes
Right Join Example•10 minutes
Full Outer Join Overview•4 minutes
Full Outer Join Example•10 minutes
Cross Join Overview•4 minutes
Cross Join Example•10 minutes
1 assignment•Total 45 minutes
Assess Your Learning: Managing and Retrieving Data with SQL•45 minutes
Data Profiling and Discovery
Module 3•1 hour to complete
Module details
In this module, you will learn how to explore datasets using Python. You’ll practice techniques to inspect dataset structure (rows, columns, and data types), and detect missing, invalid, or inconsistent data. You will also learn how to generate descriptive statistics and distribution summaries, as well as interpret profiling results to guide data cleansing and improve overall data quality.
What's included
2 videos13 readings1 assignment
Show info about module content
2 videos•Total 6 minutes
Data Profiling•3 minutes
Data Profiling Example•3 minutes
13 readings•Total 61 minutes
Data Profiling Overview•10 minutes
Discovering Data Structure Overview•2 minutes
Rows and Columns•7 minutes
Data Type•7 minutes
Non-Null Entries•7 minutes
Discovering Data Content Overview•3 minutes
Summary Statistics•4 minutes
Descriptive Statistics•4 minutes
Frequency Distribution•4 minutes
Missing Values•3 minutes
Duplicate Data•3 minutes
Incorrect or Ambiguous Data•4 minutes
Data Profiling•3 minutes
1 assignment•Total 15 minutes
Assess Your Learning: Data Profiling and Discovery•15 minutes
Data Cleansing
Module 4•1 hour to complete
Module details
By the end of this module, you will be able to apply practical data cleansing techniques to improve data quality and make your analysis more accurate and trustworthy.
Founded in 1898, Northeastern is a global research university with a distinctive, experience-driven approach to education and discovery. The university is a leader in experiential learning, powered by the world’s most far-reaching cooperative education program. The spirit of collaboration guides a use-inspired research enterprise focused on solving global challenges in health, security, and sustainability.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I purchase the Certificate?
When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.