Data Science Linear Regression In Python

Data science, machine learning, and artificial intelligence in Python for students and professionals

Last updated 2022-01-10 | 4.7

- Derive and solve a linear regression model
- and apply it appropriately to data science problems
- Program your own version of a linear regression model in Python

What you'll learn

Derive and solve a linear regression model
and apply it appropriately to data science problems
Program your own version of a linear regression model in Python

* Requirements

* How to take a derivative using calculus
* Basic Python programming
* For the advanced section of the course
* you will need to know probability

Description

This course teaches you about one popular technique used in machine learning, data science and statistics: linear regression. We cover the theory from the ground up: derivation of the solution, and applications to real-world problems. We show you how one might code their own linear regression module in Python.

Linear regression is the simplest machine learning model you can learn, yet there is so much depth that you'll be returning to it for years to come. That's why it's a great introductory course if you're interested in taking your first steps in the fields of:

  • deep learning

  • machine learning

  • data science

  • statistics

In the first section, I will show you how to use 1-D linear regression to prove that Moore's Law is true.

What's that you say? Moore's Law is not linear?

You are correct! I will show you how linear regression can still be applied.

In the next section, we will extend 1-D linear regression to any-dimensional linear regression - in other words, how to create a machine learning model that can learn from multiple inputs.

We will apply multi-dimensional linear regression to predicting a patient's systolic blood pressure given their age and weight.

Finally, we will discuss some practical machine learning issues that you want to be mindful of when you perform data analysis, such as generalization, overfitting, train-test splits, and so on.

This course does not require any external materials. Everything needed (Python, and some Python libraries) can be obtained for FREE.

If you are a programmer and you want to enhance your coding abilities by learning about data science, then this course is for you. If you have a technical or mathematical background, and you want to know how to apply your skills as a software engineer or "hacker", this course may be useful.

This course focuses on "how to build and understand", not just "how to use". Anyone can learn to use an API in 15 minutes after reading some documentation. It's not about "remembering facts", it's about "seeing for yourself" via experimentation. It will teach you how to visualize what's happening in the model internally. If you want more than just a superficial look at machine learning models, this course is for you.

"If you can't implement it, you don't understand it"

  • Or as the great physicist Richard Feynman said: "What I cannot create, I do not understand".

  • My courses are the ONLY courses where you will learn how to implement machine learning algorithms from scratch

  • Other courses will teach you how to plug in your data into a library, but do you really need help with 3 lines of code?

  • After doing the same thing with 10 datasets, you realize you didn't learn 10 things. You learned 1 thing, and just repeated the same 3 lines of code 10 times...


Suggested Prerequisites:

  • calculus (taking derivatives)

  • matrix arithmetic

  • probability

  • Python coding: if/else, loops, lists, dicts, sets

  • Numpy coding: matrix and vector operations, loading a CSV file


WHAT ORDER SHOULD I TAKE YOUR COURSES IN?:

  • Check out the lecture "Machine Learning and AI Prerequisite Roadmap" (available in the FAQ of any of my courses, including the free Numpy course)

Who this course is for:

  • People who are interested in data science, machine learning, statistics and artificial intelligence
  • People new to data science who would like an easy introduction to the topic
  • People who wish to advance their career by getting into one of technology's trending fields, data science
  • Self-taught programmers who want to improve their computer science theoretical skills
  • Analytics experts who want to learn the theoretical basis behind one of statistics' most-used algorithms

Course content

9 sections • 54 lectures

Introduction and Outline Preview 07:41

How to Succeed in this Course Preview 05:51

Statistics vs. Machine Learning Preview 09:58

What is machine learning? How does linear regression play a role? Preview 05:13

We will discuss a broad outline of what machine learning is, and how linear regression fits into the ecosystem of machine learning. We will discuss some examples of linear regression to give you a feel for what it can be used for.

What can linear regression be used for?

Define the model in 1-D, derive the solution (Updated Version) Preview 12:43

Define the model in 1-D, derive the solution Preview 14:52

Coding the 1-D solution in Python Preview 07:38

Exercise: Theory vs. Code Preview 01:19

Determine how good the model is - r-squared Preview 05:50

R-squared in code Preview 02:15

Introduction to Moore's Law Problem Preview 02:30

Demonstrating Moore's Law in Code Preview 08:00

Moore's Law Derivation Preview 06:02

R-squared Quiz 1 Preview 01:48

Suggestion Box Preview 03:03

Define the multi-dimensional problem and derive the solution (Updated Version) Preview 09:34

Define the multi-dimensional problem and derive the solution Preview 17:07

How to solve multiple linear regression using only matrices Preview 01:55

Coding the multi-dimensional solution in Python Preview 07:29

Polynomial regression - extending linear regression (with Python code) Preview 07:56

Predicting Systolic Blood Pressure from Age and Weight Preview 05:45

R-squared Quiz 2 Preview 02:05

What do all these letters mean? Preview 06:23

Interpreting the Weights Preview 04:00

Generalization error, train and test sets Preview 02:49

Generalization and Overfitting Demonstration in Code Preview 07:32

Categorical inputs Preview 05:21

One-Hot Encoding Quiz Preview 02:07

Probabilistic Interpretation of Squared Error Preview 05:15

L2 Regularization - Theory Preview 04:21

L2 Regularization - Code Preview 04:13

The Dummy Variable Trap Preview 03:58

Gradient Descent Tutorial Preview 04:30

Gradient Descent for Linear Regression Preview 02:13

Bypass the Dummy Variable Trap with Gradient Descent Preview 04:17

L1 Regularization - Theory Preview 03:05

L1 Regularization - Code Preview 04:25

L1 vs L2 Regularization Preview 03:05

Why Divide by Square Root of D? Preview 06:32

Brief overview of advanced linear regression and machine learning topics Preview 05:14

Exercises, practice, and how to get good at this Preview 03:54

Anaconda Environment Setup Preview 20:20

How to install Numpy, Scipy, Matplotlib, Pandas, IPython, Theano, and TensorFlow Preview 17:32

How to Code by Yourself (part 1) Preview 15:54

How to Code by Yourself (part 2) Preview 09:23

Proof that using Jupyter Notebook is the same as not using it Preview 12:29

Python 2 vs Python 3 Preview 04:38

How to Succeed in this Course (Long Version) Preview 10:24

Is this for Beginners or Experts? Academic or Practical? Fast or slow-paced? Preview 22:04

Machine Learning and AI Prerequisite Roadmap (pt 1) Preview 11:18

Machine Learning and AI Prerequisite Roadmap (pt 2) Preview 16:07