Please follow the steps to complete your e-attendance
- Click this link
- Get your Student ID
- Class ID: 34764
- User Guide
Primary Requirements
- Some programming experience (e.g. C, C++, Java, QBasic (!) etc. )
- At least high school level math skills will be required.
- Passion to learn
IDE Requirements
- Most popular IDE for Data Science is Anaconda. You can download and install from here. Make sure your download Python 3.7 distribution.
F.A.Q
» I don’t have the admin permission to install any software (Don’t worry !)
- Google Colab [if you already have Google Account ]
Module 1: Python – A Quick Review
In this module, you will get a quick review on Python Language. We will not going in depth but we will try to discuss some important components of Python Language. Please note, this is not meant to be a comprehensive overview of Python or programming in general
Hands-on : Environment Setup and Jupyter Notebook Intro.
Hands-on : Python Code Along
Hands-on : Python Review Exercise
Module 2: Python for Data Analysis ( Pandas )
Pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.
Hands-on : Using Python Pandas Library
Module 3: Data Visualization/EDA/Data Analysis ( Seaborn)
In this part of the course we will discuss methods of descriptive statistics. You will learn what cases and variables are and how you can compute measures of central tendency (mean, median and mode) and dispersion (standard deviation and variance). Next, we discuss how to assess relationships between variables, and we introduce the concepts correlation and regression.
Hands-on : Using Python Seaborn Visualization Library
Module 4: Data Analytics / Machine Learning
In this part of the course we will discuss one of the best known Machine Learning Library Scikit-Learn, a package that provides efficient versions of a large number of common algorithms. Scikit-Learn is characterized by a clean, uniform, and streamlined API, as well as by very useful and complete online documentation. A benefit of this uniformity is that once you understand the basic use and syntax of Scikit-Learn for one type of model, switching to a new model or algorithm is very straightforward.
Hands-on : Using Python scikit-learn Library
Presentation File:
Azure ML Cheat Sheet
Algorithm Summary
Source: http://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Related Materials:
- Data Concept
- To know more about Data Concept you can click [this] link.
- ML Performance Metrics:
- AzureML End-to-End Lecture Series
Regression Performance Matrix
Classification Performance Matrix
Open Jupyter Notebook
- Please follow this link to know about how to open Jupyter Notebook from your Local Machine (in your specified directory)
- For a complete User Manual check out the Bryn Mawr College Computer Science Guide.
- Google Colab Notebook (Python Intro)
- Exercise Zip
- Click here (redirect to GDrive)
- Exercise Solution
Code Along for Python Pandas (Goolge Colab link)
Link: https://colab.research.google.com/drive/17WecyD7c79Go950fpo2T33KmghetQnsn?usp=sharing
Dataset: Airplane_Crashes_and_Fatalities_Since_1908.csv
Exercise: 1 for Pandas
- Download .ipynb file and dataset from here [1. Pandas-Exercises]( It’s a zip file . You need to Unzip and use)
- Solution Colab Link is here
Exercise: 2 for Pandas
- Download .ipynb file and data from here [2. Ecommerce Purchases] ( It’s a zip file . You need to Unzip and use)
- Solution Colab Link is here
Code Along for Python Seaborn (Goolge Colab link)
Link: https://colab.research.google.com/drive/1G7yvnKqoUgmaZxsIR4CcUK_3aroX9lFT?usp=sharing
Exercise: 1 for Seaborn
- Download .ipynb file from here [Seaborn Exercises] ( It’s a zip file . You need to Unzip and use)
- Solution Colab Link is here
Exercise: 2 (Capstone)
Code Along for Python Machine Learning – Sklearn (Goolge Colab link)
Link: https://colab.research.google.com/drive/1mUjlRy2eIskEKIpAz2MCg9ykgCcV_mvx?usp=sharing
Dataset:
Code Along Machine Learning End to End :
Link: https://colab.research.google.com/drive/1Ai5tJYi5mCKfovr82q4nXHooG2YcLmxt?usp=sharing
Dataset: WA_Fn-UseC_-Telco-Customer-Churn
Data Labeling:
Build object detection models using Detecto
Code Along: