uCertify

Python® for Data Science for Beginners

Use Python for Data Science – because basic is for spreadsheets!

(PYTHON-DS.AE1) / ISBN : 978-1-64459-462-9

Lessons

Lab

TestPrep

Get A Free Trial

This course includes:

Free pre-assessment and first 2 lessons

23+ Interactive Lessons | 40+ Exercises

Accessible on mobile and tablet too

Certificate of completion

Are you an instructor?

Access detailed information about the course content, learning objectives, activities, and assessments before adding it to your curriculum.

About This Course

This Python for data science training course is your map, starting from the ABCs of coding and guiding you through the thrilling experience of data analysis. You’ll encounter Google Colab and Jupyter Notebook along the way. In addition, harness the strength of Numpy and Pandas for data conditioning and visualization by enrolling in this Python for Data Science course. Along the way, you’ll decode complex numbers with machine learning (ML), unearth hidden patterns, and become a professional data scientist.

Skills You’ll Get

Gain proficiency in Python coding
Learn to install and use essential Python tools
Learn data handling and processing from various data sources
Clean and condition data to maintain accuracy and reliability
Visualize data with graphs and plots, turning raw numbers into compelling stories
Apply machine learning to identify patterns and trends in data
Become proficient in using Google Colab and Jupyter Notebooks to streamline your workflow
Perform exploratory data analysis (EDA) to picture data better
Optimize models for better performance and maximum impact

Interactive Lessons

23+ Interactive Lessons 40+ Exercises | 170+ Quizzes | 76+ Flashcards | 76+ Glossary of terms

Gamified TestPrep

Hands-On Labs

30+ LiveLab | 15+ Video tutorials | 24+ Minutes

Download Course Outline

Introduction

About This Course
False Assumptions
Icons Used in This Course
Where to Go from Here

Discovering the Match between Data Science and Python

Defining the Sexiest Job of the 21st Century
Creating the Data Science Pipeline
Understanding Python’s Role in Data Science
Learning to Use Python Fast

Introducing Python’s Capabilities and Wonders

Why Python?
Working with Python
Performing Rapid Prototyping and Experimentation
Considering Speed of Execution
Visualizing Power
Using the Python Ecosystem for Data Science

Setting Up Python for Data Science

Considering the Off-the-Shelf Cross-Platform Scientific Distributions
Installing Anaconda on Windows
Installing Anaconda on Linux
Installing Anaconda on Mac OS X
Downloading the Datasets and Example Code

Working with Google Colab

Defining Google Colab
Getting a Google Account
Working with Notebooks
Performing Common Tasks
Using Hardware Acceleration
Executing the Code
Viewing Your Notebook
Sharing Your Notebook
Getting Help

Understanding the Tools

Using the Jupyter Console
Using Jupyter Notebook
Performing Multimedia and Graphic Integration

Working with Real Data

Uploading, Streaming, and Sampling Data
Accessing Data in Structured Flat-File Form
Sending Data in Unstructured File Form
Managing Data from Relational Databases
Interacting with Data from NoSQL Databases
Accessing Data from the Web

Conditioning Your Data

Juggling between NumPy and pandas
Validating Your Data
Manipulating Categorical Variables
Dealing with Dates in Your Data
Dealing with Missing Data
Slicing and Dicing: Filtering and Selecting Data
Concatenating and Transforming
Aggregating Data at Any Level

Shaping Data

Working with HTML Pages
Working with Raw Text
Using the Bag of Words Model and Beyond
Working with Graph Data

Putting What You Know in Action

Contextualizing Problems and Data
Considering the Art of Feature Creation
Performing Operations on Arrays

Getting a Crash Course in MatPlotLib

Starting with a Graph
Setting the Axis, Ticks, Grids
Defining the Line Appearance
Using Labels, Annotations, and Legends

Visualizing the Data

Choosing the Right Graph
Creating Advanced Scatterplots
Plotting Time Series
Plotting Geographical Data
Visualizing Graphs

Stretching Python’s Capabilities

Playing with Scikit-learn
Performing the Hashing Trick
Considering Timing and Performance
Running in Parallel on Multiple Cores

Exploring Data Analysis

The EDA Approach
Defining Descriptive Statistics for Numeric Data
Counting for Categorical Data
Creating Applied Visualization for EDA
Understanding Correlation
Modifying Data Distributions

Reducing Dimensionality

Understanding SVD
Performing Factor Analysis and PCA
Understanding Some Applications

Clustering

Clustering with K-means
Performing Hierarchical Clustering
Discovering New Groups with DBScan

Detecting Outliers in Data

Considering Outlier Detection
Examining a Simple Univariate Method
Developing a Multivariate Approach

Exploring Four Simple and Effective Algorithms

Guessing the Number: Linear Regression
Moving to Logistic Regression
Making Things as Simple as Naïve Bayes
Learning Lazily with Nearest Neighbors

Performing Cross-Validation, Selection, and Optimization

Pondering the Problem of Fitting a Model
Cross-Validating
Selecting Variables Like a Pro
Pumping Up Your Hyperparameters

Increasing Complexity with Linear and Nonlinear Tricks

Using Nonlinear Transformations
Regularizing Linear Models
Fighting with Big Data Chunk by Chunk
Understanding Support Vector Machines
Playing with Neural Networks

Understanding the Power of the Many

Starting with a Plain Decision Tree
Making Machine Learning Accessible
Boosting Predictions

Ten Essential Data Resources

Discovering the News with Subreddit
Getting a Good Start with KDnuggets
Locating Free Learning Resources with Quora
Gaining Insights with Oracle’s Data Science Blog
Accessing the Huge List of Resources on Data Science Central
Learning New Tricks from the Aspirational Data Scientist
Obtaining the Most Authoritative Sources at Udacity
Receiving Help with Advanced Topics at Conductrics
Obtaining the Facts of Open Source Data Science from Masters
Zeroing In on Developer Resources with Jonathan Bower

Ten Data Challenges You Should Take

Meeting the Data Science London + Scikit-learn Challenge
Predicting Survival on the Titanic
Finding a Kaggle Competition that Suits Your Needs
Honing Your Overfit Strategies
Trudging Through the MovieLens Dataset
Getting Rid of Spam E-mails
Working with Handwritten Information
Working with Pictures
Analyzing Amazon.com Reviews
Interacting with a Huge Graph

Conditioning Your Data

Checking the Version of Pandas
Creating Categorical Variables
Finding the Missing Data
Encoding Missingness
Sorting and Shuffling
Creating n-grams
Calculating TF-IDF
Modifying Graphs Using NetworkX
Creating an Adjacency Matrix Using NetworkX
Defining a Plot
Creating a Line Plot
Creating a Legend
Creating a Pie Chart
Creating a Scatterplot
Creating an Undirected Graph
Using Parallel Coordinates
Calculating Descriptive Statistics
Visualizing the Validation Curve
Visualizing a Subset of Images
Adding New Cases and Variables

Shaping Data

Extracting a Telephone Number

Putting What You Know in Action

Using Vectorization
Performing Matrix Multiplication

Stretching Python’s Capabilities

Building a Predictor

Exploring Data Analysis

Loading the Iris Dataset

Reducing Dimensionality

Creating a Numpy Array

Clustering

Understanding Centroid-Based Algorithms

Exploring Four Simple and Effective Algorithms

Using K-Nearest Neighbors and PCA

Performing Cross-Validation, Selection, and Optimization

Loading the Boston Housing Dataset

Understanding the Power of the Many

Optimizing the Depth of Decision Tree

Any questions?
Check out the FAQs

Learn everything you need to know about our beginner’s guide to Python data science.

You need to know how to write basic Python code, work with libraries like Pandas and NumPy, and understand data structures and basic algorithms.

No prerequisites needed! This Python for data science for beginners course starts from scratch, so no prior programming experience is needed. Just bring your curiosity and enthusiasm.

All you need is a fast WiFi connection, a modern browser, and a willingness to learn. We’ll guide you through everything else.

No, this Python for data science course is designed for individual study, but there are tons of online forums and communities where you can share your progress and get help if needed.

The big ones you’ll use are Pandas, NumPy, and Matplotlib. These libraries will help you handle data, perform analysis, and visualize your findings.

Related Courses

All Courses

Lab

CCNA 200-301 Pearson uCertify Network Simulator

ISBN: 9781616918378

200-301-SIMULATOR.AB1

Try

Lessons AI Tutor

Accounting Course 101

ISBN: 9781644597002

ACCOUNT-WRKBK.AE1

Try

Lessons Lab

Accounting All-in-One

ISBN: 9781644594490

ACCOUNTS.AE1

Try

Lessons TestPrep

ACCUPLACER For Beginners

ISBN: 9781644595732

ACCUPLACER.AE1

Try

Lessons TestPrep

ACT Prep 2024

ISBN: 9781644594889

ACT-PREP.AE1

Try

Lessons Lab TestPrep AI Tutor

Mastering Active Directory

ISBN: 9781644595909

ACTV-DIRECT.AJ1

Try

Lessons Lab AI Tutor

Adversarial Machine Learning

ISBN: 9798900590165

ADV-ML.AU1

Try

This course includes:

Free pre-assessment and first 2 lessons

23+ Interactive Lessons | 40+ Exercises

Accessible on mobile and tablet too

Certificate of completion

Are you an instructor?

Access detailed information about the course content, learning objectives, activities, and assessments before adding it to your curriculum.

Python® for Data Science for Beginners

Are you an instructor?