Python® for Data Science for Beginners

uCertify PYTHON-DS.AE1

ISBN: 978-1-64459-462-9

This course includes:

- Lessons
- TestPrep
- Hands-On Labs

23+ Lessons | 40+ Exercises | 170+ Quizzes | 76+ Flashcards | 76+ Glossary terms

30+ LiveLabs | 15+ Video tutorials | 24+ Minutes

1

- About This Course
- False Assumptions
- Icons Used in This Course
- Where to Go from Here

2

- Defining the Sexiest Job of the 21st Century
- Creating the Data Science Pipeline
- Understanding Python’s Role in Data Science
- Learning to Use Python Fast

3

- Why Python?
- Working with Python
- Performing Rapid Prototyping and Experimentation
- Considering Speed of Execution
- Visualizing Power
- Using the Python Ecosystem for Data Science

4

- Considering the Off-the-Shelf Cross-Platform Scientific Distributions
- Installing Anaconda on Windows
- Installing Anaconda on Linux
- Installing Anaconda on Mac OS X
- Downloading the Datasets and Example Code

5

- Defining Google Colab
- Getting a Google Account
- Working with Notebooks
- Performing Common Tasks
- Using Hardware Acceleration
- Executing the Code
- Viewing Your Notebook
- Sharing Your Notebook
- Getting Help

6

- Using the Jupyter Console
- Using Jupyter Notebook
- Performing Multimedia and Graphic Integration

7

- Uploading, Streaming, and Sampling Data
- Accessing Data in Structured Flat-File Form
- Sending Data in Unstructured File Form
- Managing Data from Relational Databases
- Interacting with Data from NoSQL Databases
- Accessing Data from the Web

8

- Juggling between NumPy and pandas
- Validating Your Data
- Manipulating Categorical Variables
- Dealing with Dates in Your Data
- Dealing with Missing Data
- Slicing and Dicing: Filtering and Selecting Data
- Concatenating and Transforming
- Aggregating Data at Any Level

9

- Working with HTML Pages
- Working with Raw Text
- Using the Bag of Words Model and Beyond
- Working with Graph Data

10

- Contextualizing Problems and Data
- Considering the Art of Feature Creation
- Performing Operations on Arrays

11

- Starting with a Graph
- Setting the Axis, Ticks, Grids
- Defining the Line Appearance
- Using Labels, Annotations, and Legends

12

- Choosing the Right Graph
- Creating Advanced Scatterplots
- Plotting Time Series
- Plotting Geographical Data
- Visualizing Graphs

13

- Playing with Scikit-learn
- Performing the Hashing Trick
- Considering Timing and Performance
- Running in Parallel on Multiple Cores

14

- The EDA Approach
- Defining Descriptive Statistics for Numeric Data
- Counting for Categorical Data
- Creating Applied Visualization for EDA
- Understanding Correlation
- Modifying Data Distributions

15

- Understanding SVD
- Performing Factor Analysis and PCA
- Understanding Some Applications

16

- Clustering with K-means
- Performing Hierarchical Clustering
- Discovering New Groups with DBSCAN

17

- Considering Outlier Detection
- Examining a Simple Univariate Method
- Developing a Multivariate Approach

18

- Guessing the Number: Linear Regression
- Moving to Logistic Regression
- Making Things as Simple as Naïve Bayes
- Learning Lazily with Nearest Neighbors

19

- Pondering the Problem of Fitting a Model
- Cross-Validating
- Selecting Variables Like a Pro
- Pumping Up Your Hyperparameters

20

- Using Nonlinear Transformations
- Regularizing Linear Models
- Fighting with Big Data Chunk by Chunk
- Understanding Support Vector Machines
- Playing with Neural Networks

21

- Starting with a Plain Decision Tree
- Making Machine Learning Accessible
- Boosting Predictions

22

- Discovering the News with Subreddit
- Getting a Good Start with KDnuggets
- Locating Free Learning Resources with Quora
- Gaining Insights with Oracle’s Data Science Blog
- Accessing the Huge List of Resources on Data Science Central
- Learning New Tricks from the Aspirational Data Scientist
- Obtaining the Most Authoritative Sources at Udacity
- Receiving Help with Advanced Topics at Conductrics
- Obtaining the Facts of Open Source Data Science from Masters
- Zeroing In on Developer Resources with Jonathan Bower

23

- Meeting the Data Science London + Scikit-learn Challenge
- Predicting Survival on the Titanic
- Finding a Kaggle Competition that Suits Your Needs
- Honing Your Overfit Strategies
- Trudging Through the MovieLens Dataset
- Getting Rid of Spam E-mails
- Working with Handwritten Information
- Working with Pictures
- Analyzing Amazon.com Reviews
- Interacting with a Huge Graph

Hands-On Labs

7

- Checking the Version of pandas
- Creating Categorical Variables
- Finding the Missing Data
- Encoding Missingness
- Sorting and Shuffling
- Creating n-grams
- Calculating TF-IDF
- Modifying Graphs Using NetworkX
- Creating an Adjacency Matrix Using NetworkX
- Defining a Plot
- Creating a Line Plot
- Creating a Legend
- Creating a Pie Chart
- Creating a Scatterplot
- Creating an Undirected Graph
- Using Parallel Coordinates
- Calculating Descriptive Statistics
- Visualizing the Validation Curve
- Visualizing a Subset of Images
- Adding New Cases and Variables

8

- Extracting a Telephone Number

9

- Using Vectorization
- Performing Matrix Multiplication

12

- Building a Predictor

13

- Loading the Iris Dataset

14

- Creating a NumPy Array

15

- Understanding Centroid-Based Algorithms

17

- Using K-Nearest Neighbors and PCA

18

- Loading the Boston Housing Dataset

20

- Optimizing the Depth of a Decision Tree