The Data Analyst Course: Complete Data Analyst Bootcamp 2023
Last updated 10/2022
MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz
Language: English | Size: 8.85 GB | Duration: 20h 55m
Last updated 10/2022
MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz
Language: English | Size: 8.85 GB | Duration: 20h 55m
Complete Data Analyst Training: Python, NumPy, Pandas, Data Collection, Preprocessing, Data Types, Data Visualization
What you'll learn
The course provides the complete preparation you need to become a data analyst
Fill up your resume with in-demand data skills: Python programming, NumPy, pandas, data preparation - data collection, data cleaning, data preprocessing, data visualization; data analysis, data analytics
Acquire a big picture understanding of the data analyst role
Learn beginner and advanced Python
Study mathematics for Python
We will teach you NumPy and pandas, basics and advanced
Be able to work with text files
Understand different data types and their memory usage
Learn how to obtain interesting, real-time information from an API with a simple script
Clean data with pandas Series and DataFrames
Complete a data cleaning exercise on absenteeism rate
Expand your knowledge of NumPy – statistics and preprocessing
Go through a complete loan data case study and apply your NumPy skills
Master data visualization
Learn how to create pie, bar, line, area, histogram, scatter, regression, and combo charts
Engage with coding exercises that will prepare you for the job
Practice with real-world data
Solve a final capstone project
Requirements
No prior experience is required. We will start from the very basics
You’ll need to install Anaconda. We will show you how to do that step by step
Description
The problemMost data analyst, data science, and coding courses miss a critical practical step. They don’t teach you how to work with raw data, how to clean, and preprocess it. This creates a sizeable gap between the skills you need on the job and the abilities you have acquired in training. Truth be told, real-world data is messy, so you need to know how to overcome this obstacle to become an independent data professional.The bootcamps we have seen online and even live classes neglect this aspect and show you how to work with ‘clean’ data. But this isn’t doing you a favour. In reality, it will set you back both when you are applying for jobs, and when you’re on the job.The solutionOur goal is to provide you with complete preparation. And this course will turn you into a job-ready data analyst. To take you there, we will cover the following fundamental topics extensively.Theory about the field of data analyticsBasic PythonAdvanced PythonNumPyPandasWorking with text filesData collectionData cleaningData preprocessingData visualizationFinal practical exampleEach of these subjects builds on the previous ones. And this is precisely what makes our curriculum so valuable. Everything is shown in the right order and we guarantee that you are not going to get lost along the way, as we have provided all necessary steps in video (not a single one skipped). In other words, we are not going to teach you how to analyse data before you know how to gather and clean it.So, to prepare you for the entry-level job that leads to a data science position - data analyst - we created The Data Analyst Course.This is a rather unique training program because it teaches the fundamentals you need on the job. A frequently neglected aspect of vital importance.Moreover, our focus is to teach topics that flow smoothly and complement each other. The course provides complete preparation for someone who wants to become a data analyst at a fraction of the cost of traditional programs (not to mention the amount of time you will save). We believe that this resource will significantly boost your chances of landing a job, as it will prepare you for practical tasks and concepts that are frequently included in interviews.The topics we will cover1. Theory about the field of data analytics2. Basic Python3. Advanced Python4. NumPy5. Pandas6. Working with text files7. Data collection8. Data cleaning9. Data preprocessing10. Data visualization11. Final practical example1. Theory about the field of data analyticsHere we will focus on the big picture. But don’t imagine long boring pages with terms you’ll have to check up in a dictionary every minute. Instead, this is where we want to define who a data analyst is, what they do, and how they create value for an organization.Why learn it?You need a general understanding to appreciate how every part of the course fits in with the rest of the content. As they say, if you know where you are going, chances are that you will eventually get there. And since data analyst and other data jobs are relatively new and constantly evolving, we want to provide you with a good grasp of the data analyst role specifically. Then, in the following chapters, we will teach you the actual tools you need to become a data analyst.2. Basic PythonThis course is centred around Python. So, we’ll start from the very basics. Don’t be afraid if you do not have prior programming experience.Why learn it?You need to learn a programming language to take full advantage of the data-rich world we live in. Unless you are equipped with such a skill, you will always be dependent on other people’s ability to extract and manipulate data, and you want to be independent while doing analysis, right? Also, you don’t necessarily need to learn many programming languages at once. It is enough to be very skilled at just one, and we’ve naturally chosen Python which has established itself as the number one language for data analysis and data science (thanks to its rich libraries and versatility).3. Advanced PythonWe will introduce advanced Python topics such as working with text data and using tools such as list comprehensions and anonymous functions.Why learn it?These lessons will turn you into a proficient Python user who is independent on the job. You will be able to use Python’s core strengths to your advantage. So, here it is not just about the topics, it is also about the depth in which we explore the most relevant Python tools.4. NumPyNumPy is Python’s fundamental package for scientific computing. It has established itself as the go-to tool when you need to compute mathematical and statical operations.Why learn it?A large portion of a data analyst’s work is dedicated to preprocessing datasets. Unquestionably, this involves tons of mathematical and statistical techniques that NumPy is renowned for. In addition, the package introduces multi-dimensional array structures and provides a plethora of built-in functions and methods to use while working with them. In other words, NumPy can be described as a computationally stable state-of-the-art Python instrument that provides flexibility and can take your analysis to the next level.5. PandasThe pandas library is one of the most popular Python tools that facilitate data manipulation and analysis. It is very valuable because you can use it to manipulate all sorts of information - numerical tables and time series data, as well as text.Why learn it?Pandas is the other main tool an analyst needs to clean and preprocess the data they are working with. Its data manipulation features are second to none in Python because of the diversity and richness it provides in terms of methods and functions. The combined ability to work with both NumPy and pandas is extremely powerful as the two libraries complement each other. You need to be capable to operate with both to produce a complete and consistent analysis independently.6. Working with text filesExchanging information with text files is practically how we exchange information today. In this part of the course, we will use the Python, pandas, and NumPy tools learned earlier to give you the essentials you need when importing or saving data.Why learn it?In many courses, you are just given a dataset to practice your analytical and programming skills. However, we don’t want to close our eyes to reality, where converting a raw dataset from an external file into a workable Python format can be a massive challenge.7. Data collectionIn the real world, you don’t always have the data readily available for you. In this part of the course, you will learn how to retrieve data from an API.Why learn it?You need to know how to source your data, right? To be a well-rounded analyst you must be able to collect data from outside sources. This is rarely a one-click process. This section aims at providing you with all the necessary tools to do that on your own.8. Data cleaningThe next logical step is to clean your data. This is where you will apply the pandas skills acquired earlier in practice. All lessons throughout the course have a real-world perspective.Why learn it?A large part of a data analyst’s job in the real world involves cleaning data and preparing it for the actual analysis. You can’t expect that you’ll deal with flawless data sources, right? So, it will be up to you to overcome this stage and clean your data.9. Data preprocessingEven when your dataset is clean and in an understandable shape, it isn’t quite ready to be processed for visualizations and analysis just yet. There is a crucial step in between, and that’s data preprocessing.Why learn it?Data preprocessing is where a data analyst can demonstrate how good or great they are at their job. This stage of the work requires the ability to choose the right statistical tool that will improve the quality of your dataset and the knowledge to implement it with advanced pandas and NumPy techniques. Only when you’ve completed this step can you say that your dataset is preprocessed and ready for the next part, which is data visualization.10. Data visualizationData visualization is the face of data. Many people look at the data and see nothing. The reason for that is that they are not creating good visualizations. Or even worse – they are creating nice graphs but cannot interpret them accurately.Why learn it?This part of the course will teach you how to use your data to produce meaningful insights. At the end of the day, data charts are what conveys the most information in the shortest amount of time. And nothing speaks better than a well crafted and meaningful data visualization.11. Practical exampleThe course contains plenty of exercises and practical cases. In the end, we have included a comprehensive practical example that will show you how everything you have learned along the way comes nicely together. This is where you will be able to appreciate how far you have come in your journey to becoming a data analyst and starting your data career.What you getA program worth $1,250Active Q&A supportAll the knowledge to become a data analystA community of aspiring data analystsA certificate of completionAccess to frequent future updatesReal-world trainingGet ready to become a data analyst from scratchWhy wait? Every day is a missed opportunity.Click the “Buy Now” button and become a part of our data analyst program today.
Overview
Section 1: Introduction to the Course
Lecture 1 A Practical Example - What Will You Learn in This Course?
Lecture 2 What Does the Course Cover?
Lecture 3 Download All Resources
Lecture 4 FAQ
Section 2: Introduction to Data Analytics
Lecture 5 Introduction to the World of Business and Data
Lecture 6 Relevant Terms Explained
Lecture 7 Data Analyst Compared to Other Data Jobs
Lecture 8 Data Analyst Job Description
Lecture 9 Why Python
Section 3: Setting up the Environment
Lecture 10 Introduction
Lecture 11 Programming Explained in a Few Minutes
Lecture 12 Jupyter - Introduction
Lecture 13 Jupyter - Installing Anaconda
Lecture 14 Jupyter - Intro to Using Jupyter
Lecture 15 Jupyter - Working with Notebook Files
Lecture 16 Jupyter - Using Shortcuts
Lecture 17 Jupyter - Handling Error Messages
Lecture 18 Jupyter - Restarting the Kernel
Section 4: Python Basics
Lecture 19 Python Variables
Lecture 20 Types of Data - Numbers and Boolean Values
Lecture 21 Types of Data - Strings
Lecture 22 Basic Python Syntax - Arithmetic Operators
Lecture 23 Basic Python Syntax - The Double Equality Sign
Lecture 24 Basic Python Syntax - Reassign Values
Lecture 25 Basic Python Syntax - Add Comments
Lecture 26 Basic Python Syntax - Line Continuation
Lecture 27 Basic Python Syntax - Indexing Elements
Lecture 28 Basic Python Syntax - Indentation
Lecture 29 Operators - Comparison Operators
Lecture 30 Operators - Logical and Identity Operators
Lecture 31 Conditional Statements - The IF Statement
Lecture 32 Conditional Statements - The ELSE Statement
Lecture 33 Conditional Statements - The ELIF Statement
Lecture 34 Conditional Statements - A Note on Boolean Values
Lecture 35 Functions - Defining a Function in Python
Lecture 36 Functions - Creating a Function with a Parameter
Lecture 37 Functions - Another Way to Define a Function
Lecture 38 Functions - Using a Function in Another Function
Lecture 39 Functions - Combining Conditional Statements and Functions
Lecture 40 Functions - Creating Functions That Contain a Few Arguments
Lecture 41 Functions - Notable Built-in Functions in Python
Lecture 42 Sequences - Lists
Lecture 43 Sequences - Using Methods
Lecture 44 Sequences - List Slicing
Lecture 45 Sequences - Tuples
Lecture 46 Sequences - Dictionaries
Lecture 47 Iteration - For Loops
Lecture 48 Iteration - While Loops and Incrementing
Lecture 49 Iteration - Create Lists with the range() Function
Lecture 50 Iteration - Use Conditional Statements and Loops Together
Lecture 51 Iteration - Conditional Statements, Functions, and Loops
Lecture 52 Iteration - Iterating over Dictionaries
Section 5: Fundamentals for Coding in Python
Lecture 53 Object-Oriented Programming (OOP)
Lecture 54 Modules, Packages, and the Python Standard Library
Lecture 55 Importing Modules
Lecture 56 Introduction to Using NumPy and pandas
Lecture 57 What is Software Documentation?
Lecture 58 The Python Documentation
Section 6: Mathematics for Python
Lecture 59 What Is а Matrix?
Lecture 60 Scalars and Vectors
Lecture 61 Linear Algebra and Geometry
Lecture 62 Arrays in Python
Lecture 63 What Is a Tensor?
Lecture 64 Adding and Subtracting Matrices
Lecture 65 Errors When Adding Matrices
Lecture 66 Transpose
Lecture 67 Dot Product of Vectors
Lecture 68 Dot Product of Matrices
Lecture 69 Why is Linear Algebra Useful
Section 7: NumPy Basics
Lecture 70 The NumPy Package and Why We Use It
Lecture 71 Installing/Upgrading NumPy
Lecture 72 Ndarray
Lecture 73 The NumPy Documentation
Lecture 74 NumPy Basics - Exercise
Section 8: Pandas - Basics
Lecture 75 Introduction to the pandas Library
Lecture 76 Installing and Running pandas
Lecture 77 Introduction to pandas Series
Lecture 78 Working with Attributes in Python
Lecture 79 Using an Index in pandas
Lecture 80 Label-based vs Position-based Indexing
Lecture 81 More on Working with Indices in Python
Lecture 82 Using Methods in Python - Part I
Lecture 83 Using Methods in Python - Part II
Lecture 84 Parameters vs Arguments
Lecture 85 the pandas Documentation
Lecture 86 Introduction to pandas DataFrames
Lecture 87 Creating DataFrames from Scratch - Part I
Lecture 88 Creating DataFrames from Scratch - Part II
Lecture 89 Additional Notes on Using DataFrames
Lecture 90 pandas Basics - Conclusion
Section 9: Working with Text Files
Lecture 91 Working with Files in Python - An Introduction
Lecture 92 File vs File Object, Read vs Parse
Lecture 93 Structured vs Semi-Structured and Unstructured Data
Lecture 94 Data Connectivity through Text Files
Lecture 95 Principles of Importing Data in Python
Lecture 96 More on Text Files (*.txt vs *.csv)
Lecture 97 Fixed-width Files
Lecture 98 Common Naming Conventions Used in Programming
Lecture 99 Importing Text Files in Python ( open() )
Lecture 100 Importing Text Files in Python ( with open() )
Lecture 101 Importing *.csv Files with pandas - Part I
Lecture 102 Importing *.csv Files with pandas - Part II
Lecture 103 Importing *.csv Files with pandas - Part III
Lecture 104 Importing Data with the "index_col" Parameter
Lecture 105 Importing Data with NumPy - .loadtxt() vs genfromtxt()
Lecture 106 Importing Data with NumPy - Partial Cleaning While Importing
Lecture 107 Importing Data with NumPy - Exercise
Lecture 108 Importing *.json Files
Lecture 109 Prelude to Working with Excel Files in Python
Lecture 110 Working with Excel Data (the *.xlsx Format)
Lecture 111 An Important Exercise on Importing Data in Python
Lecture 112 Importing Data with the pandas' "Squeeze" Method
Lecture 113 A Note on Importing Files in Jupyter
Lecture 114 Saving Your Data with pandas
Lecture 115 Saving Your Data with NumPy - np.save()
Lecture 116 Saving Your Data with NumPy - np.savez()
Lecture 117 Saving Your Data with NumPy - np.savetxt()
Lecture 118 Saving Your Data with NumPy - Exercise
Lecture 119 Working with Text Files - Conclusion
Section 10: Working with Text Data
Lecture 120 Working with Text Data and Argument Specifiers
Lecture 121 Manipulating Python Strings
Lecture 122 Using Various Python String Methods - Part I
Lecture 123 Using Various Python String Methods - Part II
Lecture 124 String Accessors
Lecture 125 Using the .format() Method
Section 11: Must-Know Python Tools
Lecture 126 Iterating Over Range Objects
Lecture 127 Nested For Loops - Introduction
Lecture 128 Triple Nested For Loops
Lecture 129 List Comprehensions
Lecture 130 Anonymous (Lambda) Functions
Section 12: Data Gathering/Data Collection
Lecture 131 What is data gathering/data collection?
Section 13: APIs (POST requests are not needed for this course)
Lecture 132 Overview of APIs
Lecture 133 GET and POST Requests
Lecture 134 Data Exchange Format for APIs: JSON
Lecture 135 Introducing the Exchange Rates API
Lecture 136 Including Parameters in a GET Request
Lecture 137 More Functionalities of the Exchange Rates API
Lecture 138 Coding a Simple Currency Conversion Calculator
Lecture 139 iTunes API
Lecture 140 iTunes API: Homework
Lecture 141 iTunes API: Structuring and Exporting the Data
Lecture 142 Pagination: GitHub API
Lecture 143 APIs: Exercise
Section 14: Data Cleaning and Data Preprocessing
Lecture 144 Data Cleaning and Data Preprocessing
Section 15: pandas Series
Lecture 145 .unique(), .nunique()
Lecture 146 Converting Series into Arrays
Lecture 147 .sort_values()
Lecture 148 Attribute and Method Chaining
Lecture 149 .sort_index()
Section 16: pandas DataFrames
Lecture 150 A Revision to pandas DataFrames
Lecture 151 Common Attributes for Working with DataFrames
Lecture 152 Data Selection in pandas DataFrames
Lecture 153 Data Selection - Indexing with .iloc[]
Lecture 154 Data Selection - Indexing with .loc[]
Lecture 155 A Few Comments on Using .loc[] and .iloc[]
Section 17: NumPy Fundamentals
Lecture 156 Indexing in NumPy
Lecture 157 Assigning Values in NumPy
Lecture 158 Elementwise Properties of Arrays
Lecture 159 Types of Data Supported by NumPy
Lecture 160 Characteristics of NumPy Functions Part 1
Lecture 161 Characteristics of NumPy Functions Part 2
Lecture 162 NumPy Fundamentals - Exercise
Section 18: NumPy DataTypes
Lecture 163 ndarrays
Lecture 164 Arrays vs Lists
Lecture 165 Strings vs Object vs Number
Lecture 166 NumPy DataTypes - Exercise
Section 19: Working with Arrays
Lecture 167 Basic Slicing in NumPy
Lecture 168 Stepwise Slicing in NumPy
Lecture 169 Conditional Slicing in NumPy
Lecture 170 Dimensions and the Squeeze Function
Lecture 171 Working with Arrays - Exercise
Section 20: Generating Data with NumPy
Lecture 172 Arrays of 0s and 1s
Lecture 173 "_like" functions in NumPy
Lecture 174 A Non-Random Sequence of Numbers
Lecture 175 Random Generators and Seeds
Lecture 176 Basic Random Functions in NumPy
Lecture 177 Probability Distributions in NumPy
Lecture 178 Applications of Random Data in NumPy
Lecture 179 Generating Data with NumPy - Exercise
Section 21: Statistics with NumPy
Lecture 180 Using Statistical Functions in NumPy
Lecture 181 Minimal and Maximal Values in NumPy
Lecture 182 Statistical Order Functions in NumPy
Lecture 183 Averages and Variance in NumPy
Lecture 184 Covariance and Correlation in NumPy
Lecture 185 Histograms in NumPy (Part 1)
Lecture 186 Histograms in NumPy (Part 2)
Lecture 187 NAN Equivalent Functions in NumPy
Lecture 188 Statistics with NumPy - Exercise
Section 22: NumPy - Preprocessing
Lecture 189 Checking for Missing Values in Ndarrays
Lecture 190 Substituting Missing Values in Ndarrays
Lecture 191 Reshaping Ndarrays
Lecture 192 Removing Values from Ndarrays
Lecture 193 Sorting Ndarrays
Lecture 194 Argument Sort in NumPy
Lecture 195 Argument Where in NumPy
Lecture 196 Shuffling Ndarrays
Lecture 197 Casting Ndarrays
Lecture 198 Striping Values from Ndarrays
Lecture 199 Stacking Ndarrays
Lecture 200 Concatenating Ndarrays
Lecture 201 Finding Unique Values in Ndarrays
Section 23: A Loan Data Example with NumPy
Lecture 202 Setting Up: Introduction to the Practical Example
Lecture 203 Setting Up: Importing the Data Set
Lecture 204 Setting Up: Checking for Incomplete Data
Lecture 205 Setting Up: Splitting the Dataset
Lecture 206 Setting Up: Creating Checkpoints
Lecture 207 Manipulating Text Data: Issue Date
Lecture 208 Manipulating Text Data: Loan Status and Term
Lecture 209 Manipulating Text Data: Grade and Sub Grade
Lecture 210 Manipulating Text Data: Verification Status & URL
Lecture 211 Manipulating Text Data: State Address
Lecture 212 Manipulating Text Data: Converting Strings and Creating a Checkpoint
Lecture 213 Manipulating Numeric Data: Substitute Filler Values
Lecture 214 Manipulating Numeric Data: Currency Change – The Exchange Rate
Lecture 215 Manipulating Numeric Data: Currency Change - From USD to EUR
Lecture 216 Completing the Dataset
Section 24: The "Absenteeism" Exercise - Introduction
Lecture 217 An Introduction to the "Absenteeism" Exercise
Lecture 218 The "Absenteeism" Exercise from a Business Perspective
Lecture 219 The Dataset
Section 25: Solution to the "Absenteeism" Exercise
Lecture 220 How to Complete the Absenteeism Exercise
Lecture 221 Eyeball Your Data First
Lecture 222 Note: Programming vs the Rest of the World
Lecture 223 Using a Statistical Approach to Solve Our Exercise
Lecture 224 Dropping the 'ID' Column
Lecture 225 Analysis of the 'Reason for Absence' Column
Lecture 226 Splitting the Reasons for Absence into Multiple Dummy Variables
Lecture 227 Working with Dummy Variables - A Statistical Perspective
Lecture 228 Grouping the Reason for Absence Columns
Lecture 229 Concatenating Columns in a pandas DataFrame
Lecture 230 Reordering Columns in a DataFrame
Lecture 231 Working on the 'Date' Column
Lecture 232 Extracting the Month Value from the 'Date' Column
Lecture 233 Creating the 'Day of the Week' Column
Lecture 234 Understanding the Meaning of 5 More Columns
Lecture 235 Modifying the 'Education' Column
Lecture 236 Final Remarks on the Absenteeism Exercise
Section 26: Data Visualization
Lecture 237 What Is Data Visualization and Why Is It Important?
Lecture 238 Why Learn Data Visualization?
Lecture 239 Choosing the Right Visualization – What Are Some Popular Approaches and Framewor
Lecture 240 Introduction into Colors and Color Theory
Lecture 241 Bar Chart - Introduction - General Theory and Getting to Know the Dataset
Lecture 242 Bar Chart - How to Create a Bar Chart Using Python
Lecture 243 Bar Chart – Interpreting the Bar Graph. How to Make a Good Bar Graph
Lecture 244 Pie Chart - Introduction - General Theory and Dataset
Lecture 245 Pie Chart - How to Create a Pie Chart Using Python
Lecture 246 Pie Chart – Interpreting the Pie Chart
Lecture 247 Pie Chart - Why You Should Never Create a Pie Graph
Lecture 248 Stacked Area Chart - Introduction - General Theory. Getting to Know the Dataset
Lecture 249 Stacked Area Chart - How to Create a Stacked Area Chart Using Python
Lecture 250 Stacked Area Chart - Interpreting the Stacked Area Graph
Lecture 251 Stacked Area Chart - How to Make a Good Stacked Area Chart
Lecture 252 Line Chart - Introduction - General Theory. Getting to Know the Dataset
Lecture 253 Line Chart - How to Create a Line Chart in Python
Lecture 254 Line Chart - Interpretation
Lecture 255 Line Chart - How to Make a Good Line Chart
Lecture 256 Histogram - Introduction - General Theory. Getting to Know the Dataset
Lecture 257 Histogram - How to Create a Histogram Using Python
Lecture 258 Histogram – Interpreting the Histogram
Lecture 259 Histogram – Choosing the Number of Bins in a Histogram
Lecture 260 Histogram - How to Make a Good Histogram
Lecture 261 Scatter Plot - Introduction - General Theory. Getting to Know the Dataset
Lecture 262 Scatter Plot - How to Create a Scatter Plot Using Python
Lecture 263 Scatter Plot – Interpreting the Scatter Plot
Lecture 264 Scatter Plot - How to Make a Good Scatter Plot
Lecture 265 Regression Plot - Introduction - General Theory. Getting to Know the Dataset
Lecture 266 Regression Plot - How to Create a Regression Scatter Plot Using Python
Lecture 267 Regression Plot – Interpreting the Regression Scatter Plot
Lecture 268 Regression Plot - How to Make a Good Regression Plot
Lecture 269 Bar and Line Chart - Introduction - General Theory. Getting to Know the Dataset
Lecture 270 Bar and Line Chart - How to Create a Combination Bar and Line Graph Using Python
Lecture 271 Bar and Line Chart – Interpreting the Combination Bar and Line Graph
Lecture 272 Bar and Line Chart – How to Make a Good Bar and Line Graph
Lecture 273 Data Visualization - Exercise
Section 27: Conclusion
Lecture 274 Conclusion
You should take this course if you want to become a Data Analyst and Data Scientist,This course is for you if you want a great career,The course is also ideal for beginners, as it starts from the fundamentals and gradually builds up your skills