Snowflake Cortex Masterclass 2024 Hands-On!
Published 5/2024
MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHz
Language: English | Size: 14.58 GB | Duration: 20h 15m
Published 5/2024
MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHz
Language: English | Size: 14.58 GB | Duration: 20h 15m
by World-Class Snowflake Expert, former Data Superhero and SnowPro Certification SME
What you'll learn
Everything about Snowflake Cortex, the new AI & ML platform from Snowflake
How to implement end-to-end ML pipelines using both Snowpark and Snowpark ML
How to develop ML experiments with Snowflake using notebooks and code snippets
How to use the ML-powered classes and functions from Snowflake Cortex
How to call the new LLM functions from Snowflake Cortex
How to use Snowflake Copilot and other super-new LLM UI features in Snowsight
How to integrate Snowflake with ChatGPT using the OpenAI REST API
How to use Snowpark over in-memory Pandas DataFrames
Requirements
Basic knowledge of Snowflake
Basic knowledge of programming in SQL
Basic knowledge of programming in Python
Basic knowledge of Data Science and Machine Learning
Basic knowledge of Deep Learning and Transformers
Basic knowledge of LLMs (like ChatGPT) and their use cases
Optional knowledge of Streamlit
Optional knowledge of Time Series
Description
What is Snowflake CortexSnowflake got heavily involved into AI and ML only in the past two years. I know, because I've been around since the beginning. In Jan 2021 I was selected by them as a "Snowflake Data Superhero". And in my last two years alone I passed many DS and ML certification exams (see below).In mid-2023 they came up with Snowpark ML. Then with some built-in ML-powered functions, about regression and classification. The new Model Registry from Snowpark ML - added in Jan 2024 - allows now end-to-end ML on the platform.You'll see side-by-side data science experiments I will present to you "the old way", on Snowflake. Many of them with integrations with ChatGPT (about which I talked in detail in another course of mine). To compare them now with the new LLM functions, as Snowflake decided to host their own Large Language Models.Most of these (and other features yet to come) are now presented under the Snowflake Cortex umbrella.What you will learnHigh-level picture of the new Snowflake Cortex AI & ML platform.Detailed views on each of the Snowflake Cortex areas.How ML experiments were done on Snowflake before Cortex.How ML experiments can be implemented today with the Snowpark ML APIs, part of Snowflake Cortex.End-to-end Machine Learning with Snowpark ML and its Model Registry.How to use the new regression and classification ML-powered classes and functions, in Snowflake Cortex.How to call the new LLM functions from Snowflake Cortex, and compare them with ChatGPT.What to expect from Snowflake Copilot and other incoming Snowflake features in Cortex.What was the roadmap and what are the future plans of Snowflake for Snowflake Cortex or in the AI & ML areas.What this course is NOT aboutI will not teach you data science and machine learning here from ground up. You are expected to have some basic knowledge about ML, DL, LLMs…I will not teach you about many other areas in Snowflake. You are expected to have basic knowledge of Snowflake and data warehouses in general.I will not teach programming in Python or SQL. It's a hands-on course and you are expected to have some basic knowledge in this area. However, I may come-up with some small Streamlit apps, but I'll keep everything simple and easy to understand.This will not cover EVERYTHING you can do as data science in Snowflake. While you will see experiments "the old way" with scikit-learn, or integrations with ChatGPT, these are not part of Cortex! We have to limit mostly to Snowflake Cortex, as that's a huge platform.While Snowpark Container Services are also very new and they also target mostly ML and DL experiments (especially the new containers with GPUs), there will not be enough time to go deeper in detail. And this is also a very specialized and more difficult platform to understand. I may come up later on with a more advanced course on SPCS, but for now this course does not require such an advanced level of preparation.Who I amThe only world-class expert from Canada selected for the Snowflake Data Superhero program in 2021.SnowPro Certification SME (Subject Matter Expert) - many SnowPro exam questions have been created by me.Passed four SnowPro certification exams to date (with no retakes): Core, Architect, Data Engineer, Data Analyst.Specialized in Snowflake for the past few years: I worked for Snowflake Partner companies. I served dozens of clients in this capacity or as an independent consultant. Today I share my knowledge with highly specialize courses on Snowflake.A few of my latest Data Science and Machine Learning certificationsAWS Certified in Machine LearningMicrosoft Azure Data Scientist AssociateMicrosoft Azure AI Engineer AssociateMicrosoft Azure AI FundamentalsTensorFlow Developer CertificateAlteryx Machine Learning Fundamentals CertifiedDataiku ML Practitioner CertifiedDataiku MLOps Practitioner CertifiedNeo4j Graph Data Science CertifiedTigerGraph Graph Algorithms for Machine LearningThis course truly offers a complete coverage of the new Snowflake Cortex, and my intention is to update it frequently. Enroll today, and keep this course forever!
Overview
Section 1: Introduction to Snowflake Cortex
Lecture 1 Course Structure and Content
Lecture 2 Welcome to This Course
Lecture 3 All About This Course (FAQ Post)
Lecture 4 Roadmap to Snowflake Cortex
Lecture 5 Quick Tips: SQL Query Without Typing SQL
Lecture 6 Related Features and Technologies
Lecture 7 Overview of Snowflake Cortex
Lecture 8 Quick Tips: TRANSLATE LLM Function
Lecture 9 Quick Checkpoint: About …Quick Checkpoints
Section 2: ML Pipelines on Datasets (outside Snowflake)
Lecture 10 About this Section
Lecture 11 Quick Tips: Correlation Heatmap
Lecture 12 Introduction: Machine Learning Basics
Lecture 13 Introduction: ML Pipeline Phases
Lecture 14 Introduction: ML Pipeline Architectures
Lecture 15 Quick Checkpoint: What if You Already Know All This?
Lecture 16 Data Collection: Time Series Generation
Lecture 17 Data Collection: Make Regression/Classification
Lecture 18 Data Collection: Realistic Fake Data Generation
Lecture 19 Data Collection: Data Access
Lecture 20 Data Collection: Data Split
Lecture 21 Data Collection: Overview
Lecture 22 Quick Tips: Fake but Realistic Data Generation
Lecture 23 Data Exploration: Overview
Lecture 24 Data Exploration: Correlation Matrix Heatmap
Lecture 25 Data Exploration: Pandas Profiling
Lecture 26 Quick Checkpoint: About Pandas Profiling
Lecture 27 Data Wrangling: Overview
Lecture 28 Data Wrangling: Feature Engineering with Pandas DataFrame
Lecture 29 Data Wrangling: Data Preprocessing with Transformers
Lecture 30 Data Wrangling: Data Preprocessing with Pipeline
Lecture 31 Quick Checkpoint: About Basic ML on Datasets
Lecture 32 Quick Tips: SUMMARIZE LLM Function
Lecture 33 Model Training: Overview
Lecture 34 Model Training: Regression
Lecture 35 Model Training: Classification
Lecture 36 Model Validation: Manual Hyperparameter Optimization
Lecture 37 Model Validation: Manual Cross-Validation
Lecture 38 Model Validation: GridSearchCV for Regression
Lecture 39 Model Validation: RandomizedSearchCV for Classification
Lecture 40 Quick Checkpoint: About Model Validation
Lecture 41 Model Evaluation: Performance Metrics for Regression
Lecture 42 Model Evaluation: Performance Metrics for Classification
Lecture 43 Model Serving: Save/Load the Trained Model File
Lecture 44 Quick Tips: Signup for a Free Snowflake Trial Account
Section 3: ML Pipelines using Snowpark (before Cortex)
Lecture 45 About this Section
Lecture 46 Quick Tips: Uploading Files in Snowflake
Lecture 47 Introduction: Snowpark Components
Lecture 48 Introduction: Procedures and Functions from SQL
Lecture 49 Introduction: Snowpark for Python
Lecture 50 Introduction: Procedures and Functions from Python
Lecture 51 Introduction: Vectorized User-Defined Functions
Lecture 52 Introduction: Runtimes and Package Versions
Lecture 53 Introduction: Snowpark for ML Pipelines
Lecture 54 Data Collection: Populating with SQL Statements
Lecture 55 Data Collection: Synthetic Data Generation
Lecture 56 Data Collection: Faker Library in Python Worksheet
Lecture 57 Quick Tips: Easiest Way to Connect to Snowflake
Lecture 58 Data Collection: Uploading with SQL Scripts
Lecture 59 Data Collection: Uploading with Python Code
Lecture 60 Data Collection: Uploading from External Stages
Lecture 61 Data Collection: Uploading Other Datasets
Lecture 62 Data Collection: Sample Data Extraction
Lecture 63 Data Collection: Data Split
Lecture 64 Quick Checkpoint: About Ingesting Data in Snowflake
Lecture 65 Quick Tips: Correlation Heatmap in Snowflake
Lecture 66 Data Exploration: Snowsight Charts and Dashboards
Lecture 67 Data Exploration: Snowflake Partner Notebooks
Lecture 68 Data Exploration: Snowflake Notebooks
Lecture 69 Data Exploration: Overview
Lecture 70 Quick Tips: Data Profiling in Snowflake
Lecture 71 Quick Checkpoint: Pandas vs Snowpark Data Frames
Lecture 72 Feature Engineering: Pandas vs Snowpark DataFrames
Lecture 73 Feature Engineering: Using Pandas DataFrames
Lecture 74 Feature Engineering: Using Snowpark DataFrames
Lecture 75 Feature Engineering: Scalability Check with Python Worksheets
Lecture 76 Feature Engineering: Overview
Lecture 77 Quick Checkpoint: About the Python Worksheets
Lecture 78 Quick Tips: DataFrame Queries
Lecture 79 Data Preprocessing: When You Cannot Avoid Pandas
Lecture 80 Model Training: Sentiment Analysis in Local Mode
Lecture 81 Model Training: Sentiment Analysis with Stored Procedure
Lecture 82 Model Training: Overview
Lecture 83 Model Training: Sentiment Analysis with Imported Modules
Lecture 84 Model Training: House Predictions with Stored Procedure
Lecture 85 Model Serving: Overview
Lecture 86 Model Serving: Sentiment Predictions with UDFs
Lecture 87 Model Serving: Sentiment Predictions with SQL
Lecture 88 Model Serving: House Predictions with Vectorized UDF
Lecture 89 Model Serving: Introduction to Cachetools
Lecture 90 Model Serving: UDFs vs Vectorized UDFs
Section 4: ML Pipelines with Snowpark ML (in Cortex)
Lecture 91 About this Section
Lecture 92 Introduction: Snowpark ML APIs
Lecture 93 Data Collection: FileSystem
Lecture 94 Data Collection: FileSet and Framework Connectors
Lecture 95 Data Collection: SnowflakeFile
Lecture 96 Data Collection: Overview
Lecture 97 Distributed Preprocessing: Sklearn vs Snowpark ML
Lecture 98 Distributed Preprocessing: Snowpark vs Snowpark ML
Lecture 99 Distributed Preprocessing: Notebook Experiments
Lecture 100 Distributed Preprocessing: Overview
Lecture 101 Quick Tips: Python Worksheets
Lecture 102 Model Training: Sklearn vs Snowpark ML
Lecture 103 Model Training: Snowpark vs Snowpark ML
Lecture 104 Model Training: Notebook Experiment
Lecture 105 Model Training: Overview
Lecture 106 Quick Tips: Estimator Pattern in Snowpark ML Modeling
Lecture 107 Quick Checkpoint: About the Roadmap to Snowpark ML
Lecture 108 Distributed HPO: Sklearn vs Snowpark ML
Lecture 109 Distributed HPO: Snowpark vs Snowpark ML
Lecture 110 Distributed HPO: Notebook Experiment
Lecture 111 Distributed HPO: Overview
Lecture 112 Distributed Metrics: Sklearn vs Snowpark ML
Lecture 113 Distributed Metrics: Snowpark vs Snowpark ML
Lecture 114 Distributed Metrics: Notebook Experiment
Lecture 115 Distributed Metrics: Overview
Lecture 116 Snowflake MLOps: Overview
Lecture 117 Snowflake MLOps: Logging a Model
Lecture 118 Snowflake MLOps: The Model Registry
Lecture 119 Snowflake MLOps: Model Predictions from Registered Models
Lecture 120 Snowflake MLOps: Model Types and Providers
Lecture 121 Quick Tips: Prediction Functions from Model Registry
Lecture 122 Cost of Snowpark ML
Lecture 123 Quick Tips: Warehouse Auto-Suspend Value
Lecture 124 Quick Checkpoint: About Auto-Suspend in Warehouses
Section 5: ML Functions (in Cortex)
Lecture 125 About this Section
Lecture 126 Quick Tips: Simple Classification through Wizard
Lecture 127 Introduction: ML Classes
Lecture 128 Introduction: ML Class Methods
Lecture 129 Introduction: Snowflake SQL Classes
Lecture 130 Introduction: Snowflake SQL Class Instances
Lecture 131 Quick Checkpoint: About the ML-Powered Functions
Lecture 132 Classification: Binary Classifier
Lecture 133 Classification: Multiclass Classifier
Lecture 134 Classification: Bank Classifier
Lecture 135 Classification: Overview
Lecture 136 Quick Tips: Confusion Heatmap for Classification ML Class
Lecture 137 Forecasting: Time Series Data
Lecture 138 Forecasting: Prepare Sales Data
Lecture 139 Forecasting: Train Model and Predict Sales
Lecture 140 Forecasting: Train Model and Predict Temperatures
Lecture 141 Forecasting: Overview
Lecture 142 Anomaly Detection: Overview
Lecture 143 Anomaly Detection: Detect Outliers in Sales
Lecture 144 Anomaly Detection: Automation with Tasks and Alerts
Lecture 145 Anomaly Detection: Detect Outliers in Temperatures
Lecture 146 Quick Tips: Marking Outliers for Anomaly Detection
Lecture 147 Quick Checkpoint: About Forecasting and Anomaly Detection
Lecture 148 Gradient Boosting: Algorithm
Lecture 149 Gradient Boosting: Classifier & Regressor
Lecture 150 Contribution Explorer: Overview
Lecture 151 Contribution Explorer: What Led to a Change in Sales
Lecture 152 Contribution Explorer: What Makes a Customer Take to a Loan
Lecture 153 Contribution Explorer: How to Survive on Titanic
Lecture 154 Quick Checkpoint: TOP_INSIGHTS is NOT a Time Series Function!
Lecture 155 Access Rights: Introduction to Roles
Lecture 156 Access Rights: Classification
Lecture 157 Access Rights: Forecasting and Anomaly Detection
Lecture 158 Quick Checkpoint: About Access Rights to ML Classes and Functions
Lecture 159 Cost of ML Functions
Section 6: LLM Functions and Extensions (in Cortex)
Lecture 160 About this Section
Lecture 161 Quick Tips: SENTIMENT LLM Function
Lecture 162 Introduction to LLM Functions: Overview
Lecture 163 Introduction to LLM Functions: Quick Demo
Lecture 164 Introduction to Data Science: Important Milestones
Lecture 165 Introduction to Data Science: Deep Learning Review
Lecture 166 Introduction to Data Science: Generative AI Review
Lecture 167 Quick Checkpoint: About Deep Learning in Snowflake
Lecture 168 ChatGPT Integrations: Local Applications
Lecture 169 ChatGPT Integrations: Snowflake Applications
Lecture 170 ChatGPT Integrations: Overview
Lecture 171 COMPLETE LLM Functions
Lecture 172 EXTRACT_ANSWER LLM Function
Lecture 173 SENTIMENT LLM Function
Lecture 174 SUMMARIZE LLM Functions
Lecture 175 TRANSLATE LLM Function
Lecture 176 Quick Checkpoint: About the Specialized LLM Functions
Lecture 177 Applications with Cortex LLM Functions
Lecture 178 Access Rights to LLM Functions
Lecture 179 Cost of LLM Functions
Lecture 180 Quick Tips: Mistral-Large Cost
Lecture 181 Quick Checkpoint: About Mistral Large
Lecture 182 LLM Extensions in Snowsight
Lecture 183 Universal Search: Overview
Lecture 184 Snowflake Copilot: Quick Demo
Lecture 185 Snowflake Copilot: Overview
Lecture 186 Snowflake Copilot: SQL Query Generation with LangChain and ChatGPT
Lecture 187 Quick Checkpoint: Is Snowflake Copilot Reliable Enough?
Lecture 188 Document AI: Overview
Lecture 189 Document AI: Private Data Access with LlamaIndex and ChatGPT
Lecture 190 Quick Checkpoint: About ChatGPT Integrations
Section 7: Wrapping Up
Lecture 191 Setup Instructions: GitHub Project and VSCode
Lecture 192 Setup Instructions: Free Snowflake Trial Account
Lecture 193 Setup Instructions: ChatGPT/OpenAI Account
Lecture 194 Congratulations, You Made It!
Lecture 195 Bonus Lecture
Data Scientists who want to learn about all AI & ML opportunities in Snowflake,Data Analysis looking how to use the new ML-based and LLM functions,Data and Software Engineers looking to expand into AI & ML on Snowflake,Project Managers looking for a 360 degree view of the new Snowflake Cortex platform,Data Architects willing to understand fast how Snowflake Cortex is built,Anyone else looking for a high-level (but detailed) picture of Snowflake Cortex,Anyone looking to understand the code and query pushdown model of Snowflake,Anyone looking how to save money on Snowflake using new built-in ML and LLM functions,Anyone looking for other Snowflake features yet to come in the AI & ML area