Tags
Language
Tags
December 2024
Su Mo Tu We Th Fr Sa
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30 31 1 2 3 4

Mastering Data Integration With Ibm Datastage

Posted By: ELK1nG
Mastering Data Integration With Ibm Datastage

Mastering Data Integration With Ibm Datastage
Published 10/2023
MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 KHz
Language: English | Size: 2.25 GB | Duration: 7h 58m

Unlock the Power of Data Integration: Practical Training with IBM DataStage (ETL)

What you'll learn

Fundamentals of Data Integration: Understand the core concepts and types of data integration and explore real-world examples.

Navigating IBM Information Server: Get acquainted with the components of IBM Information Server and its role in data integration.

IBM Information Server Administration: Learn to navigate the IBM Information Server Administration Console and practice essential administrative tasks.

Exploring IBM DataStage: Dive into the architecture of IBM DataStage, its key components, and practical uses

Developing in IBM DataStage: Work hands-on in DataStage, create projects, explore job types, and utilize design elements for parallel processing.

DataStage Administration: Acquire practical skills in DataStage administration, including user management, permissions, and environment variables.

Metadata Management: Practice metadata management using DataStage Designer, importing, and exporting components.

Creating Parallel Jobs: Engage in practical sessions to create parallel jobs, define parameters, and document your jobs effectively.

Accessing Sequential Data: Hands-on experience in handling sequential data, utilizing the Sequential File stage, and managing reject links.

Implementing Partitioning and Collecting Algorithms: Gain practical insights into partition parallelism, partitioning algorithms, and collecting strategies.

Combining Data with Stages: Work with Lookup, Join, Merge, and Funnel stages, and practice their applications in real-world scenarios.

Group Processing Stages: Learn to sort data effectively, remove duplicates, and utilize Aggregator stages in practical exercises.

Transforming Data: Practice using the Transformer stage, constraints, and debugging techniques for data transformation.

Repository Functions: Explore practical aspects of using the repository, finding differences between jobs, and performing impact analyses.

Working with Relational Data: Engage in hands-on activities involving connector stages, reading and writing to database tables, and utilizing data connection ob

Job Sequence Control: Gain practical experience in creating job sequences, defining triggers, and managing job activities through various stages.

Real Practice: AWS Cloud Integration: Apply your skills to integrate data with AWS Cloud services in real-world scenarios.

Real Practice: Data Vault 1.0 & 2.0 Integration: Practical exercises in integrating Data Vault concepts into your data integration projects.

Requirements

Basic Understanding of Data Concepts: A fundamental grasp of data concepts is recommended. Students should understand terms like data sources, data transformation, and data loading.

SQL Knowledge (Optional): While not mandatory, having some familiarity with SQL (Structured Query Language) can be beneficial, especially when working with relational databases.

Access to IBM DataStage: Ideally, students should have access to IBM DataStage software to practice and follow along with the course.

IBM DataStage Software (Optional): If students want to practice the skills learned in the course, having access to IBM DataStage software is beneficial

Desire to Learn: A genuine interest in data integration and a willingness to learn and practice the concepts taught in the course are essential.

Description

Unlock the power of data integration with IBM DataStage, the industry-leading ETL (Extract, Transform, Load) tool. In this comprehensive course, you'll embark on a journey from data integration basics to advanced techniques, empowering you to harness the full potential of your data.What You'll Learn:Foundations of Data Integration: Begin by understanding the core concepts and types of data integration, laying a strong foundation for your journey.IBM Information Server: Explore the IBM Information Server ecosystem and its vital components to comprehend where DataStage fits in.Hands-On Administration: Get hands-on with DataStage administration tasks, managing users, roles, and permissions with ease.Mastering Metadata: Learn to work effectively with metadata, a crucial aspect of data integration, to streamline your processes.Parallel Jobs Creation: Dive into parallel job creation, understand its intricacies, and design efficient parallel jobs.Accessing Sequential Data: Master the art of accessing sequential data, a crucial skill in data integration.Advanced Algorithms: Explore partitioning and collecting algorithms, vital for efficient data processing.Combine Data Effectively: Get comfortable with stages like Lookup, Join, Merge, and Funnel to combine data seamlessly.Group Processing Stages: Learn to group process data, sort it, and aggregate it effectively.Transformer Stage: Dive deep into the Transformer stage and its capabilities for data transformation.Repository Functions: Understand repository functions, impact analysis, and how to compare different jobs.Relational Data Integration: Work with relational data using connector stages, read from and write to database tables.Job Sequence Control: Master job sequencing, control the flow of jobs, and create complex workflows.Real-world Practice: Apply your knowledge in real-world scenarios with practical AWS Cloud and Data Vault integration sessions.

Overview

Section 1: Introduction to Data Integration

Lecture 1 Introduction

Lecture 2 Outline of the course

Lecture 3 Get the matterials

Section 2: Data Integration in Data management

Lecture 4 The agenda of the session

Lecture 5 Data Integration in Data management

Lecture 6 Some concepts and kinds of Data Integration

Lecture 7 What does data integration look like?

Section 3: Introduction to IBM Information Server

Lecture 8 The agenda of this session

Lecture 9 Introduction to IBM Information Server

Lecture 10 Introduction to IBM Information Server (cont.)

Lecture 11 Key IBM Information Server Components

Lecture 12 IBM Information Server topology

Lecture 13 IBM Information Server topology (Cont.)

Section 4: IBM Information Server Administration Console

Lecture 14 The agenda of this session

Lecture 15 IBM Information Server Administration Console

Lecture 16 IBM Information Server Administration Console (Cont.)

Lecture 17 Real Practice 1 – IBM Information Server Administration Console

Section 5: Introduction to IBM DataStage

Lecture 18 The agenda of this session

Lecture 19 DataStage Architecture

Lecture 20 DataStage Administrator

Lecture 21 DataStage Designer

Lecture 22 DataStage Director

Section 6: Developing in DataStage and Features

Lecture 23 The agenda of this session

Lecture 24 Developing in DataStage

Lecture 25 DataStage project repository

Lecture 26 Types of DataStage Jobs

Lecture 27 Design Elements of Parallel Jobs

Lecture 28 Partition Parallelism

Lecture 29 Multi-Node Partitioning

Lecture 30 Job design versus execution

Lecture 31 Configuration File

Lecture 32 Configuration File (Cont.)

Lecture 33 Summary - Developing in IBM DataStage and Features

Section 7: DataStage Administration

Lecture 34 The agenda of this session

Lecture 35 Unit Objectives - DataStage Administration

Lecture 36 Information Server Web Console - Administration

Lecture 37 Web Console Login Window

Lecture 38 User and Group Management

Lecture 39 Creating a DataStage User ID

Lecture 40 Assign DataStage Roles

Lecture 41 DataStage Credentials

Lecture 42 DataStage Credentials Default Mapping

Lecture 43 Logging onto DataStage Administrator

Lecture 44 InforSphere DataStage Administration

Lecture 45 DataStage Administrator Tabs

Lecture 46 Environment variables

Lecture 47 Environment reporting variables

Lecture 48 DataStage Administrator Permissions tab

Lecture 49 Add users and groups

Lecture 50 Specify DataStage Role

Lecture 51 DataStage Administrator Logs Tab

Lecture 52 DataStage Administrator Parallel Tab

Lecture 53 Real Practice 2 - Task 1 - Information Server Administration - User & Group

Lecture 54 Real Practice 2 - Task 2 - Project Properties - General

Lecture 55 Real Practice 2 - Task 3 - Project Properties - Environment

Lecture 56 Real Practice 2 - Task 4 - Project Properties - User & Group

Lecture 57 Summary - DataStage Administration

Section 8: Work with metadata

Lecture 58 The agenda of this session

Lecture 59 Unit Objectives - Work with metadata

Lecture 60 Logging onto DataStage Designer

Lecture 61 DataStage Designer

Lecture 62 DataStage project repository

Lecture 63 Import and Export

Lecture 64 Exporting DataStage Components

Lecture 65 Export objects

Lecture 66 Import Procedure

Lecture 67 Import objects

Lecture 68 Real Practice 3 - Export & Import DataStage Components

Lecture 69 Summary - Work with metadata

Section 9: Create Parallel Job

Lecture 70 The agenda of this session

Lecture 71 Unit objectives - Create parallel jobs

Lecture 72 What is a parallel job?

Lecture 73 Tools Palette

Lecture 74 Create a new parallel job

Lecture 75 Drag stages and links from the Palette

Lecture 76 Rename links and stages

Lecture 77 Connection Properties

Lecture 78 Job parameters

Lecture 79 Define a job parameter

Lecture 80 Use a job parameter in a stage

Lecture 81 Add job documentation

Lecture 82 Job Properties window documentation

Lecture 83 Compile and run a job

Lecture 84 Compile, Errors or Successful message

Lecture 85 DataStage Director

Lecture 86 Run options

Lecture 87 Performance statistics

Lecture 88 Director Status View

Lecture 89 Job log, viewed from designer

Lecture 90 Other job log functions

Lecture 91 Director monitor

Lecture 92 Run jobs from the command line

Lecture 93 Parameter sets

Lecture 94 Create a parameter set

Lecture 95 Defining the parameters

Lecture 96 Defining values files

Lecture 97 Load a parameter set into a job

Lecture 98 Use parameter set parameters

Lecture 99 Run jobs with parameter set parameters

Lecture 100 Real Practice 4 - Task 1 - Database –> Transformer –> Database

Lecture 101 Real Practice 4 - Task 2 - Database –> Transformer –> Database with Parameter

Lecture 102 Summary - Create parallel jobs

Section 10: Create parallel jobs - Access sequential data

Lecture 103 The agenda of this session

Lecture 104 Unit objectives - Access sequential data

Lecture 105 How sequential data is handled

Lecture 106 Features of the Sequential File stage

Lecture 107 Job design with Sequential File Stages

Lecture 108 Sequential File stage properties

Lecture 109 Format Tab

Lecture 110 Multiple Readers

Lecture 111 Writing to a sequential file

Lecture 112 Reject links

Lecture 113 Source and target reject links

Lecture 114 Setting the Reject Mode property

Lecture 115 Real Practice 5 - Task 1 - File –> Transformer –> File

Lecture 116 Real Practice 5 - Task 2 - File –> Transformer –> DataSet with Parameter

Lecture 117 Real Practice 6 - File –> Transformer –> DataSet

Lecture 118 Summary - Access sequential data

Section 11: Partitioning and collecting algorithms

Lecture 119 The agenda of this session

Lecture 120 Unit objectives - Partitioning and collecting algorithms

Lecture 121 Partition parallelism

Lecture 122 Stage partitioning

Lecture 123 DataStage hardware environments

Lecture 124 Partitioning algorithms

Lecture 125 Collecting

Lecture 126 Collecting (Cont.)

Lecture 127 Collecting algorithms

Lecture 128 Keyless versus keyed partitioning algorithms

Lecture 129 Round Robin and Random partitioning

Lecture 130 Entire partitioning

Lecture 131 Hash partitioning

Lecture 132 Modulus partitioning

Lecture 133 Auto partitioning

Lecture 134 Partitioning / collecting link icons

Lecture 135 More partitioning icons

Lecture 136 Specify a partitioning algorithm

Lecture 137 Specify a collecting algorithm

Lecture 138 Configuration file

Lecture 139 Example configuration file

Lecture 140 Adding $APT_CONFIG_FILE as a job parameter

Lecture 141 Editing configuration files

Lecture 142 Summary - Partitioning and collecting algorithms

Section 12: Combine Data

Lecture 143 The agenda of this session

Lecture 144 Unit objectives - Combine Data

Lecture 145 Combine Data

Lecture 146 Lookup, Join, Merge stages

Lecture 147 Lookup Stage features

Lecture 148 Lookup types

Lecture 149 Match Lookup Stage Example

Lecture 150 Lookup Stage with an equality match

Lecture 151 Define the Lookup key

Lecture 152 Specify the output columns

Lecture 153 Lookup failure actions

Lecture 154 Specifying lookup failure actions

Lecture 155 Lookup stage with reject link

Lecture 156 Real Practice 7 - Task 1 - Combine data with Lookup Stage

Lecture 157 Join stage

Lecture 158 Job with Join Stage

Lecture 159 Join Stage Properties

Lecture 160 Output Mapping tab

Lecture 161 Merge stage

Lecture 162 Merge Stage Job

Lecture 163 Merge Stage Properties

Lecture 164 Funnel stage

Lecture 165 Funnel Stage Example

Lecture 166 Funnel Stage Properties

Lecture 167 Real Practice 7 - Task 2 - Combine data with Join Stage

Lecture 168 Real Practice 7 - Task 3 - Combine Data with Merge Stage

Lecture 169 Real Practice 7 - Task 4 - Combine Data with Funnel Stage

Lecture 170 Summary - Combine Data

Section 13: Group processing stages

Lecture 171 The agenda of this session

Lecture 172 Unit objectives - Group processing stages

Lecture 173 Group processing stages

Lecture 174 Sort data

Lecture 175 Sorting alternatives

Lecture 176 In-Stage sorting

Lecture 177 Sort Stage Properties tab

Lecture 178 Specify sort keys

Lecture 179 Partition sorts

Lecture 180 Aggregator stage

Lecture 181 Job with Aggregator Stage

Lecture 182 Aggregation types

Lecture 183 Aggregation types (Cont.)

Lecture 184 Output Mapping Tab

Lecture 185 Output Columns tab

Lecture 186 Calculation aggregation type

Lecture 187 Grouping methods

Lecture 188 Remove duplicates

Lecture 189 Remove Duplicates stage job

Lecture 190 Remove Duplicates stage properties

Lecture 191 Fork - Join Job Design

Lecture 192 Real Practice 8 - File –> Transformer –> DataSet –> Aggregator –> File

Lecture 193 Summary - Group processing stages

Section 14: Transfromer Stage

Lecture 194 The agenda of this session

Lecture 195 Unit Objectives - Transformer stage

Lecture 196 Transformer stage

Lecture 197 Job with a Transformer stage

Lecture 198 Inside the Transformer stage

Lecture 199 Transformer stage elements

Lecture 200 Transformer stage elements

Lecture 201 Constraints

Lecture 202 Constraints example

Lecture 203 Define a constraint

Lecture 204 Use the expression editor

Lecture 205 Otherwise links for data integrity

Lecture 206 Real Practice 9 - File –> Transformer –> File

Lecture 207 Parallel job debugger

Lecture 208 Set breakpoints

Lecture 209 Edit breakpoints

Lecture 210 Running a parallel job in the debugger

Lecture 211 Add columns to the watch list

Lecture 212 Checkpoint - Transformer stage

Section 15: Repository functions

Lecture 213 The agenda of this session

Lecture 214 Unit objectives - Repository funcions

Lecture 215 Quick find

Lecture 216 Found results

Lecture 217 Advanced Find window

Lecture 218 Advanced Find options

Lecture 219 Using the found results

Lecture 220 Performing an impact analysis

Lecture 221 Initiating an impact analysis

Lecture 222 Results in text format

Lecture 223 Results in graphical format

Lecture 224 Displaying the dependency graph

Lecture 225 Displaying the dependency path

Lecture 226 Viewing column-level data flow

Lecture 227 Finding where a column originates

Lecture 228 Displayed results

Lecture 229 Finding the difference between two jobs

Lecture 230 Initiating the comparison

Lecture 231 Comparison results

Lecture 232 Saving to an HTML file

Lecture 233 Summary - Repository functions

Section 16: Work with relational data

Lecture 234 The agenda of this session

Lecture 235 Unit Objectives - Work with relational data

Lecture 236 Connector stages

Lecture 237 Reading from database tables

Lecture 238 Connector stage GUI

Lecture 239 Navigation panel

Lecture 240 Connection properties

Lecture 241 Usage properties - Session and Before/After SQL

Lecture 242 Writing to database tables

Lecture 243 DB2 Connector GUI

Lecture 244 Connector write properties

Lecture 245 Data connection objects

Lecture 246 Data connection objects

Lecture 247 Creating a new data connection object

Lecture 248 Real Practice 11 - Relational Database –> Transformer –> File

Lecture 249 Summary - Work with relational data

Section 17: Job Sequence Control

Lecture 250 The agenda of this session

Lecture 251 Unit Objectives - Job Sequence control

Lecture 252 What is a job sequence?

Lecture 253 Basics for creating a job sequence

Lecture 254 Job sequence stages

Lecture 255 Job sequence example

Lecture 256 Job Activity stage properties

Lecture 257 Job Activity trigger

Lecture 258 Execute Command stage

Lecture 259 Notification Activity stage

Lecture 260 User Variables stage

Lecture 261 Wait for File stage

Lecture 262 Sequencer stage

Lecture 263 Nested Condition stage

Lecture 264 Loop stages

Lecture 265 Summary - Job control

Section 18: AWS Cloud Integration

Lecture 266 The agenda of this session

Lecture 267 Real Practice 12 - AWS Cloud Data Integration with IBM DataStage S3 Connector

Section 19: Data Vault 1.0 & 2.0 Integration

Lecture 268 Data Vault Introduction

Lecture 269 Data Vault Integration with IBM DataStage

Lecture 270 Real Practice 13 - Mini Project 1 - Data Vault 1.0 DWH with IBM DataStage

Lecture 271 Real Practice 13 - Mini Project 2 - Data Vault 2.0 DWH with IBM DataStage

Section 20: Summary

Lecture 272 What we have learned

Data Professionals: Data analysts, data engineers who want to enhance their data integration skills using IBM DataStage.,IT Professionals: IT specialists, software developers, and database administrators who need to work with data integration solutions.,Business Analysts: Business analysts who want to understand how data integration impacts their data-driven decision-making processes.,Students and Graduates: Students pursuing degrees or recent graduates looking to build a foundation in data integration and expand their job prospects.,IBM DataStage Users: Users of IBM DataStage looking to deepen their knowledge, explore advanced features, and improve their job performance.,Anyone Interested in Data Integration: If you have a general interest in data integration and want to learn how IBM DataStage can be used for these purposes, this course is suitable for you.