Tags
Language
Tags
June 2025
Su Mo Tu We Th Fr Sa
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30 1 2 3 4 5
    Attention❗ To save your time, in order to download anything on this site, you must be registered 👉 HERE. If you do not have a registration yet, it is better to do it right away. ✌

    ( • )( • ) ( ͡⚆ ͜ʖ ͡⚆ ) (‿ˠ‿)
    SpicyMags.xyz

    Google Cloud Certified Professional Data Engineer 2023

    Posted By: lucky_aut
    Google Cloud Certified Professional Data Engineer 2023

    Google Cloud Certified Professional Data Engineer 2023
    Last updated 7/2023
    Duration: 22h37m | .MP4 1280x720, 30 fps(r) | AAC, 44100 Hz, 2ch | 13.4 GB
    Genre: eLearning | Language: English

    Theory, Hand-ons and 252 Questions, Answers with Explanations. All Hands-Ons in 1-Click Copy-Paste Style. PDF Downloads

    What you'll learn
    Designing data processing systems
    Building and operationalizing data processing systems
    Operationalizing machine learning models
    Ensuring solution quality
    Designing data pipelines
    Designing a data processing solution
    Migrating data warehousing and data processing
    Building and operationalizing storage systems
    Building and operationalizing pipelines
    Building and operationalizing processing infrastructure
    Leveraging pre-built ML models as a service
    Deploying an ML pipeline
    Measuring, monitoring, and troubleshooting machine learning models
    Designing for security and compliance
    Ensuring scalability and efficiency
    Ensuring reliability and fidelity
    Ensuring flexibility and portability


    Requirements
    Everything that you need in order to pass Google Cloud Certified Professional Data Engineer will be covered in this course
    Description
    Designing data processing systems
    Selecting the appropriate storage technologies. Considerations include:
    ● Mapping storage systems to business requirements
    ● Data modeling
    ● Trade-offs involving latency, throughput, transactions
    ● Distributed systems
    ● Schema design
    Designing data pipelines. Considerations include:
    ● Data publishing and visualization (e.g., BigQuery)
    ● Batch and streaming data (e.g., Dataflow, Dataproc, Apache Beam, Apache Spark and Hadoop ecosystem, Pub/Sub, Apache Kafka)
    ● Online (interactive) vs. batch predictions
    ● Job automation and orchestration (e.g., Cloud Composer)
    Designing a data processing solution. Considerations include:
    ● Choice of infrastructure
    ● System availability and fault tolerance
    ● Use of distributed systems
    ● Capacity planning
    ● Hybrid cloud and edge computing
    ● Architecture options (e.g., message brokers, message queues, middleware, service-oriented architecture, serverless functions)
    ● At least once, in-order, and exactly once, etc., event processing
    Migrating data warehousing and data processing. Considerations include:
    ● Awareness of current state and how to migrate a design to a future state
    ● Migrating from on-premises to cloud (Data Transfer Service, Transfer Appliance, Cloud Networking)
    ● Validating a migration
    Building and operationalizing data processing systems
    Building and operationalizing storage systems. Considerations include:
    ● Effective use of managed services (Cloud Bigtable, Cloud Spanner, Cloud SQL, BigQuery, Cloud Storage, Datastore, Memorystore)
    ● Storage costs and performance
    ● Life cycle management of data
    Building and operationalizing pipelines. Considerations include:
    ● Data cleansing
    ● Batch and streaming
    ● Transformation
    ● Data acquisition and import
    ● Integrating with new data sources
    Building and operationalizing processing infrastructure. Considerations include:
    ● Provisioning resources
    ● Monitoring pipelines
    ● Adjusting pipelines
    ● Testing and quality control
    Operationalizing machine learning models
    Leveraging pre-built ML models as a service. Considerations include:
    ● ML APIs (e.g., Vision API, Speech API)
    ● Customizing ML APIs (e.g., AutoML Vision, Auto ML text)
    ● Conversational experiences (e.g., Dialogflow)
    Deploying an ML pipeline. Considerations include:
    ● Ingesting appropriate data
    ● Retraining of machine learning models (AI Platform Prediction and Training, BigQuery ML, Kubeflow, Spark ML)
    ● Continuous evaluation
    Choosing the appropriate training and serving infrastructure. Considerations include:
    ● Distributed vs. single machine
    ● Use of edge compute
    ● Hardware accelerators (e.g., GPU, TPU)
    Measuring, monitoring, and troubleshooting machine learning models. Considerations include:
    ● Machine learning terminology (e.g., features, labels, models, regression, classification, recommendation, supervised and unsupervised learning, evaluation metrics)
    ● Impact of dependencies of machine learning models
    ● Common sources of error (e.g., assumptions about data)
    Ensuring solution quality
    Designing for security and compliance. Considerations include:
    ● Identity and access management (e.g., Cloud IAM)
    ● Data security (encryption, key management)
    ● Ensuring privacy (e.g., Data Loss Prevention API)
    ● Legal compliance (e.g., Health Insurance Portability and Accountability Act (HIPAA), Children's Online Privacy Protection Act (COPPA), FedRAMP, General Data Protection Regulation (GDPR))
    Ensuring scalability and efficiency. Considerations include:
    ● Building and running test suites
    ● Pipeline monitoring (e.g., Cloud Monitoring)
    ● Assessing, troubleshooting, and improving data representations and data processing infrastructure
    ● Resizing and autoscaling resources
    Ensuring reliability and fidelity. Considerations include:
    ● Performing data preparation and quality control (e.g., Dataprep)
    ● Verification and monitoring
    ● Planning, executing, and stress testing data recovery (fault tolerance, rerunning failed jobs, performing retrospective re-analysis)
    ● Choosing between ACID, idempotent, eventually consistent requirements
    Ensuring flexibility and portability. Considerations include:
    ● Mapping to current and future business requirements
    ● Designing for data and application portability (e.g., multicloud, data residency requirements)
    ● Data staging, cataloging, and discovery
    Who this course is for:
    Beginner
    Intermediate
    Advanced


    More Info