Become a successful data engineer by building and deploying your own data pipelines on Google Cloud, including making key architectural decisionsKey Features: - Get up to speed with data governance on Google Cloud- Learn how to use various Google Cloud products like Dataform, DLP, Dataplex, Dataproc Serverless, and Datastream- Boost your confidence by getting Google Cloud data engineering certification guidance from real exam experiences- Purchase of the print or Kindle book includes a free PDF eBookBook Description: The second edition of Data Engineering with Google Cloud builds upon the success of the first edition by offering enhanced clarity and depth to data professionals navigating the intricate landscape of data engineering.Beyond its foundational lessons, this new edition delves into the essential realm of data governance within Google Cloud, providing you with invaluable insights into managing and optimizing data resources effectively.
Author(s): Adi Wijaya
476 Pages
Computers + Internet, Data Modeling & Design
Description
About the Book
This book will help you delve into data governance on Google Cloud. Moreover, you'll also cover the latest technological advancements in the domain and be able to build and deploy data pipelines confidently.
Book Synopsis
Become a successful data engineer by building and deploying your own data pipelines on Google Cloud, including making key architectural decisions
Key Features:
- Get up to speed with data governance on Google Cloud
- Learn how to use various Google Cloud products like Dataform, DLP, Dataplex, Dataproc Serverless, and Datastream
- Boost your confidence by getting Google Cloud data engineering certification guidance from real exam experiences
- Purchase of the print or Kindle book includes a free PDF eBook
Book Description:
The second edition of Data Engineering with Google Cloud builds upon the success of the first edition by offering enhanced clarity and depth to data professionals navigating the intricate landscape of data engineering.
Beyond its foundational lessons, this new edition delves into the essential realm of data governance within Google Cloud, providing you with invaluable insights into managing and optimizing data resources effectively. Written by a Data Strategic Cloud Engineer at Google, this book helps you stay ahead of the curve by guiding you through the latest technological advancements in the Google Cloud ecosystem. You'll cover essential aspects, from exploring Cloud Composer 2 to the evolution of Airflow 2.5. Additionally, you'll explore how to work with cutting-edge tools like Dataform, DLP, Dataplex, Dataproc Serverless, and Datastream to perform data governance on datasets.
By the end of this book, you'll be equipped to navigate the ever-evolving world of data engineering on Google Cloud, from foundational principles to cutting-edge practices.
What You Will Learn:
- Load data into BigQuery and materialize its output
- Focus on data pipeline orchestration using Cloud Composer
- Formulate Airflow jobs to orchestrate and automate a data warehouse
- Establish a Hadoop data lake, generate ephemeral clusters, and execute jobs on the Dataproc cluster
- Harness Pub/Sub for messaging and ingestion for event-driven systems
- Apply Dataflow to conduct ETL on streaming data
- Implement data governance services on Google Cloud
Who this book is for:
Data analysts, IT practitioners, software engineers, or any data enthusiasts looking to have a successful data engineering career will find this book invaluable. Additionally, experienced data professionals who want to start using Google Cloud to build data platforms will get clear insights on how to navigate the path. Whether you're a beginner who wants to explore the fundamentals or a seasoned professional seeking to learn the latest data engineering concepts, this book is for you.
Table of Contents
- Fundamentals of Data engineering with GCP
- Big Data Capabilities on GCP
- Building a data warehouse in BigQuery
- Build Orchestration for Batch Data Loading Using Cloud Composer
- Building a Data Lake using Dataproc
- Process Streaming Data with Datastream, Pub/Sub and Dataflow
- Visualizing Data for Making Data-Driven Decisions with Looker Studio
- Build machine learning solutions on GCP
- User and Project Management on GCP
- Data Governance in GCP
- Cost Strategy in GCP
- CI/CD on Google Cloud Platform for Data Engineers
- Boost your confidence as a Data Engineer
Dimensions (Overall): 9.25 Inches (H) x 7.5 Inches (W) x .96 Inches (D)
Weight: 1.79 Pounds
Suggested Age: 22 Years and Up
Number of Pages: 476
Genre: Computers + Internet
Sub-Genre: Data Modeling & Design
Publisher: Packt Publishing
Format: Paperback
Author: Adi Wijaya
Language: English
Street Date: April 30, 2024
TCIN: 94346505
UPC: 9781835080115
Item Number (DPCI): 247-23-9179
Origin: Made in the USA or Imported
If the item details aren’t accurate or complete, we want to know about it.
Shipping details
Estimated ship dimensions: 0.96 inches length x 7.5 inches width x 9.25 inches height
Estimated ship weight: 1.79 pounds
We regret that this item cannot be shipped to PO Boxes.
This item cannot be shipped to the following locations: American Samoa (see also separate entry under AS), Guam (see also separate entry under GU), Northern Mariana Islands, Puerto Rico (see also separate entry under PR), United States Minor Outlying Islands, Virgin Islands, U.S., APO/FPO
Return details
This item can be returned to any Target store or Target.com.
This item must be returned within 90 days of the date it was purchased in store, shipped, delivered by a Shipt shopper, or made ready for pickup.