Migration to Google Cloud Google Professional Data Engineer GCP
two migration models to transferring HDFS data
push
- simplest model
- the source cluster runs the distcp jobs on its data nodes and pushes files directly to Cloud Storage.
Pull
- is complex
- An ephemeral Dataproc cluster runs the distcp jobs on its data nodes,
- pulls files from the source cluster, and copies them to Cloud Storage.
Google Professional Data Engineer (GCP) Free Practice TestTake a Quiz