Data durability Google Professional Data Engineer GCP

  • Bigtable stores data on Colossus, Google’s internal, highly durable file system.
  • You do not need to run an HDFS cluster or any other file system to use it.
  • If the instance uses replication, one copy of the data is kept in Colossus for each cluster in the instance.
  • Each copy is located in a different zone or region, which further improves durability.
  • Google uses proprietary storage methods to achieve this durability.
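
The replication rule in the list above (one copy per cluster, each copy in a different zone or region) can be illustrated with a small model. This is only a sketch for reasoning about the rule, not a Google Cloud client API: the `Cluster` class, the helper functions, and the cluster IDs and zones are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cluster:
    """Hypothetical stand-in for one cluster of a replicated instance."""
    cluster_id: str
    zone: str  # a GCP zone name such as "us-east1-b"


def copies_in_colossus(clusters):
    """One copy of the data is kept per cluster in the instance."""
    return len(clusters)


def zones_are_distinct(clusters):
    """Check that no two clusters (and so no two copies) share a zone."""
    zones = [c.zone for c in clusters]
    return len(zones) == len(set(zones))


def region_of(zone):
    """A GCP zone name is its region plus a suffix, e.g. us-east1-b -> us-east1."""
    return zone.rsplit("-", 1)[0]


# Illustrative instance with two clusters in different regions (made-up IDs).
instance = [
    Cluster("cluster-a", "us-east1-b"),
    Cluster("cluster-b", "us-west1-a"),
]

print(copies_in_colossus(instance))                      # number of data copies
print(zones_are_distinct(instance))                      # True if every copy has its own zone
print(sorted({region_of(c.zone) for c in instance}))     # regions that hold a copy
```

In this model, an instance with a single cluster holds one copy, and adding a cluster in another zone adds one more copy in that zone.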