Data Management Google Professional Data Engineer GCP

  1. Home
  2. Data Management Google Professional Data Engineer GCP
  • Bigtable supports a number of methods for interacting with data, as
    • HTTP API
    • the gRPC API
    • the cbt command-line tool
    • client libraries
    • an HBase client (Google HBase Java client library or the HBase shell)

Dataflow templates export data from Bigtable as

  • Avro files
  • Parquet files
  • SequenceFiles

 

Migrating the data from an Apache HBase cluster to a Cloud Bigtable cluster –

  • Export the data as a series of Hadoop sequence files.
  • Collect details from HBase.
  • Export HBase tables to sequence files.
  • Move the sequence files to Cloud Storage.
  • Import the sequence files into Bigtable using Dataflow.
  • Validate the move.
Menu