Google Dataproc

Click here for the official Google Dataproc Docs

Contents

  1. Creating a Dataproc Cluster
  2. Deleting a Dataproc Cluster

Creating a Dataproc Cluster

Click here for the official Google Docs on creating a Dataproc cluster

  1. Navigate to Dataproc dataprocnavigatetopre_20191022

  2. Click on "Create Cluster" dataproccreateclickpre_20191022

  3. Complete the form to create the bucket dataproccreatecluster_20191022

    • Give your cluster an appropriate name
    • Select the Region → always select "europe-west1"
    • Adjust your Cluster mode if required
    • Configure your master node
      • Select the number of CPUs
      • Adjust the amount of RAM to suit your requirements
      • Set your Primary disk size and type
    • Configure your worker nodes
      • Select the number of CPUs
      • Adjust the amount of RAM to suit your requirements
      • Set your Primary disk size and type
      • Select the number of nodes you require
    • Check "Component gateway" to you to connect to the cluster components:
    • Click Advanced options
    • Select or create a Cloud Storage staging bucket. This will allow you to store your notebooks outside of the cluster so they can be safe from deletion
  4. Click create

Deleting a Dataproc Cluster

Click here for the official Google Docs on creating a Dataproc cluster

  1. Navigate to Dataproc dataprocnavigatetopre_20191022
  2. Select the cluster you want to delete
  3. Click on "Delete Cluster" dataprocdeletecluster_20191022