Google Dataproc
Click here for the official Google Dataproc Docs
Contents
Creating a Dataproc Cluster
Click here for the official Google Docs on creating a Dataproc cluster
-
Navigate to Dataproc
-
Click on "Create Cluster"
-
Complete the form to create the bucket
- Give your cluster an appropriate name
- Select the Region → always select "europe-west1"
- Adjust your Cluster mode if required
- Configure your master node
- Select the number of CPUs
- Adjust the amount of RAM to suit your requirements
- Set your Primary disk size and type
- Configure your worker nodes
- Select the number of CPUs
- Adjust the amount of RAM to suit your requirements
- Set your Primary disk size and type
- Select the number of nodes you require
- Check "Component gateway" to you to connect to the cluster components:
- Click Advanced options
- Select or create a Cloud Storage staging bucket. This will allow you to store your notebooks outside of the cluster so they can be safe from deletion
- Click create
Deleting a Dataproc Cluster
Click here for the official Google Docs on creating a Dataproc cluster
- Navigate to Dataproc
- Select the cluster you want to delete
- Click on "Delete Cluster"