Is it possible to deploy several Hadoop clusters in one Google Cloud project?
Multiple Hadoop clusters in one Google Cloud project
Asked by Evgeny Timoshenko
Using bdutil, you can deploy arbitrarily many Hadoop clusters in a single Google Cloud project, as long as you have enough Google Compute Engine quota to do so. The bdutil instructions describe its usage in full, but in short, bdutil distinguishes clusters by name, set through the PREFIX variable or the --prefix flag. It's up to you to keep track of the zone and number of workers in each bdutil cluster.

To keep track of multiple clusters easily, it's highly recommended to use bdutil's generate_config command. For example, suppose you want three clusters, test, staging and prod, perhaps of different sizes and in different zones. You would run generate_config once per cluster, giving each its own prefix, zone and worker count, and writing each result to its own file.

Once you've done that, the files test-cluster_env.sh, staging-cluster_env.sh and prod-cluster_env.sh can be used to refer to your three clusters from now on, for example to delete your test cluster or to deploy just your prod cluster.
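As a rough sketch of the generate_config step for the three example clusters: the project ID, bucket name, zones and worker counts below are placeholders, not values from the answer, and the short flags (-P, -p, -b, -z, -n) should be checked against `./bdutil --help` for your version. The script only prints the commands so they can be reviewed before being run from the bdutil directory.

```shell
# Placeholders (assumptions): substitute your own project ID and bucket.
PROJECT="my-project"
BUCKET="my-bucket"

# Print the generate_config command for one cluster.
# Arguments: cluster name, zone, number of workers.
gen_config_cmd() {
  echo "./bdutil -P ${1}-cluster -p ${PROJECT} -b ${BUCKET} -z ${2} -n ${3} generate_config ${1}-cluster_env.sh"
}

# Zones and worker counts are illustrative only.
gen_config_cmd test us-central1-f 2
gen_config_cmd staging us-central1-a 5
gen_config_cmd prod us-central1-b 10
```

Each printed command, once executed, writes one env file that captures that cluster's prefix, zone and size, so later commands only need the file name.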
Or, to SSH into the master of your staging cluster, pass staging-cluster_env.sh to bdutil's shell command.
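The three operations mentioned above (delete, deploy and SSH) follow the same pattern: select the cluster's env file with -e, then name the bdutil command. A small sketch, assuming the env files sit in the bdutil directory; the helper name bdutil_cmd is made up for illustration, and it only prints each command, which is a sensible precaution given that delete is destructive.

```shell
# Print the bdutil invocation for a cluster's env file and a bdutil command.
# Arguments: cluster name (test, staging, prod), bdutil command.
bdutil_cmd() {
  echo "./bdutil -e ${1}-cluster_env.sh ${2}"
}

bdutil_cmd test delete     # tear down the test cluster
bdutil_cmd prod deploy     # deploy only the prod cluster
bdutil_cmd staging shell   # SSH into the staging master
```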
When you do it this way, you can store your *-cluster_env.sh files in source control, and they'll keep working whenever you upgrade bdutil to a new Google release.
If you need to customize bdutil more extensively, consider obtaining bdutil directly from GitHub by cloning its repository with git.
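A minimal sketch of that workflow, assuming the repository is the GoogleCloudPlatform/bdutil project on GitHub and that your customizations are committed locally. The helper names fetch_bdutil and update_bdutil are made up for illustration.

```shell
# Assumed repository location; verify it before cloning.
BDUTIL_REPO="https://github.com/GoogleCloudPlatform/bdutil.git"

# One-time: clone bdutil into ./bdutil.
fetch_bdutil() {
  git clone "${BDUTIL_REPO}" bdutil
}

# Periodically: pull upstream changes and merge them with local commits.
update_bdutil() {
  (cd bdutil && git pull)
}
```

Run fetch_bdutil once, then update_bdutil whenever you want a newer release. If a pull reports conflicts with your customizations, git marks the affected files for you to resolve by hand.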
That way you can use git to update to fresh versions of bdutil periodically, letting git merge upstream changes with your customizations and flag any conflicts.