My app is a bash script that runs tesseract on GNU parallel. The data I need to process is to the tune of 50GB. It's too slow if I do it one VM. I need the power of cluster computing but I don't want to set up multiples VMs myself, instead I just want to launch my APP (along with the data files) on Google cluster (Kubernetes?). I don't have much clarity about these concepts. If someone can guide that would be great.
Cluster, Kubernetes, Docker. What to choose for my app?
102 Views Asked by Naveed At
1
There are 1 best solutions below
Related Questions in DOCKER
- Docker, redirecting to virtualbox port
- Collect only from STDERR when using Docker syslog logging driver
- How can I create a docker image from the current system?
- Moving Docker Containers Around
- How can I test with serverspec that Jenkins is running in a jenkins docker container?
- How to deploy django 1.8 on Elastic Beanstalk using Docker
- Emulating `docker run` using the golang docker API
- Where are docker images and containers stored when we use it with Windows?
- docker compose, vagrant and insecure Repository
- Commit data in a mysql container
- oh-my-zsh installation returns non zero code
- Use custom docker binary in CoreOS
- Can I use docker image ubuntu 14.04 if my host is 12.04?
- Hide/obfuscate environmental parameters in docker
- How to add initial users when starting a RabbitMQ Docker container?
Related Questions in KUBERNETES
- How to know a Pod's own IP address from inside a container in the Pod?
- Who will decide the "specified number of pods" for replication controller in kubernetes?
- Access other containers of a pod in Kubernetes
- Kubernetes cluster using Vagrant not working after restart
- kubectl not installed with gcloud SDK
- How do I access the Kubernetes api from within a pod container?
- Exposing several services with Vagrant and Kubernetes on my own server
- Does Kubernetes provision new VMs for pods on my cloud platform?
- Any suggestion for running Aerospike on Kubernetes on CoreOS on GCE?
- Kubernetes - kubectl exec bash - session drop and line width
- Google Container Engine (GKE): "Hello Wordpress" tutorial not working (ERR_CONNECTION_REFUSED)
- Kubernetes Pod Creation Speed
- How can i set max count of pods for replication-controller per node?
- Is there a way to tell kubernetes to update your containers?
- Postgres with Kubernetes and persistentDisk
Related Questions in CLUSTER-COMPUTING
- What clustering algorithms can I consider for graph?
- SQL Server 2012 or 2014 Failover Cluster - Change Instance Port Number
- Scatter 2D coordinates from distance matrix
- Forwarding signals in bash script which is submitted on the cluster
- How to deploy three config server instances for sharding in mongodb?
- do I need to restart ALL the hadoop daemons whenever I make changes to xml configuration files
- pbs python Error: No module named _pbs
- Predis: Pros and Cons of the two cluster strategies
- Hadoop NameNode does not start
- does redis cluster support transactions ?
- Best Practice for synchronized jobs in Application clusters
- Quartz scheduler with percona db cluster giving deadlock errors
- Running Batch Job on Slurm Cluster
- jdbc backed channel on a cluster mode
- How to run Cron Job in Node.js application that uses cluster module?
Related Questions in CONTAINER-IMAGE
- Kubernetes: How to apply Horizontal Pod (HPA) autoscaling for a RC which contains multiple containers?
- How do we map a container image layer to its source Dockerfile command?
- Podman push image fails with tls certification verification error
- Java can't find file while running in Docker container
- Why is a parent container image SHA not listed in the layers of the child?
- Docker Registry: How do I get the list of digest of an image layers?
- What is the difference between UBI and Atomic Base Image
- Container image - What is the purpose of using the phrase "Container image"?
- unable to deploy local container image to k8s cluster
- How to copy artifacts between projects in Google Artifact Registry
- Cluster, Kubernetes, Docker. What to choose for my app?
- Weblogic fmw_infrastructure 12c Docker Container- Domain Creation Failed.. Please check the Domain Logs
- Fetch single layer from image repository using docker or other tooling?
- How to refer local docker images loaded from tar file in Kubernetes deployment?
- How can I define an owner to an empty_dir using container_image or container_layer from bazel rules_docker?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Might be a challenge to learn all the container orchestration details from scratch when you are just concerned about this one use case.
While GNU Parrellel is nice on a single machine, there don't seem to be many starter kits for using it in distributed mode in the cloud.
I would consider google dataflow rather than spining up a K8S cluster. It allocates and cleans up easily and lets you avoid managing VMs and learning an orchestration framework.