I wanted to install some python packages (eg: python-json-logger) on Serverless Dataproc. Is there a way to do an initialization action to install python packages in serverless dataproc? Please let me know.
Installing python packages in Serverless Dataproc GCP
2.9k Views Asked by Ish14 At
1
There are 1 best solutions below
Related Questions in PYTHON
- new thread blocks main thread
- Extracting viewCount & SubscriberCount from YouTube API V3 for a given channel, where channelID does not equal userID
- Display images on Django Template Site
- Difference between list() and dict() with generators
- How can I serialize a numpy array while preserving matrix dimensions?
- Protractor did not run properly when using browser.wait, msg: "Wait timed out after XXXms"
- Why is my program adding int as string (4+7 = 47)?
- store numpy array in mysql
- how to omit the less frequent words from a dictionary in python?
- Update a text file with ( new words+ \n ) after the words is appended into a list
- python how to write list of lists to file
- Removing URL features from tokens in NLTK
- Optimizing for Social Leaderboards
- Python : Get size of string in bytes
- What is the code of the sorted function?
Related Questions in GOOGLE-CLOUD-PLATFORM
- Google Logging API - What service name to use when writing entries from non-Google application?
- Custom exception message from google endpoints exception
- Unable to connect database of lamp instance from servlet running on tomcat instance of google cloud
- How to launch a Jar file using Spark on hadoop
- Google Cloud Bigtable Durability/Availability Guarantees
- How do I add a startup script to an existing VM from the developer console?
- What is the difference between an Instance and an Instance group
- How do i change files using ftp in google cloud?
- How to update all machines in an instance group on Google Cloud Platform?
- Setting up freeswitch server on Google cloud compute
- Google Cloud Endpoints: verifyToken: Signature length not correct
- Google Cloud BigTable connection setup time
- How GCE HTTP Cross-Region Load Balancing implemented
- Google Cloud Bigtable compression
- Google cloud SDK code to execute via cron
Related Questions in DATAPROC
- Disk utilization of Dataproc Worker Node is getting increased day by day
- Error connecting to jdbc with pyspark in dataproc
- CPU core allocation in a DataProc cluster in GCP
- Not able to create a dataproc cluster with image version 2.0 404 HTTP response code 22 exit code see output in
- Invalid Argument When Creating Dataproc Cluster on GKE
- Create an email alert for a PySpark job executing on Google Dataproc
- Error installing package from private repository on Dataproc cluster
- Accessing Dataproc Cluster through Apache Livy?
- configuring dataproc with an external hive metastore
- ValueError: unknown enum label "Hudi"
- Using Spark Bigquery Connector on Dataproc and data appears to be delayed by an hour
- Is it possible that i set fully customized metric for auto scale-out with dataproc worker node in GCP (Google Cloud Platform)
- Trigger spark submit jobs from airflow on Dataproc Cluster without SSH
- Installing python packages in Serverless Dataproc GCP
- Dataproc: Can user create workers of different instance types?
Related Questions in GOOGLE-CLOUD-DATAPROC-SERVERLESS
- Google Cloud Dataproc Serverless gcloud ttl flag unrecognized argument
- Installing python packages in Serverless Dataproc GCP
- Googld cloud dataproc serverless (batch) pyspark reads parquet file from google cloud storage (GCS) very slow
- how to pass custom job id via google dataproc cluster job for spark using dataproc client
- Serverless Dataproc Error- Batch ID is required
- Dataproc Serverless - how to set javax.net.ssl.trustStore property to fix java.security.cert.CertPathValidatorException
- Dataproc on GKE via Terraform not working (example provided by Terraform doc)
- Reducing Dataproc Serverless CPU quota
- Dataproc: How to implement auto scale based on presto load
- does google provide techincal support for dataproc's optional components ex. Ranger?
- Use Google Cloud Workflows to trigger Dataproc Batch job
- Serverless spark job throwing an error while using shared VPC to connect on-prem storage
- How to enable component gateway, jupyter notebook in gcp dataproc cluster, once the cluster is created
- compute.requireOsLogin violated in dataproc serverless
- Programmatically cancelling a pyspark dataproc batch job
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
You have two options:
You can create a custom image with dependencies(python packages) in the GCR(Google Container Registry GCP) and add uri as parameter in the command below:
e.g.
To create custom container image for Dataproc Serveless for Spark.
Add to python-file the script below, it will install the desired package and then load this package into the container path (dataproc servless), this file must be saved in a bucket, this uses the secret manager package as an example.
python-file.py
finally the perator calls the python-file.py