Spark recently fleshed out the ML Pipeline API, so I have been looking into writing my own transformers. However, some useful utilities are private to spark or ml. Take, for example, the Identifiable trait and object, which are private to spark. I would very much like to use the randomUID method, and I am curious why it is not exposed.
Private objects and traits in spark and ml
Asked by Chris (654 views)
The short version of the answer is that Spark aims for API stability, so anything whose behavior people think they might want to change is marked as private. Part of this also comes from the PR merge process: you have to be very explicit to add a new public API, so it is often easier to just make private versions of the things you need. I realize this can be a bit frustrating. If there is a specific part of Spark you think should be added to the public API, you can try filing a JIRA.
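In the meantime, one possible workaround for the randomUID case is to keep a small helper in your own code rather than depending on Spark's private one. The following is only a minimal sketch: `LocalIdentifiable` is a hypothetical name, not part of Spark, and the format it produces (prefix, underscore, last 12 characters of a random UUID) is an assumption that is not guaranteed to match what Spark generates internally.

```scala
import java.util.UUID

// Hypothetical stand-in for the private Identifiable helper; not a Spark API.
object LocalIdentifiable {
  // Returns the prefix, an underscore, and the last 12 characters
  // of a freshly generated random UUID (12 hex digits).
  def randomUID(prefix: String): String =
    prefix + "_" + UUID.randomUUID().toString.takeRight(12)
}
```

A custom transformer could then call `LocalIdentifiable.randomUID("myTransformer")` wherever it needs a unique ID. Because the helper lives in your own code, it is unaffected if the private Spark version changes or is removed.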