I use K-Means algorithm to create clusters. As you know K-means algorithm needs cluster count as parameter. I try cluster counts as starting two from eight and calculate all C-Index of clusters in every looping then get the avegare of these C-Indexes. Then compare C-Index avegares and choose the minimum C-Index average as best quality cluster count. Is that true way for detecting cluster count?
Can we compare clusters with C-Index average?
874 Views Asked by cagin At
1
There are 1 best solutions below
Related Questions in DATA-MINING
- Does MATLAB support the parallelization of supervised machine learning algorithms? Alternatives?
- Why Confidence does not consider B in Association rule mining
- Clustering based on pearson correlation
- Extract relevant attributes from postal addresses data in order to do PCA on those Data (using R)
- ELKI DBSCAN for million files
- Cluster adjacent points
- Using Python to find correlation pairs
- Nominal valued dataset in machine learning
- Repeated ordered sequence search algorithm
- rankall : returning the correct data frame to rank hospitals on performance
- Which data mining algorithm should I use to find optimum performance (in this case)
- Data Mining issue with the apriori algorithm in C#
- What could be the cause for the slow speed of xgboost?
- K-medoids: k = total dataset
- How to assign more weight to bigram and trigram?
Related Questions in CLUSTERED-INDEX
- Optimize the execution of select
- Does INDEX() create a clustered or non-clustered index in MySQL?
- SQL Performance, Execution plan showing mostly Index Scan instead of Index Seek
- Sort order of an SQL Server 2008+ clustered index
- SqlServer clustered index with sequence and multiple Java servers inserting with pooled algorithm - performance
- Is it possible to have a table without any clustered index on DB2 version 9.7?
- R: clusters in histogram
- view contains a convert that is imprecise or non-deterministic?
- Postgres ignoring clustered index on date query
- Cassandra pattern for modeling columns with mutable keys
- Why SELECT COUNT(*) performs clustered index scan?
- partitioned index on a non partitioned table
- Difference between clustered and nonclustered index
- Performance of Non Clustered Indexes on Heaps vs Clustered Indexes
- Does adding a primary key cause restructuring of underlying data
Related Questions in KNIME
- Transforming less frequent values
- Create sequential IDs for rows
- Knime too slow - performance
- AttributeSelectedClassifier - How to deal with error "A nominal attribute (likes) cannot have duplicate labels ('(0.045455-0.045455]')"
- Fill pandas dataframe within a for loop
- Bioisosteric replacement using SMARTS (KNIME and RDKit)
- Can we compare clusters with C-Index average?
- Class Prediction - Data analytics
- Extract Student number
- Data preprocessing to both train and test sets?
- the dialog cannot be opened for the following reason: No column in spec compatible to 'DoubleValue'
- Python pandas: Assign values in column based on predecessor values
- Remove part of a string in each row of a large column of data in KNIME
- Knime, relevant nodes for regression analysis ( statistical models)
- Copy and rename files on shared drive
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
There is no one correct way to detect cluster count. See following google search, this is still an active research area. Wikipedia articles says that:
Only you can determine if using c-index in this way is a good way to determine cluster numbers in your domain. See another question of using c-index in clustering.