I'm quite new to machine learning and just got introduced to principle component analysis as a dimensionality reduction method. What I don't understand, in which circumstances is PCA any better than simply removing some features from the model? If the aim is to obtain lower dimensional data, why don't we just group those features that are correlated and retain one single feature from each group?
Principal component analysis vs feature removal
1.4k Views Asked by Botond At
1
There are 1 best solutions below
Related Questions in MACHINE-LEARNING
- sqlsrv_query doesn't return false on faulty T-SQL query
- PHP error connecting to MS SQL 2005 using CodeIgniter
- SQL Insert Date Mystery on 2012 from 2005
- SQL Server 2000 query conversion
- How to compare two date in SQL Server 2005?
- Different SELECT's for an INSERT INTO
- Selecting from a column with Ampersand(&)
- Get Parent and grand parents of a particular child
- UPDATE after INSERT for potentially multiple rows - not working
- Is there a way to re-write this SQL query using a WITH clause or any other CTE that might be even better?
Related Questions in PRINCIPAL-COMPONENTS
- sqlsrv_query doesn't return false on faulty T-SQL query
- PHP error connecting to MS SQL 2005 using CodeIgniter
- SQL Insert Date Mystery on 2012 from 2005
- SQL Server 2000 query conversion
- How to compare two date in SQL Server 2005?
- Different SELECT's for an INSERT INTO
- Selecting from a column with Ampersand(&)
- Get Parent and grand parents of a particular child
- UPDATE after INSERT for potentially multiple rows - not working
- Is there a way to re-write this SQL query using a WITH clause or any other CTE that might be even better?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
There is a fundamental difference between feature reduction (such as PCA) and feature selection (which you describe). The crucial difference is that feature reduction (PCA) maps your data to lower dimensional through some projection of all original dimensions, for example PCA uses linear combination of each. So final data embedding has information from all features. If you perform feature selection you discard information, you completely loose anything that was present there. Furthermore, PCA guarantees you retaining given fraction of the data variance.