Scikit-Learn Linear Regression how to get coefficient's respective features?

Question

Scikit-Learn Linear Regression how to get coefficient's respective features?

132.4k Views Asked by jeffrey At 15 November 2014 at 23:14

I'm trying to perform feature selection by evaluating my regressions coefficient outputs, and select the features with the highest magnitude coefficients. The problem is, I don't know how to get the respective features, as only coefficients are returned form the coef._ attribute. The documentation says:

Estimated coefficients for the linear regression problem. If multiple targets are passed during the fit (y 2D), this is a 2D array of shape (n_targets, n_features), while if only one target is passed, this is a 1D array of length n_features.

I am passing into my regression.fit(A,B), where A is a 2-D array, with tfidf value for each feature in a document. Example format:

         "feature1"   "feature2"
"Doc1"    .44          .22
"Doc2"    .11          .6
"Doc3"    .22          .2

B are my target values for the data, which are just numbers 1-100 associated with each document:

"Doc1"    50
"Doc2"    11
"Doc3"    99

Using regression.coef_, I get a list of coefficients, but not their corresponding features! How can I get the features? I'm guessing I need to modfy the structure of my B targets, but I don't know how.

Original Q&A

There are 8 best solutions below

**Jake0x32** · Answer 1 · 2014-11-15T23:31:21.357000

I suppose you are working on some feature selection task. Well using regression.coef_ does get the corresponding coefficients to the features, i.e. regression.coef_[0] corresponds to "feature1" and regression.coef_[1] corresponds to "feature2". This should be what you desire.

Well I in its turn recommend tree model from sklearn, which could also be used for feature selection. To be specific, check out here.

**Kirsche** · Answer 2 · 2017-04-29T19:41:31.050000

What I found to work was:

X = your independent variables

coefficients = pd.concat([pd.DataFrame(X.columns),pd.DataFrame(np.transpose(logistic.coef_))], axis = 1)

The assumption you stated: that the order of regression.coef_ is the same as in the TRAIN set holds true in my experiences. (works with the underlying data and also checks out with correlations between X and y)

**Snowde** · Answer 3 · 2017-06-09T09:08:40.357000

Snowde On 09 June 2017 at 09:08

coefficients = pd.DataFrame({"Feature":X.columns,"Coefficients":np.transpose(logistic.coef_)})

**clieforce** · Answer 4 · 2018-09-20T03:13:42.157000

clieforce On 20 September 2018 at 03:13

Suppose your train data X variable is 'df_X' then you can map into a dictionary and feed into pandas dataframe to get the mapping:

pd.DataFrame(dict(zip(df_X.columns,model.coef_[0])),index=[0]).T

**Pran Kumar Sarkar** · Answer 5 · 2019-01-03T17:24:52.570000

Pran Kumar Sarkar On 03 January 2019 at 17:24

You can do that by creating a data frame:

cdf = pd.DataFrame(regression.coef_, X.columns, columns=['Coefficients'])
print(cdf)

**Ankit Kumar Rajpoot** · Answer 6 · 2020-04-25T13:22:46.247000

Coefficients and features in zip

print(list(zip(X_train.columns.tolist(),logreg.coef_[0])))

Coefficients and features in DataFrame

pd.DataFrame({"Feature":X_train.columns.tolist(),"Coefficients":logreg.coef_[0]})

**Hanan Tabak** · Answer 7 · 2020-08-18T12:16:21.093000

Hanan Tabak On 18 August 2020 at 12:16

Try putting them in a series with the data columns names as index:

coeffs = pd.Series(model.coef_[0], index=X.columns.values)
coeffs.sort_values(ascending = False)

**Pablo Vilas** · Answer 8 · 2021-12-29T13:49:22.620000

Pablo Vilas On 29 December 2021 at 13:49

This is the easiest and most intuitive way:

pd.DataFrame(logisticRegr.coef_, columns=x_train.columns)

or the same but transposing index and columns

pd.DataFrame(logisticRegr.coef_, columns=x_train.columns).T

Scikit-Learn Linear Regression how to get coefficient's respective features?

There are 8 best solutions below

Coefficients and features in zip

Coefficients and features in DataFrame

Related Questions in SCIKIT-LEARN

Related Questions in LINEAR-REGRESSION

Related Questions in FEATURE-SELECTION

Trending Questions

Popular # Hahtags

Popular Questions