My code:
import matplotlib.pyplot as plt
import pandas as pd
data = pd.read_csv('data/data.csv')
X = data[['x1','x2']]
y = data['y']
from sklearn.svm import SVC
classifier = SVC()
classifier.fit(X,y)
plt.scatter(data['x1'], data['x2'], c=y, s=50)
plt.show()
My data:
x1,x2,y
0.336493583877,-0.985950993354,0.0
-0.0110425297266,-0.10552856162,1.0
0.238159509297,-0.61741666482,1.0
-0.366782883496,-0.713818716912,1.0
1.22192307438,-1.03939898614,0.0
Probably Support Vector Machine isn't the best algorithm to be used there, but I would like to see the boundary generated for that. How to do it?
Building off of Sun Yi's answer, you can use the example code from here. For example, you don't have all the points in your
data.csv
in your question but we can produce a plot with the decision boundary like this:This is taken heavily from the example code in the link above. I tried to only include what was needed to keep it simple. Here is the output image:
You'll notice that I explicitly used the
rbf
kernel as the full data in your example isn't linearly separable. For a nice, more general than mine, answer on these contours this answer is good.