In [1]:
import pandas as pd
import numpy as np
import statsmodels.api as sm
In [3]:
df=pd.read_csv("PWK.csv")
df
Out[3]:
   Tanaman Pangan  Luas Panen
0            5230          34
1           42520         149
2          165300         616
3          178350        1316
4          279320        2033
5         1919000        3708
In [4]:
corr = df.corr()
corr
Out[4]:
Tanaman Pangan Luas Panen
Tanaman Pangan 1.000000 0.903771
Luas Panen 0.903771 1.000000
Interpretation of Results
Based on the correlation table, the strength of the relationship between the variables
Tanaman Pangan (food crops) and Luas Panen (harvested area) is 0.904, which is very
strong because it is close to 1. The direction of the correlation is positive because
the Pearson coefficient is positive (0.904). Note that df.corr() reports only the
coefficient, not a p-value, so significance has to be tested separately.
In conclusion, the correlation between Luas Panen and Tanaman Pangan is very strong
and positive.
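The significance of the correlation can be checked with a short calculation. The sketch below assumes the six rows shown in Out[3] are the complete data set and re-enters them by hand; the variable names are illustrative. It computes Pearson's r with NumPy, converts it to a t statistic with n - 2 = 4 degrees of freedom, and compares that with 2.776, the two-sided 5% critical value of Student's t for 4 degrees of freedom.

```python
import math
import numpy as np

# Values re-entered from Out[3] (assumed to be the complete data set)
tanaman_pangan = np.array([5230, 42520, 165300, 178350, 279320, 1919000], dtype=float)
luas_panen = np.array([34, 149, 616, 1316, 2033, 3708], dtype=float)

n = len(tanaman_pangan)

# Pearson correlation coefficient (off-diagonal entry of the 2x2 correlation matrix)
r = np.corrcoef(tanaman_pangan, luas_panen)[0, 1]

# t statistic for H0: no linear correlation, with n - 2 degrees of freedom
t_stat = r * math.sqrt(n - 2) / math.sqrt(1 - r**2)

# Two-sided 5% critical value of Student's t with 4 degrees of freedom
T_CRIT_DF4 = 2.776

print(f"r = {r:.6f}, t = {t_stat:.3f}, significant at 5%: {abs(t_stat) > T_CRIT_DF4}")
```

In a regression with a single predictor, this t statistic is the same quantity as the t value of the slope, so the two tests should agree. `scipy.stats.pearsonr` offers a one-call alternative that returns the coefficient together with a p-value.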
In [5]:
x = df[['Tanaman Pangan']]
y = df['Luas Panen']
x = sm.add_constant(x)
est = sm.OLS(y,x).fit()
est.summary()
C:\Users\Lenovo\anaconda3\lib\site-packages\statsmodels\stats\stattools.py:74:
ValueWarning: omni_normtest is not valid with less than 8 observations; 6 samples were
given.
warn("omni_normtest is not valid with less than 8 observations; %i "
Out[5]:
                         OLS Regression Results
Dep. Variable:          Luas Panen   R-squared:                0.817
Model:                         OLS   Adj. R-squared:           0.771
Method:              Least Squares   F-statistic:              17.83
Date:             Sat, 22 May 2021   Prob (F-statistic):      0.0134
Time:                     11:33:36   Log-Likelihood:         -46.319
No. Observations:                6   AIC:                      96.64
Df Residuals:                    4   BIC:                      96.22
Df Model:                        1
Covariance Type:         nonrobust

                     coef    std err        t    P>|t|     [0.025     0.975]
const            569.3002    324.016    1.757    0.154   -330.312   1468.912
Tanaman Pangan     0.0017      0.000    4.223    0.013      0.001      0.003

Omnibus:                       nan   Durbin-Watson:            1.186
Prob(Omnibus):                 nan   Jarque-Bera (JB):         0.773
Skew:                        0.764   Prob(JB):                 0.679
Kurtosis:                    2.131   Cond. No.              9.49e+05
Notes:
[1] Standard Errors assume that the covariance matrix of the errors is correctly specified.
[2] The condition number is large, 9.49e+05. This might indicate that there are
strong multicollinearity or other numerical problems.
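Note [2] deserves a brief check. With a single predictor there is nothing for Tanaman Pangan to be collinear with, so the large condition number more plausibly comes from the difference in scale between the constant column (all ones) and Tanaman Pangan (values up to about 1.9 million). The sketch below, which assumes the six rows in Out[3] are the full data set, computes the 2-norm condition number of the design matrix before and after expressing Tanaman Pangan in thousands. The factor of 1,000 and the helper name `design_matrix` are illustrative choices, not part of the original analysis.

```python
import numpy as np

# Tanaman Pangan values re-entered from Out[3] (assumed to be the complete data set)
tanaman_pangan = np.array([5230, 42520, 165300, 178350, 279320, 1919000], dtype=float)

def design_matrix(x):
    """Return the two-column design matrix [1, x] of a simple regression."""
    return np.column_stack([np.ones_like(x), x])

# Condition number (ratio of largest to smallest singular value) on the raw scale
cond_raw = np.linalg.cond(design_matrix(tanaman_pangan))

# Same predictor expressed in thousands (illustrative rescaling)
cond_scaled = np.linalg.cond(design_matrix(tanaman_pangan / 1000))

print(f"raw scale:    {cond_raw:.3g}")
print(f"in thousands: {cond_scaled:.3g}")
```

Rescaling changes only the units of the slope, which becomes the change in Luas Panen per 1,000 units of Tanaman Pangan. R-squared, the t statistics, and the p-values are unaffected.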
Interpretation of Results
In the regression output, Prob (F-statistic) = 0.0134 is below the 0.05 significance
level, so the model as a whole is significant. R-squared = 0.817 means that the model
explains about 82% of the variation in Luas Panen.
The P>|t| values are 0.154 for the constant and 0.013 for Tanaman Pangan. Only the
Tanaman Pangan coefficient is significant at the 5% level; the constant is not
(0.154 > 0.05).
Regression model: Y = 569.30 + 0.0017X, where Y is Luas Panen and X is Tanaman Pangan.
The constant 569.30 is the predicted average value of Y (Luas Panen) when X (Tanaman
Pangan) equals 0.
The regression coefficient 0.0017 means that each 1-unit increase in X is associated
with an average increase of 0.0017 in Y.
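To make the fitted equation concrete, the sketch below refits the line with `np.polyfit` and predicts Luas Panen for a hypothetical Tanaman Pangan value of 200,000. It assumes the six rows in Out[3] are the full data set; the value 200,000 and the variable names are illustrative. The coefficients should agree with the `const` and `Tanaman Pangan` rows of the OLS summary up to rounding.

```python
import numpy as np

# Values re-entered from Out[3] (assumed to be the complete data set)
tanaman_pangan = np.array([5230, 42520, 165300, 178350, 279320, 1919000], dtype=float)
luas_panen = np.array([34, 149, 616, 1316, 2033, 3708], dtype=float)

# Degree-1 least-squares fit; polyfit returns the highest-order coefficient first
slope, intercept = np.polyfit(tanaman_pangan, luas_panen, 1)

# Hypothetical input inside the observed range of Tanaman Pangan
x_new = 200_000
y_pred = intercept + slope * x_new

print(f"Luas Panen = {intercept:.2f} + {slope:.6f} * Tanaman Pangan")
print(f"Predicted Luas Panen at Tanaman Pangan = {x_new}: {y_pred:.1f}")
```

Predictions far outside the observed range, or close to the single very large observation (1,919,000), rest on very little data and should be treated with caution.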
In [7]:
import seaborn as sns
import matplotlib.pyplot as plt
from scipy.stats import pearsonr
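# White background with slightly enlarged fonts
sns.set(style='white', font_scale=1.2)
# Joint grid of Tanaman Pangan (x) against Luas Panen (y); the axis limits cover the observed range
g = sns.JointGrid(data=df, x='Tanaman Pangan', y='Luas Panen', xlim=(0, 2000000),
ylim=(0,4000), height=5)
# Scatter plot with the fitted regression line and its confidence band
g = g.plot_joint(sns.regplot, color="xkcd:muted blue")
# Optional marginal histograms (disabled); sns.distplot is deprecated in recent seaborn releases
#g = g.plot_marginals(sns.distplot, kde=False, bins=12, color="xkcd:bluey grey")
plt.tight_layout()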