[sklearn] Cross-Validation

1. Cross-Validation

from sklearn.model_selection import cross_val_score

from sklearn.model_selection import cross_validate

from sklearn.model_selection import KFold

from sklearn.model_selection import LeaveOneOut

from sklearn.model_selection import ShuffleSplit
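
As a rough illustration of how the functions and splitters imported above fit together, here is a minimal sketch. The iris dataset, the LogisticRegression model, and all parameter values are assumptions chosen for the example, not part of the original notes.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import (
    KFold,
    LeaveOneOut,
    ShuffleSplit,
    cross_val_score,
    cross_validate,
)

# Assumed example data and model; any estimator with fit/score would do.
X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=1000)

# An integer cv on a classifier uses stratified k-fold by default.
scores = cross_val_score(model, X, y, cv=5)

# cross_validate returns a dict that also holds timings and,
# when requested, the training scores.
results = cross_validate(model, X, y, cv=5, return_train_score=True)

# Plain k-fold; shuffle because the iris samples are ordered by class.
kfold = KFold(n_splits=5, shuffle=True, random_state=0)
kfold_scores = cross_val_score(model, X, y, cv=kfold)

# Leave-one-out: one fold per sample, each test set has a single sample.
loo_scores = cross_val_score(model, X, y, cv=LeaveOneOut())

# ShuffleSplit: repeated random train/test splits of a chosen size.
shuffle_split = ShuffleSplit(n_splits=10, test_size=0.5, random_state=0)
shuffle_scores = cross_val_score(model, X, y, cv=shuffle_split)

print(scores.mean(), kfold_scores.mean(), loo_scores.mean(), shuffle_scores.mean())
```

Passing a splitter object as cv gives control over shuffling and fold sizes, while a bare integer is usually enough for a quick check.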

from sklearn.model_selection import GroupKFold

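The groups array marks samples that must not be split across the training and test sets, for example samples that come from the same person.
scores = cross_val_score( model, X_data, y_data, groups=groups, cv=GroupKFold(n_splits=n) )
Each split then puts data from different people into the training set and the test set. Note that groups are not the same thing as class labels.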
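
The group-wise split described above can be sketched as follows. The synthetic data from make_blobs and the layout of 12 hypothetical people with 5 samples each are assumptions made for illustration.

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GroupKFold, cross_val_score

# Assumed synthetic data: 60 samples, 3 classes.
X, y = make_blobs(n_samples=60, random_state=0)

# Hypothetical grouping: 12 people, 5 consecutive samples per person.
groups = np.repeat(np.arange(12), 5)

cv = GroupKFold(n_splits=3)

# groups is keyword-only in current scikit-learn versions.
scores = cross_val_score(
    LogisticRegression(max_iter=1000), X, y, groups=groups, cv=cv
)
print(scores)

# Check that no person appears on both sides of any split.
for train_idx, test_idx in cv.split(X, y, groups):
    assert set(groups[train_idx]).isdisjoint(groups[test_idx])
```

The final loop only verifies the property the notes describe: a group never appears in both the training and the test set of the same split.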

2. Grid Search

from sklearn.model_selection import GridSearchCV

Grid search tries every combination of the candidate values for the parameters of interest.
param_grid = { 'C' : [0.001,0.1,1,10] }
grid_search = GridSearchCV( model, param_grid, cv=n, return_train_score = True )
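grid_search.fit( X_train, y_train ) : runs cross-validation for every parameter combination, finds the best parameters, and automatically refits a new model on the full training set with them
grid_search.score( X_test, y_test ) : evaluates generalization performance on the test set
grid_search.best_params_ : the best parameters
grid_search.best_score_ : the best mean cross-validation score (accuracy, for a classifier)
grid_search.best_estimator_ : the refitted model, for inspecting details such as coefficients and feature importances
grid_search.cv_results_ : the full grid search results, useful for visualization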
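
Putting the calls above together, a minimal end-to-end sketch might look like this. The SVC model, the iris dataset, and cv=5 are assumptions for the example; the param_grid is the one from the notes.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

# Assumed example data; hold out a test set that the search never sees.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

param_grid = {"C": [0.001, 0.1, 1, 10]}
grid_search = GridSearchCV(SVC(), param_grid, cv=5, return_train_score=True)

# Cross-validates each candidate on the training data, then refits
# the best one on the whole training set (refit=True is the default).
grid_search.fit(X_train, y_train)

# Generalization performance of the refitted model.
test_score = grid_search.score(X_test, y_test)

print("best params:", grid_search.best_params_)
print("best CV score:", grid_search.best_score_)
print("test score:", test_score)
print("best estimator:", grid_search.best_estimator_)

# cv_results_ is a dict of arrays with one entry per candidate.
results = grid_search.cv_results_
for params, mean_test, mean_train in zip(
    results["params"], results["mean_test_score"], results["mean_train_score"]
):
    print(params, round(mean_test, 3), round(mean_train, 3))
```

Keeping the test set out of the search means grid_search.score reports performance on data that played no part in choosing C.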

+ References and sources

(Hanbit Media) Andreas Müller, Sarah Guido < Introduction to Machine Learning with Python > (Korean edition)
