[python/머신러닝] scikit-learn 사용하여 머신러닝 구현

Python/Machine Learning

유달잇 2021. 2. 6. 23:55

728x90

scikit-learn 모듈을 사용하여 머신러닝을 구현해보자

머신러닝 구현 단계

1단계 : 데이터 수집 및 전처리

2단계 : 데이터 분할

- sklearn.model_selection.train_test_split : 목적 데이터를 분할

3단계 : 모델 지정

- ex) sklearn.ensemble.RandomForestClassifier -> sklearn.ensemble.* 등 모델 사용

4단계 : 모델 학습

- model.fit( X, y )

5단계 : 테스트 및 평가

: model.predict( ) , sklearn.metrics. *

정답률 : accuracy_score( )
적합율 : precision_score( )
재현율 : recall_score( )
평가지표 산출률 : Classification_report( )
F1 measure 값 : f1_score( )

실습하기

 
  from sklearn.model_selection import train_test_split
  from sklearn.ensemble import RandomForestClassifier
  from sklearn.metrics import accuracy_score
  from sklearn import datasets

  # 1. 데이터 로드
  iris = datasets.load_iris()
  data, target = iris.data, iris.target

  # 2. 데이터 분할(학습 데이터, 테스트 데이터)
  X_train, X_test, y_train, y_test = train_test_split(data, target, random_state=0)

  # 3. 모델 지정
  model = RandomForestClassifier()

  # 4. 모델 학습
  model.fit(X_train, y_train)

  # 5. 테스트 및 평가
  y_pred = model.predict(X_test)

  print( '정답률 : ', accuracy_score(y_test, y_pred) )

< 출력 결과 >

정답률 : 0.9736842105263158

728x90

저작자표시 비영리 변경금지 (새창열림)

'Python > Machine Learning' 카테고리의 다른 글

[python/머신러닝] 핸즈온 머신러닝 Chapter 1 (2)	2021.05.14
[python/머신러닝] scikit-learn의 분류 성능평가 (sklearn.metrics) (0)	2021.02.08
[python/머신러닝] 머신러닝이란? (0)	2021.01.31

최근에 올라온 글

TAG more

« 2025/07 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

Total

Today

Yesterday

250x250

Warm Developer