社区微信群开通啦,扫一扫抢先加入社区官方微信群
社区微信群
ML之xgboost:基于xgboost(5f-CrVa)算法对HiggsBoson数据集(Kaggle竞赛)训练(模型保存+可视化)实现二分类预测
目录
Dataset之HiggsBoson:Higgs Boson(Kaggle竞赛)数据集的简介、下载、案例应用之详细攻略
更新中……
1、交叉训练时间比较长,大约需要20多分钟。
更新中……
更新中……
num_round = 1000
n_estimators = cvresult.shape[0]
print ('running cross validation, with preprocessing function')
# do cross validation, for each fold
# the dtrain, dtest, param will be passed into fpreproc
# then the return value of fpreproc will be used to generate results of that fold
cvresult = xgb.cv(param, dtrain, num_round, nfold=5,
metrics={'ams@0.15', 'auc'},
early_stopping_rounds=10, seed = 0,
fpreproc = fpreproc)
print ('finish cross validation','n',cvresult)
print ('train model using the best parameters by cv ... ')
bst = xgb.train( param, dtrain, n_estimators )
bst.save_model('data_input/xgboost/data_higgsboson/higgs_cv.model')
如果觉得我的文章对您有用,请随意打赏。你的支持将鼓励我继续创作!