XGBRegressor vs. xgboost.train diferença de velocidade enorme?

Se eu treinar meu modelo usando o seguinte código:

import xgboost as xg
params = {'max_depth':3,
'min_child_weight':10,
'learning_rate':0.3,
'subsample':0.5,
'colsample_bytree':0.6,
'obj':'reg:linear',
'n_estimators':1000,
'eta':0.3}

features = df[feature_columns]
target = df[target_columns]
dmatrix = xg.DMatrix(features.values,
                     target.values,
                     feature_names=features.columns.values)
clf = xg.train(params, dmatrix)

termina em cerca de 1 minuto.

Se eu treinar meu modelo usando o método Sci-Kit:

import xgboost as xg
max_depth = 3
min_child_weight = 10
subsample = 0.5
colsample_bytree = 0.6
objective = 'reg:linear'
num_estimators = 1000
learning_rate = 0.3

features = df[feature_columns]
target = df[target_columns]
clf = xg.XGBRegressor(max_depth=max_depth,
                min_child_weight=min_child_weight,
                subsample=subsample,
                colsample_bytree=colsample_bytree,
                objective=objective,
                n_estimators=num_estimators,
                learning_rate=learning_rate)
clf.fit(features, target)

demora mais de 30 minutos.

Eu acho que o código subjacente é quase exatamente o mesmo (ou seja, XGBRegressorchamadas xg.train) - o que está acontecendo aqui?

machine-learning python decision-trees xgboost efficiency user1566200
fonte

Respostas:

xgboost.trainirá ignorar o parâmetro n_estimators, enquanto xgboost.XGBRegressoraceita. Em xgboost.train, o aumento de iterações (ie n_estimators) é controlado por num_boost_round(padrão: 10)

No seu caso, o primeiro código fará 10 iterações (por padrão), mas o segundo fará 1000 iterações. Não haverá qualquer grande diferença se você tentar mudar clf = xg.train(params, dmatrix)para clf = xg.train(params, dmatrix, 1000),

Referências

http://xgboost.readthedocs.io/en/latest/python/python_api.html#xgboost.train

http://xgboost.readthedocs.io/en/latest/python/python_api.html#xgboost.XGBRegressor

Lâmina gelada
fonte