Changelog
Source:NEWS.md
mixgb 1.4.0
- Optimised
mixgb()
for large datasets:- Significantly faster imputation by optimising data preprocessing and the use of RcppArmadillo
- Enhanced memory efficiency with in-place modifications using data.table
- Bootstrapping option removed from
mixgb()
. Users can still use bootstrap in the archived functionmixgb0()
. -
PMM
is now set toNULL
by default.
mixgb 1.3.1
Compatibility
- Makes the package compatible with XGBoost 2.0.0 with GPU support:
- Introduces a new parameter
device
. - Deprecates parameters
gpu_id
andpredictor
. - Sets
tree_method = "hist"
by default, aligning with XGBoost 2.0.0.
- Introduces a new parameter
New Features
- Introduces support for saving imputation models to a local directory through the parameter
save.models.folder
inmixgb()
.- Models save in JSON format using
xgb.save()
, a method recommended by XGBoost for future compatibility. - When
save.models.folder
is specified, the return object ofmixgb()
includes the current imputed datasets, directories for imputation models, and relevant parameters. This object can save usingsaveRDS()
as it doesn’t directly contain the models. Users can later load this object into R and employimpute_new(object, newdata, ...)
for new data imputation.
- Models save in JSON format using
mixgb 1.1.0
New Features
- Introduces
default_params()
, an auxiliary function formixgb()
, to validate the list of XGBoost hyperparameters supplied by the user. It simplifies hyperparameter modifications without requiring explicit specification of all default values.
mixgb 1.0.1
Changes in Default Settings
- Transitions from bootstrapping to subsampling. Subsampling, set at
subsample = 0.7
, becomes the default method due to identified biases with bootstrapping in certain scenarios.- Default for
mixgb()
:- Subsampling:
subsample = 0.7
. - No bootstrapping:
bootstrap = FALSE
.
- Subsampling:
- Default for
mixgb 0.1.1
Minor Bug Fixes and Updates
- Resolves a minor issue in the
createNA()
function. - Modifies default settings in
mixgb()
:-
ordinalAsInteger
: Changes fromTRUE
toFALSE
. -
max_depth
: Changes from 6 to 3. -
nrounds
: Changes from 50 to 100. -
bootstrap
: Sets toTRUE
by default.
-
mixgb 0.1.0
CRAN release: 2022-06-07
Initial Release
- First version releases on CRAN.
- Supports both single and multiple imputation.
- Offers customisable settings for bootstrapping and predictive matching.
- Provides visual diagnostics for multiply imputed data.
Notes
- Mac OSX users might face challenges with multithreading in
mixgb
asxgboost
requires OpenMP for multi-core operations. For details, please refer to OpenMP for Mac.