miae: Multiple Imputation Through Autoencoders

Name: miae: Multiple Imputation Through Autoencoders
Start: 2023-12-07T13:30:00Z
End: 2023-12-07T15:10:00Z
Location: Macquarie University, NSW, Australia

Dec 7, 2023 · Yongshi Deng


Abstract

Standard implementations of multiple imputation have limitations when handling missing data in large datasets with complex structures. Satisfactory imputation performance often depends on specifying the imputation model correctly so that it accounts for interactions among variables. Imputing a large dataset can therefore be daunting, particularly when many variables are incomplete. In this talk, we discuss the potential of applying different variants of autoencoders to multiple imputation. We present a comprehensive analysis of the effect of hyperparameters on imputation performance, provide insights into the suitability of autoencoders for multiple imputation tasks, and give practical suggestions for improving their imputation performance. The proposed procedure is implemented in an R package, miae, which uses torch as the backend, so no Python setup is required. In addition, miae aims to provide an automated procedure: the main imputation function handles tasks such as data preprocessing and postprocessing automatically, without extra work from users. Various statistical techniques have also been implemented to enhance the imputation performance of miae, and its performance is evaluated and compared with that of mice and mixgb. The development version of miae is available on GitHub.
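
For readers new to the idea, the generic multiple-imputation workflow behind the abstract can be sketched as follows: impute the data several times with a stochastic imputer, analyse each completed dataset, then pool the results with Rubin's rules. This is a minimal illustration in standard-library Python and not the miae API. The names `impute_once`, `multiply_impute_mean`, and `pool_rubin` are hypothetical, and the normal-draw imputer is only a toy stand-in for a stochastic autoencoder.

```python
import random
import statistics


def pool_rubin(estimates, variances):
    """Pool m point estimates and their variances with Rubin's rules."""
    m = len(estimates)
    if m < 2 or len(variances) != m:
        raise ValueError("need at least two estimates and matching variances")
    q_bar = statistics.fmean(estimates)       # pooled point estimate
    within = statistics.fmean(variances)      # average within-imputation variance
    between = statistics.variance(estimates)  # between-imputation variance (m - 1 divisor)
    total = within + (1 + 1 / m) * between    # total variance of the pooled estimate
    return q_bar, total


def impute_once(values, rng):
    """Toy stochastic imputer (assumption): fill each None with a normal draw
    based on the observed mean and standard deviation."""
    observed = [v for v in values if v is not None]
    mu = statistics.fmean(observed)
    sd = statistics.stdev(observed)
    return [v if v is not None else rng.gauss(mu, sd) for v in values]


def multiply_impute_mean(values, m=5, seed=0):
    """Estimate the mean of `values` from m imputed copies, pooled with Rubin's rules."""
    rng = random.Random(seed)
    estimates, variances = [], []
    for _ in range(m):
        completed = impute_once(values, rng)
        estimates.append(statistics.fmean(completed))
        # Variance of the sample mean within this completed dataset
        variances.append(statistics.variance(completed) / len(completed))
    return pool_rubin(estimates, variances)


if __name__ == "__main__":
    data = [1.0, None, 3.0, 4.0, None, 6.0]
    pooled_mean, total_variance = multiply_impute_mean(data, m=5, seed=1)
    print(pooled_mean, total_variance)
```

For example, `pool_rubin([1.0, 2.0, 3.0], [0.5, 0.5, 0.5])` gives a pooled estimate of 2.0 and a total variance of 0.5 + (1 + 1/3) × 1.0 ≈ 1.833. The between-imputation term is what separates multiple imputation from filling in each missing value once.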

Date

Dec 7, 2023 1:30 PM — 3:10 PM

Event

The 12th conference of the Asian Regional Section of the International Association for Statistical Computing (IASC-ARS)

Location

Macquarie University, NSW, Australia

Wallumattagal Campus, Sydney

Last updated on Dec 7, 2023

miae · Autoencoders · Multiple Imputation