This paper uses a transformer neural network model to perform imputation of missing data. The method returns a distribution which allows for easy marginalization which can allow for statistically efficient analysis when combined with a model for inference.