Do any of my friends have pointers to resources on statistical generation of synthetic data? Assume the reader/user is comfortable with Python and R but hasn't done a stats course in…a while. thanks
If you click on the squares at the top of the main page (the ones that say 'Single Table', 'Multi Table', etc) it will open a Google Colab notebook with tutorials
@gvwilson The “synthpop” R package is excellent for when you have an existing dataset and you want to create a synthetic version (e.g., for sharing data while protecting research participants’ confidentiality)
@gvwilson FWIW, TensorFlow Probability and PyMC both implement Markov chain Monte Carlo (although it's not hard to write your own) and have documentation available.
Add comment