jbigham, I’m thinking of putting out a yearly report on my toddler, modeled after these LLM expert papers — like there will be a chart on the delta of her math performance from year 3 to year 4, some dubious comparison to gpt4, etc… this is a good idea right?