@kellogh @swaldman ...personally I am not sure whether this is the right way to address this issue. Of course the problem of training LLMs on large pools of unlicensed data, including private/personal data used without permission (when it's needed), should be addressed; I just don't know if this is the way...🤔