That is currently not possible, as org-mode has no feature to modify an image generated by some other program (plantuml, graphviz etc)
Also, originally you asked about captions generally. A code block can also run e.g. Python code and use the output of it in a result block. The minority of code blocks generate images.
Add comment