
You all might enjoy some of the things I've made with it.

Tour of the Sacred Library — A short story illustrated with VQGAN+CLIP https://moultano.wordpress.com/2021/07/20/tour-of-the-sacred...

Doorways — A series of images exploring "semantic symmetry" using CLIP's embeddings to do visual analogy completion. https://moultano.wordpress.com/2021/08/23/doorways/

Depth of Field — Exploring the scale of the Hubble Ultra Deep Field image using CLIP guided diffusion to create visual analogies. https://moultano.wordpress.com/2022/03/24/depth-of-field/



Why don't you share the exact code for these experiments so that anybody can reproduce them? (and tweak them!)


Pretty sure Moultano's Tour was made with a hosted version of the original VQGAN+CLIP method: https://colab.research.google.com/drive/15UwYDsnNeldJFHJ9Ndg... That method and implementation are quite old, though.

If you want an up-to-date list of open implementations, it's here: https://pharmapsychotic.com/tools.html Whichever version of Disco Diffusion is newest has been the best around for the past few iterations.


For the first and third, I can't, it isn't my code.

For the second, I have to get approval from my employer to release it, but I stalled out halfway through the paperwork and haven't had the energy to keep pushing it forward.



