4 comments

  • DonThomasitos 6 minutes ago

    Nice summary! I missed the mention of EQ-VAE when it comes to generation quality. Tiny trick, huge impact! Have you tried it?

    • lastdong 39 minutes ago

      This seems like a great model to experiment fine tuning with original art, given it’s relatively small and with open license. Is that a fair assessment?

      Thanks for the great write up and making it available to us all.

      • schopra909 31 minutes ago

        yep, Apache 2.0! so anyone's welcome to download and hack away

      • schopra909 1 day ago

        Hi HN, I’m one of the two authors of the post and the Linum v2 text-to-video model (https://news.ycombinator.com/item?id=46721488). We're releasing our Image-Video VAE (open weights) and a deep dive on how we built it. Happy to answer questions about the work!

        • fjejfhdh 50 minutes ago

          I take my children to school to learn them how to use English grammar.