Skyward Synapses: AI Insights #3

Midjourney just got better at generating hands!

Hello, AI enthusiasts, and welcome to this week's Skyward Synapses: AI Insights!

In today’s email, we will talk about:

  • Google is trying to catch up with OpenAI with its updated LLM.

  • Midjourney 5.1 is significantly better at generating hands!

  • Are you considering buying ChatGPT Plus? Check out poe.com instead!

  • Google MusicLM is great at generating music!

Let’s get started!

🎬 Video of the week

This week Google held its keynote, where it talked about recent upgrades to its AI chat system, Bard, and introduced its updated LLM, called PaLM 2.

The whole Keynote is 2 hours long, but the part regarding LLMs lasts around 10 minutes. You can check it out below:

I’m very excited to see some competition in the space of LLM systems. OpenAI definitely has the upper hand for now, but it would be great to see Google try to bridge the gap!

I summarized the new capabilities of Bard in a Medium article.

💡 Idea of the week

The idea of the week is music generation with AI!

Google now gives access to an early version of MusicLM - its latest music generation model.

MusicLM is poised to take center stage in the realm of music generation. Building on Google’s earlier research, AudioLM, this model synthesizes high-fidelity audio from text prompts. From modifying existing audio based on text prompts to generating diverse music compositions, MusicLM is set to transform how we create and experience music.

In this Twitter thread, you can see what the model is capable of:

🧩 Prompt of the week

This week I played with Midjourney 5.1. I wrote a summary of my findings in this thread:

The biggest thing I found out is that the new version of Midjourney is significantly better at generating hands! It still sometimes generates anomalies, but it happens less frequently. Look at the results I was able to get with these three prompts:

A cinematic photography of romantic couple holding their hands, artistic image, realistic, romantic, hands close up --ar 3:2 --v 5.1 --style raw

A cinematic photography of romantic couple holding their hands, sitting at a cafe in Paris, artistic image, realistic, romantic --ar 3:2 --v 5.1 --style raw

Two people learning how to play guitar, each holding their own guitars, artistic image, whole body shot, realistic --ar 3:2 --v 5.1 --style raw

In my opinion, the results are mind-blowing!

📚 Resources

poe.com might be an interesting alternative to ChatGPT Plus. At the same price level, it gives you access to Claude-instant-100k - a model with a HUGE context window (100k tokens). In comparison, GPT-4 has a context window of only 8k tokens!

A context of 100k tokens will let you summarize whole books and long articles in one go (it is equal to roughly 75,000 words, or well over 100 pages of text).
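
If you want to sanity-check the page figure yourself, here is a minimal back-of-the-envelope sketch in Python. The two ratios in it (about 0.75 English words per token and about 500 words per page) are rule-of-thumb assumptions of mine, not official numbers from any vendor, and the function name is just for illustration. Denser or sparser pages will shift the result a lot, so treat it as a ballpark only:

```python
# Rule-of-thumb assumptions (NOT official figures):
#   - roughly 0.75 English words per token
#   - roughly 500 words per single-spaced page
WORDS_PER_TOKEN = 0.75
WORDS_PER_PAGE = 500


def estimate_pages(context_tokens: int) -> float:
    """Estimate how many pages of English text fit in a context window."""
    words = context_tokens * WORDS_PER_TOKEN
    return words / WORDS_PER_PAGE


# Compare the two context sizes mentioned above.
for name, tokens in [("GPT-4 (8k)", 8_000), ("Claude-instant-100k", 100_000)]:
    print(f"{name}: ~{estimate_pages(tokens):.0f} pages")
```

Under these assumptions, the 8k window comes out at about 12 pages and the 100k window at about 150 pages. Change WORDS_PER_PAGE to match the kind of document you have in mind.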

You can check out more details in this post:

To infinity and beyond,

Luke Skyward