#2 Generative context menus, Proactive agents & More

👋 Hey designers and innovators,

Dive into this week's Witbits—your growing list of experiments and insights into Human-AI design.

Generative context menus

Based on your selection and situation, context menus can help you discover actions and access them quickly.

A screenshot of a text excerpt titled "Philosophical Perspectives on Love," with a highlighted section discussing Plato's views on love as a motivation for individuals, referencing his "Symposium." To the right, a context menu with options like Copy, Print, Read aloud, and other suggestions is visible. The background is pink.

Realtime image generation

Realtime generation allows people to manipulate content instantly, giving them more agency in using generative AI as a tool for exploration.

This image displays a 3D glossy robot toy with a round blue body and a white face, featuring rosy cheeks and a smile. The toy has two black antennae on top of its head. The interface shows different views of the toy on the right side, and the caption below the main image reads, "3D tiny glossy robot toy."

Proactive agents

Proactive agents can autonomously initiate conversations and actions based on previous interactions and context providing timely and relevant assistance.

This image shows a chat interface with three messages on an orange background. The first message, in a blue bubble, says "Great advice!" at 22:04. The second message, in a dark gray bubble, reads "Of course, That’s what I’m here for. See ya!" also at 22:04. There is a notification for "1 unread message" followed by the third message, in another dark gray bubble, asking "Did everything go well yesterday?" at 11:11.

Evolve your output

Presenting multiple outputs helps users explore and identify their preferences and provides valuable insights into their choices, even enabling model improvement.

This image features four abstract visual representations, possibly generated by an AI called Midjourney, showcasing "diffusion AI grain." The visuals include a central spiral structure, a flowing wave-like formation, a complex geometric mesh, and a field of wheat. The highlighted selection indicates "V2" with a cursor pointing to it.

✨ What sparked our curiosity this week

Gregg Brockman from OpenAI introduced the GPT-4o model, highlighting its advanced capabilities. GPT-4o excels in understanding and describing video inputs, capturing scene nuances. Two agents, both powered by GPT-4o, can communicate: one analyzes visuals, the other asks questions. While Gregg is able to intervene to steer the discussion. This model's blend of visual comprehension with interactive EXPRESSIVE dialogue marks a paradigm shift in human-machine interaction!

👋

See you in a week!

Explore all WitBits on witlist.design