AI application

Stableboost generates up to 200 images at once with Stable Diffusion

Even with a perfect prompt, there is a virtually infinite number of ways an AI-generated image can turn out. The Stableboost website shows up to 200 images after a single run. DALL-E 2 and Midjourney generate four images per run; DreamStudio, Stability AI’s official web app for Stable Diffusion, generates up to nine, but even …

Why Alexa Still Can’t Continue Smooth Dialogs Despite Significant AI Advances

Set an alarm, ask about the weather: Alexa easily understands simple commands. But beyond that, things get complicated. Why is that? Compared to large language models (LLMs) like GPT-3, voice assistants like Alexa and Google Assistant seem rather unremarkable. Real conversations do not take place; the systems only understand trivial commands immediately …

Google is looking into self-acting AI for code, but not everyone thinks it’s a good idea

Google wants to optimize and rewrite code with AI. The tech giant is also said to be planning a major breakthrough in generative AI. But there is also skepticism about code-writing AI. In 2018, Google put artificial intelligence at the center of its operations and even rebranded its research department as Google AI. Since then, …

Sony introduces the GANstrument neural synthesizer

Sony AI researchers present GANstrument, a neural synthesizer that transforms arbitrary input sounds into instrument sounds. Generative AI systems such as DALL-E 2, Midjourney or Stable Diffusion are currently shaking up the visual arts. Text-to-image systems produce impressive results even from simple text inputs. Comparably powerful systems do not yet exist for music. But …

Meta’s diplomatic AI can negotiate, persuade and cooperate

CICERO is Meta’s latest AI system. It can negotiate with humans in natural language, convince them of strategies and cooperate with them. The strategic board game “Diplomacy” serves as a reference. According to Meta, CICERO is the first language AI capable of playing “Diplomacy” at a human level. In Diplomacy, players …

Nvidia’s Magic3D turns text into high-resolution 3D objects

Nvidia’s Magic3D can create 3D objects based on text input. The model is said to significantly outperform Google’s DreamFusion text-to-3D model, which was only introduced in September. Like DreamFusion, Magic3D is essentially based on an image generation model that uses text to create images from different perspectives, which in turn serve as input …

Paella is a compact and powerful text-to-image AI model

An international team of researchers presents Paella, a performance-optimized text-to-image AI model. Currently, the best-known text-to-image AI systems, such as Stable Diffusion and DALL-E 2, are based on diffusion models for image generation and transformers for language understanding. This allows the generation of high-quality images from text input. However, the systems require multiple …
