AI is pure madness !!

This week in AI was pure madness.

- Grok’s AI Vision
- Genspark AI Slides
- Perplexity Assistant
- OpenAI gpt-image-1
- Tavus SoTA lipsync model
- Dia SoTA speech AI model
- Dreamina AI's Top AI Image
- ChatGPT Deep Research Mini

Here's EVERYTHING you need to know:

1) Grok Vision debuts with multimodal features, allowing real-time analysis via phone cameras, plus multilingual audio and real-time search.

[2] Genspark's AI Slides revolutionizes presentation creation by researching topics, generating visuals, and converting documents into polished slides.

[3] Perplexity Assistant launches on iOS, featuring voice functions and multi-app tasks like booking dinner, rides, and setting reminders.

[4] OpenAI's gpt-image-1 is now available via API for third-party apps, enabling ChatGPT's popular image generation. This model debuted in ChatGPT in late March, generating over 700 million images in its first week.

[5] Tavus unveils a state-of-the-art lipsync model that achieves unprecedented realism in speech-to-video synthesis.
It enables perfect lip synchronization with natural facial expressions.

[6] Nari Labs launches Dia 1.6B, an AI with impressive emotional range, able to laugh and cough! Try it now on HuggingFace.

[7] Dreamina AI launches Seedream 3.0
It ranks #1 at creating photorealistic images up to 2k resolution and it can also upscale, inpaint, expand and even generate videos.

[8] OpenAI introduces ChatGPT Deep Research Mini, offering essential features with fewer resources.

When the original version reaches its limits, queries switch to this lightweight version.














Comments

Popular posts from this blog

Sholay - Jai-Mausi (Property Indexation)

A Leap of Faith : Deepika Kapoor (Startup Space)

P/E ratio can make you rich ! Lets find out.