AI is pure madness !!
This week in AI was pure madness.
- Grok’s AI Vision
- Genspark AI Slides
- Perplexity Assistant
- OpenAI gpt-image-1
- Tavus SoTA lipsync model
- Dia SoTA speech AI model
- Dreamina AI's Top AI Image
- ChatGPT Deep Research Mini
Here's EVERYTHING you need to know:
1) Grok Vision debuts with multimodal features, allowing real-time analysis via phone cameras, plus multilingual audio and real-time search.
[2] Genspark's AI Slides revolutionizes presentation creation by researching topics, generating visuals, and converting documents into polished slides.
[3] Perplexity Assistant launches on iOS, featuring voice functions and multi-app tasks like booking dinner, rides, and setting reminders.
[4] OpenAI's gpt-image-1 is now available via API for third-party apps, enabling ChatGPT's popular image generation. This model debuted in ChatGPT in late March, generating over 700 million images in its first week.
[5] Tavus unveils a state-of-the-art lipsync model that achieves unprecedented realism in speech-to-video synthesis.
It enables perfect lip synchronization with natural facial expressions.
[6] Nari Labs launches Dia 1.6B, an AI with impressive emotional range, able to laugh and cough! Try it now on HuggingFace.
[7] Dreamina AI launches Seedream 3.0
It ranks #1 at creating photorealistic images up to 2k resolution and it can also upscale, inpaint, expand and even generate videos.
[8] OpenAI introduces ChatGPT Deep Research Mini, offering essential features with fewer resources.
When the original version reaches its limits, queries switch to this lightweight version.
Comments
Post a Comment