Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present the hottest content in the AI field, focusing on developers to help you understand technical trends and innovative AI product applications.
Fresh AI products click to learn more: https://top.aibase.com/
1. This May Day holiday, Xiaohongshu was taken over by Remini's "Clay AI"
During the May Day holiday, a new trend emerged on the Xiaohongshu platform – the "Clay AI" filter, which quickly took over the homepage of Xiaohongshu, becoming the focus of user discussions. Remini's AI clay filter feature has sparked a new wave of enthusiasm globally, demonstrating the potential of AI technology in the field of image processing.
【AiBase Summary:】
📸 Xiaohongshu taken over by "Clay AI", unique clay-style photos go viral
🔥 Remini's AI clay filter feature is popular, users just need to upload pictures to get clay-style photos
🚀 Remini's success proves the huge potential of image processing AI products in meeting user's life and entertainment needs
Details link: https://top.aibase.com/tool/remini-app
2. HeyGen releases automatic editing tool Instant Highlights 1.0
HeyGen has recently released the Instant Highlights 1.0 automatic video editing tool, providing users with a convenient video editing experience. The tool features multilingual dubbing capabilities, simplifying the multi-platform adaptation of video content, and improving content dissemination efficiency. In addition, HeyGen has released the Avatar in Motion 1.0 technology, which enables motion capture and voice cloning of virtual characters, broadening the application potential of virtual characters in various fields. These two new technologies showcase HeyGen's strength and innovative spirit in the AI field.
【AiBase Summary:】
✨ Multilingual dubbing capabilities, simplifying video multi-platform adaptation work, improving content dissemination efficiency.
🌟 Avatar in Motion 1.0 technology enables virtual character motion capture and voice cloning, broadening the application potential.
💡 HeyGen showcases profound strength and innovative spirit in the AI field, bringing users a rich personalized experience.
Details link: https://top.aibase.com/tool/heygen
3. StoryDiffusion: Maintain character consistency, generate multi-image comics and long videos
The StoryDiffusion tool developed by the HVision team at Nankai University can create magical stories, maintain character consistency, and generate multi-image comics and long videos. By implementing Consistent self-attention and Motion predictor, coherent images and videos are generated, which can be used for comic generation, image-to-video conversion, and various other scenarios.
【AiBase Summary:】
🔮 Consistent self-attention achieves consistent character image generation
🎥 Motion predictor achieves long video generation
🎨 Supports comic generation, image-to-video conversion, long and short video content generation functions
Details link: https://top.aibase.com/tool/storydiffusion
4. AI music tool Udio updates, allowing the creation of music up to 15 minutes long
I am very excited about the latest updates to Udio. These updates provide a longer and more coherent music creation experience, bringing more creative freedom and possibilities to music makers.
【AiBase Summary:】
✨ Expanded context window, considering content from the previous two minutes, enhances the coherence of music works
🎵 Supports the creation of up to 15-minute audio tracks, meeting the duration needs of music creation
🌳 Introduces an innovative way to organize audio track history, allowing users to clearly trace the development history of audio track versions
Details link: https://top.aibase.com/tool/udio
5. Adobe introduces 3D icon tool Project Neo for quick 2D to 3D conversion
Adobe's latest Project Neo is a revolutionary 3D technology that enhances the visual effects and production efficiency of traditional 2D graphic design by integrating 3D elements and effects. The tool's fast and efficient illustration creation feature allows users to easily create unique 3D shapes, greatly improving work efficiency. Project Neo has powerful stylization and modeling functions, enhanced color control functions allow users to finely adjust midtones and shadows, adding depth and geometry to design works.
【AiBase Summary:】
✨ 3D technology revolution, enhancing 2D design efficiency
🎨 Fast illustration creation, easily creating unique shapes
🖌️ Powerful stylization functions, finely adjusting colors and shadows
Details link: https://top.aibase.com/tool/project-neo
6. Apple's AI plan exposed: Smarter Siri on the way
Apple is working to improve Siri, using smaller and more efficient models, and plans to make Siri respond intelligently without a wake word in the future. Apple AI has shown various potential applications in health, image editing, Memojis, and other areas, and the company's AI strategy is gradually becoming clear.
【AiBase Summary:】
⭐ Apple is working to improve Siri, using smaller and more efficient models.
⭐ The future of Siri may be able to respond intelligently without a wake word.
⭐ Apple AI has shown various potential applications in health, image editing, Memojis, and other areas.
7. VILA: A multimodal model that understands videos, supports notebook deployment
VILA, a visual language model released by NVIDIA, has video understanding and multi-image understanding capabilities. The latest version, VILA-1.5, supports multiple model scale options, and can be efficiently deployed on various NVIDIA GPUs through the TinyChat and TensorRT-LLM backends.
【AiBase Summary:】
💡 VILA is a visual language model pretrained on large-scale interwoven image-text data
💡 VILA-1.5 is released, with video understanding capabilities, supporting multiple model scale options
💡 VILA can be efficiently deployed on various NVIDIA GPUs through the TinyChat and TensorRT-LLM backends
Details link: https://top.aibase.com/tool/vila
8. NVIDIA's ChatRTX introduces multiple new features
NVIDIA's latest update to ChatRTX introduces multiple new features, including support for more large language models, contrastive language-image pretraining, the Whisper speech recognition system, etc., significantly enhancing the capabilities of chatbot applications. The update reflects NVIDIA's continuous innovation in the fields of AI and RTX acceleration technology, bringing users a smarter and more interactive experience.
【AiBase Summary:】
✨ ChatRTX supports more large language models, including Google's Gemma and the bilingual ChatGLM3, expanding language processing capabilities.
🔍 ChatRTX supports OpenAI's contrastive language-image pretraining (CLIP), allowing users to interact with photos and images on local devices through text.
🎙 ChatRTX supports the Whisper speech recognition system, users can interact with ChatRTX through voice, enhancing the user experience.
Details link: https://blogs.nvidia.com/blog/ai-decoded-chatrtx-update/
9. Brilliant Labs releases Frame: An open-source AR glasses integrated with AI
Brilliant Labs has recently released a pair of open-source AR glasses called Frame, combining artificial intelligence (AI) and augmented reality (AR) technology to bring users an unprecedented interactive experience. The Frame glasses have powerful visual capabilities, collecting and analyzing the image data that users see in real time, providing detailed answers to questions through advanced AI models, enhancing users' understanding and interaction with their surroundings. It supports multimodal interaction and real-time translation functions, combined with the cloud-based Noa AI assistant to achieve more powerful AR functions.
【AiBase Summary:】
👓 Frame glasses combine AI and AR technology to provide an unprecedented interactive experience.
🔍 Frame has powerful visual capabilities, real-time analysis of the image data that users see.
🗣️ Supports multimodal interaction, real-time translation functions, combined with the cloud-based Noa AI assistant to achieve more powerful AR functions.
Details link: https://brilliant.xyz/
10. Rabbit R1 continues to be exposed: AI hype overnight transition, NFT recharge users in tears, the big action model is also a shell
This article reveals the transformation path of Rabbit Company under the AI hype and the plight of its NFT recharge users. The article points out that the company's big action model LAM relied on OpenAI's interface but was questioned as a shell for Android. At the same time, the company's transition from the metaverse to AI terminals has raised doubts and concerns from users.
【AiBase Summary:】
🔍 Rabbit Company undergoes overnight transition under AI hype, NFT recharge users face difficulties.
💥 The company's big action model LAM, relying on OpenAI's interface, is questioned as a shell for Android.
🔄 The company transitions from the metaverse to AI terminals, raising user doubts and concerns.
Details link: https://twitter.com/EmilyLShepherd/status/1786037498507853852