Welcome to the 【AI Daily】 column! This is your guide to exploring the world of artificial intelligence every day. Here we present the hot topics in the AI field daily, focusing on developers and helping you gain insight into technical trends and understand innovative AI product applications.
Fresh AI products click to learn more:https://top.aibase.com/
1. Midjourney launches its first video generation model V1: Supports up to 21 seconds, priced at $10 per month
Midjourney has released its first AI video generation model V1, primarily focusing on image-to-video conversion functionality, supporting multiple dynamic modes and customizable text prompts. It offers an affordable pricing plan and is easy to operate, but it faces copyright disputes and there is room for technical optimization, such as minor flickering issues in high-dynamic scenes.
[AiBase Summary:]
🌟 Focuses on image-to-video conversion with support for multiple dynamic modes and customizable text prompts.
💰 Affordable pricing, starting at $10 per month to experience video generation functions.
⚠️ Faces copyright disputes, and technology still has room for optimization, such as minor flickering problems in high-dynamic scenes.
2. OpenAI CEO announces: GPT-5 will be released this summer
This article provides a detailed introduction to OpenAI's development dynamics, including the release time of GPT-5, adjustments to its cooperation with Microsoft, and breakthrough progress in defense fields, showcasing OpenAI's continued leadership in the field of artificial intelligence.
[AiBase Summary:]
🚀 OpenAI CEO confirms that GPT-5 will be released this summer, and the industry is eagerly looking forward to it.
💰 OpenAI plans to renegotiate its cooperation agreement with Microsoft to enhance its market independence.
🛡️ OpenAI signs a $200 million contract with the U.S. Department of Defense, marking its rise in the defense sector.
3. Google Search Live goes live! Voice dialogue search revolutionizes the experience, with AI assistants always on standby!
Google's Search Live voice search function, based on AI Mode, allows users to engage in real-time dialogues with search engines via voice, providing seamless interaction experiences.
[AiBase Summary:]
✨ Real-time voice dialogue: Users can ask questions through voice and receive AI-generated voice answers, supporting continuous follow-up questions.
🌐 Web link assistance: Each answer comes with relevant links to ensure information transparency and credibility.
🌟 Technology integration: Combining Gemini models and Astra technology to handle complex voice inputs and generate natural and coherent responses.
4. OpenAI releases an open-source customer service agent framework to help enterprises transform digitally
I learned that OpenAI has released an open-source customer service agent example, which made me very excited. This example not only demonstrates how to build intelligent AI agents but also provides detailed safeguards and practical application cases. Through this framework, enterprises can more easily automate customer service, improve efficiency, and reduce costs.
[AiBase Summary:]
🚀 Use OpenAI agent SDK to build intelligent and workflow-aware AI agents, supporting various business scenarios.
🔒 Set safety and relevance safeguards to ensure system security and stable operation.
📖 Provide Python backend and Next.js frontend, demonstrating actual applications of multi-agent collaboration and safeguard mechanisms.
5. MiniMax Agent is officially released! From 'Give me code' to 'Tell me your needs', AI smart agents are revolutionizing workflows!
MiniMax Agent is an intelligent agent specifically designed to solve long-term complex tasks. It features expert-level multi-step planning capabilities, flexible task decomposition mechanisms, and end-to-end execution efficiency. By deeply understanding user needs, it automatically completes task planning and execution, allowing users to focus on higher-value creativity and decision-making.
[AiBase Summary:]
✨ Key highlights: Multi-scenario empowerment, including programming, multimodal understanding, and seamless MCP integration, meeting individual and enterprise team needs.
💻 Functional advantages: Freeing users from tedious coding by understanding requirements for efficient task planning and execution.
🌟 Industry impact: Smart agents lead the future, reshaping the landscape of productivity tools and promoting intelligent and automated development.
6. New variants of malicious tool WormGPT reappear, using Mistral AI and Grok models to write malicious code
Recently, Cato Network discovered two new versions of WormGPT based on Grok and Mixtral, which can help cybercriminals generate phishing emails, malicious code, and bypass AI security protections. This indicates that cybercrime is upgrading its methods by leveraging advanced AI technologies.
[AiBase Summary:]
⚠️ New versions of WormGPT based on Grok and Mixtral models are specifically used for cybercriminal activities.
🔒 These tools can bypass ethical defenses on AI platforms to generate malicious scripts and steal credentials.
🛡️ Cybersecurity experts call for strengthening defensive strategies, such as improving threat detection and response capabilities.
7. OpenAI launches discounts for ChatGPT Enterprise Edition, with discount rates ranging from 10% to 20%
OpenAI has launched discounts for ChatGPT Enterprise Edition, attracting enterprise users and reducing usage costs. It is expected that by 2030, annual revenue from enterprise customers could reach $15 billion.
[AiBase Summary:]
🚀 OpenAI offers discounts for the ChatGPT Enterprise Edition, with a range of 10%-20%, helping businesses reduce costs and increase efficiency.
🌟 ChatGPT, as a conversation generation tool, is widely adopted, driving the popularization of AI technology.
📈 By 2030, annual revenue from enterprise customers is expected to reach $15 billion, showing the huge potential of the AI market.
8. DeepSite V2 upgrade! Supports DeepSeek-R1-0528 model, easily generating 3D web animations, no coding required to play with creativity!
DeepSite V2 significantly enhances code generation capabilities and real-time preview experiences by integrating the DeepSeek-R1-0528 model, allowing users to generate complex webpage codes, including HTML, CSS, and JavaScript, simply by describing them. It is suitable for both developers and non-professionals.
[AiBase Summary:]
🚀 Supports natural language generation of complex codes, such as 3D animations, and generates runnable code within seconds just by entering descriptions.
🌐 Real-time preview and adjustment functions allow users to instantly view results and optimize outputs to ensure they meet expectations.
🌱 Completely open source and free, supporting multimodal tasks, covering webpages, games, effects, and 3D interactive content, lowering development thresholds.
Details link: https://deepsite.hf.co/projects/new
9. AI becomes a PPT master! Office-PowerPoint-MCP-Server is online, automatically generating professional reports, doubling efficiency!
Office-PowerPoint-MCP-Server is an open-source tool based on Model Context Protocol (MCP). It assists users in quickly creating and editing PowerPoint presentations through AI, offering multiple functions from generating entirely new PPTs to fine-tuning existing files.
[AiBase Summary:]
🌟 Supports creating entirely new PPTs or editing existing files, covering slide management, content filling, and data visualization functions.
📊 Seamlessly integrates AI assistants, generating PPTs through natural language commands or code in batches, greatly increasing enterprise report generation efficiency.
🌐 Open-source characteristics allow developers to customize functions, such as integrating image generation models or connecting with external data sources, expanding application scenarios.
Details link: https://github.com/GongRzhe/Office-PowerPoint-MCP-Server
10. BYD and ByteDance partner to develop key battery technologies using AI
BYD and ByteDance have jointly built the 'AI + High-Throughput Joint Laboratory' to leverage AI technology to promote battery development, solving technical challenges such as fast charging, lifespan, and safety, accelerating the iteration cycle of batteries, and injecting new momentum into the new energy vehicle industry.
[AiBase Summary:]
🌟 BYD and ByteDance jointly establish a joint laboratory to research core battery technologies.
⚙️ Sharing algorithms, computing power, and experimental data to overcome critical issues like fast charging, lifespan, and safety.
🚀 Accelerating the battery iteration cycle, promoting the discovery of new materials and formulas, and driving technological advancements in the industry.
11. Musk refutes rumors of massive losses at xAI: Monthly burn rate of $1 billion is pure nonsense
xAI was reported to spend $1 billion monthly, but Musk denied this claim, calling it a rumor. xAI is seeking $9.3 billion in financing, expecting a full-year loss of $13 billion, but Musk remains confident about its future profitability.
[AiBase Summary:]
🌟 The rumor of xAI spending $1 billion monthly has been denied by Musk, considering it groundless.
💰 xAI is seeking $9.3 billion in financing to cover funding gaps, expecting a full-year loss of $13 billion.
🚀 Despite the massive losses, Musk remains optimistic about xAI's future development, targeting profitability by 2027.