Recently, Apache Doris 4.0 was officially released, injecting new vitality into the enterprise-grade data analysis platform. This update focuses on three key directions: "AI-driven, search-enhanced, and offline optimization," bringing a series of key features such as vector indexing and AI functions, aiming to enhance user experience and data processing capabilities.
In terms of AI capabilities, Doris 4.0 deeply integrates vector indexing technology, enabling efficient processing of high-dimensional vector data, such as text embeddings and image features. This allows users to perform structured data queries and vector similarity searches using SQL on the same platform, significantly reducing the complexity of AI application development and deployment. Additionally, the newly added AI function library enables data analysts to call large language models through simple SQL statements, completing tasks such as text processing and information extraction.
To meet enterprise-level search requirements, Doris 4.0 has comprehensively upgraded its search functionality, introducing the SEARCH function with a unified entry point. This lightweight DSL full-text retrieval tool makes complex text retrieval more convenient and improves retrieval performance. Users can achieve multi-condition indexing and boolean combination queries with simple syntax, greatly enhancing the flexibility and accuracy of search.
The Apache Doris team had over 200 contributors who provided more than 9,000 optimizations and fixes during the development of this version, offering users more efficient and stable data analysis support. The release of Doris 4.0 marks another technological breakthrough for it in the fields of AI and search.
- GitHub: https://github.com/apache/doris/releases
Key Points:
🌟 New vector indexing feature, supporting efficient storage and retrieval of high-dimensional vector data.
🤖 Deep integration of AI functions, simplifying the process of text processing and information extraction.
🔍 Enhanced search capabilities, providing a more flexible and accurate full-text search experience.