


Databases usher in the AI fast lane, Alibaba Cloud releases new open source technology PilotScope
Source | Xinyan Technology
Text | Jia Ningyu
On December 20, VLDB 2024, a top international database conference, announced a new batch of accepted papers, and Alibaba Cloud's new technology PilotScope made the list. The platform enables "one-click deployment" of AI algorithms in databases, greatly lowering the threshold for applying AI algorithms in databases and opening a new path for database intelligence.
Alibaba Cloud announced on the same day that it will open source all PilotScope technology free of charge.
Why is it difficult to make databases intelligent?
Databases are foundational software that is crucial to the national economy and people's lives. Continuous advances in database technology have an important impact on every industry in the digital era. One frontier area is database intelligence (AI4DB).
Current database systems are very complex and have very high stability requirements. Just integrating and debugging a single AI algorithm with a database requires engineers from both sides to work closely for weeks or even months, which is inefficient and often yields poor results.

More commonly, AI engineers do not understand database internals, database developers do not understand AI, and the two fields do not even share a programming language (AI development mostly uses Python, while databases mostly use C/Java), so a disconnect arises easily.
Companies in the industry usually choose to embed an AI algorithm directly into the database to replace a specific functional module, such as the query optimizer. However, this customized approach makes development, maintenance, and upgrades very expensive. Every time the AI algorithm is upgraded or replaced, the development process has to be redone. Changing the database's code base also brings additional risk.
Because of this, despite the rapid development of artificial intelligence, its results have not yet been widely applied in practice in the database field.
Is there a common platform technology that can apply AI algorithms to databases more effectively?
This question became the starting point for Alibaba's PilotScope project team.
PilotScope project leader Zhu Rong said: "In AI4DB, both AI and DB have people working on them, but the bridge connecting the two has never been built well. We want to build a public bridge between AI algorithms and databases to make communication between the two sides smoother."
Cross-disciplinary innovation from 0 to 1
Zhu Rong described PilotScope as the "super administrator" of database AI. Through the PilotScope platform, AI engineers only need to focus on designing general AI algorithms, which can then be deployed and applied on different databases, while database users can use AI as conveniently and efficiently as calling an API. The idea took about two years from conception to implementation. Zhu Rong said: "It involves the intersection of algorithms and systems, the intersection of AI and databases, the intersection of research and development, and the intersection of academia and industry. It is a true intersection of technology."
According to Zhu Rong, after many rounds of polishing, the project team developed a brand new middleware system platform. By abstracting and generalizing module and interface definitions at both the database and AI system levels, it allows an AI algorithm to be deployed in a database with "one click" within hours or even minutes. This is today's PilotScope.
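To make the middleware idea concrete, here is a minimal Python sketch of what hiding the database side behind one interface could look like. It is not PilotScope's actual API: every name in it (DatabaseAdapter, FakePostgres, FakeSpark, estimate_filter_rows) is hypothetical, and the two adapters return made-up numbers rather than talking to a real database. The point is only that the AI-side function is written once against the interface and runs unchanged against either adapter.

```python
from abc import ABC, abstractmethod


class DatabaseAdapter(ABC):
    """Hypothetical interface a middleware layer could expose to AI algorithms."""

    @abstractmethod
    def table_rows(self, table: str) -> int:
        """Return the number of rows the database reports for a table."""


class FakePostgres(DatabaseAdapter):
    # Stand-in for a PostgreSQL adapter; the values are invented for illustration.
    def table_rows(self, table: str) -> int:
        return {"orders": 1000, "users": 50}[table]


class FakeSpark(DatabaseAdapter):
    # Stand-in for a Spark adapter; the values are invented for illustration.
    def table_rows(self, table: str) -> int:
        return {"orders": 200000, "users": 4000}[table]


def estimate_filter_rows(db: DatabaseAdapter, table: str, selectivity: float) -> int:
    """Toy 'AI algorithm': scale the table size by a predicted selectivity.

    It depends only on the adapter interface, not on any specific database.
    """
    return round(db.table_rows(table) * selectivity)
```

Calling estimate_filter_rows(FakePostgres(), "orders", 0.25) and estimate_filter_rows(FakeSpark(), "orders", 0.25) exercises the same algorithm code against two different back ends, which is the property the article attributes to the middleware design.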
Image description: Alibaba Cloud PilotScope architecture diagram

PilotScope provides more than 10 AI algorithms for mainstream database tasks such as parameter tuning, index recommendation, cardinality estimation, and query optimization, and has been adapted to two mainstream open source databases, PostgreSQL and Spark.
According to experimental data, embedding AI algorithms into the database with PilotScope can speed up tasks such as query optimization by 1 to 2 times compared with the traditional "hard implant" approach. In addition, the extra deployment overhead introduced by PilotScope itself is essentially negligible.
Image description: PilotScope rendering

PilotScope makes only a "micro-intrusion" into the database and introduces mechanisms such as intelligent detection, rollback, and isolation to reduce the risk of AI hallucinations, delivering intelligent improvements while keeping the database stable.
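The detection, rollback, and isolation idea can be illustrated with a small Python sketch. This is an assumption about the general pattern, not PilotScope's implementation: guarded_estimate, learned_estimate, and native_estimate are hypothetical names, and the sanity check (finite, non-negative, no larger than the table) is a deliberately simple stand-in for whatever detection a real system would use. If the learned model crashes or returns an implausible value, the call falls back to the database's own estimate.

```python
import math
from typing import Callable


def guarded_estimate(
    learned_estimate: Callable[[str], float],
    native_estimate: Callable[[str], float],
    query: str,
    table_rows: int,
) -> float:
    """Use the learned estimate only if it passes a sanity check."""
    try:
        value = learned_estimate(query)
    except Exception:
        # Isolation: a crash in the model never propagates to the caller.
        return native_estimate(query)
    # Detection: reject NaN, infinities, negatives, and impossible sizes.
    if not math.isfinite(value) or value < 0 or value > table_rows:
        # Rollback: fall back to the database's built-in estimator.
        return native_estimate(query)
    return value
```

The design choice here is that the native estimator remains the default and the learned one is an optional override, so a faulty model costs some accuracy at worst and never costs availability.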
Zhu Rong said that in the past, AI engineers and database developers had to collaborate and iterate continuously, and it could take weeks or even months to ensure stability. "With the help of our PilotScope, it only takes a few hours or even tens of minutes to go online for testing directly. This zero-to-one technological innovation greatly improves development efficiency."
Open source drives the industrialization of AI4DB
The PilotScope paper has been accepted by VLDB. The VLDB reviewers noted that PilotScope's pioneering, application-driven system design will open up a new direction for database intelligence.
VLDB is one of the three top international database conferences. Each year it accepts only new results with significant impact on academic research and industrial practice, and it is regarded as an authoritative bellwether of database technology. The 50th VLDB conference is scheduled to be held in Guangzhou, China, in August 2024.

Image description: Top database conference VLDB 2024
Zhu Rong said that PilotScope's technology has been open sourced free of charge on GitHub and the ModelScope community. The team hopes to use the power of the open source community to bring more AI algorithms and a wider range of databases into PilotScope, and to explore more AI4DB innovations together with developers.
At the same time, PilotScope has begun pilot applications inside Alibaba Cloud, with corresponding tests under way for industrial deployment.
Zhu Rong said that AI4DB can only generate value in a real production environment, and that the team hopes PilotScope can truly achieve this and help all industries improve the efficiency and effectiveness of database intelligence.
Open source address:
https://github.com/alibaba/pilotscope