Home Database Mysql Tutorial MySQL的词法分析纵谈

MySQL的词法分析纵谈

Jun 07, 2016 pm 04:26 PM
mysql analyze

MySQL的词法分析漫谈 这个链接上有点介绍,可以了解个大概:http://blog.imaginea.com/mysql-query-parsing/ 关键点: 1. SQL解析包括语法分析器和词法分析器。 简便的做法是用bison/flex组合。不过MySQL的词法分析器是手工打造的。 语法分析器的入口函数是MY

MySQL的词法分析漫谈
这个链接上有点介绍,可以了解个大概:http://blog.imaginea.com/mysql-query-parsing/   

关键点:
1. SQL解析包括语法分析器和词法分析器。
   简便的做法是用bison/flex组合。不过MySQL的词法分析器是手工打造的。
   语法分析器的入口函数是MYSQLparse,词法分析器的入口函数是MYSQLlex。
2. 词法分析中会检查token是否为关键字。
    最直接的做法是弄个大的关键字数组,进行折半查找。MySQL在此做了些优化。
   本文主要介绍的是这一部分。

考虑到关键字是一个只读的列表,对它做一个只读的查找树可以改善查找的性能。
产生查找树:
1. 读取关键字数组,产生一个Trie树。
2. 调整这棵树,并产生一个数组(也就是一个不用链表表示的树)。

使用查找树:
这个比较简单,直接看函数get_hash_symbol好了。

产生查找树,相关的Makefile规则:     
In `sql/CMakeFiles/sql.dir/build.make':

sql/lex_hash.h: sql/gen_lex_hash
  $(CMAKE_COMMAND) -E cmake_progress_report /home/zedware/Workspace/mysql/CMakeFiles $(CMAKE_PROGRESS_153)
  @$(CMAKE_COMMAND) -E cmake_echo_color --switch=$(COLOR) --blue --bold "Generating lex_hash.h"
  cd /home/zedware/Workspace/mysql/sql && ./gen_lex_hash > lex_hash.h

容易发现,最主要的函数就是`get_hash_symbol',它主要的调用关系为:

/* sql/lex_hash.h */
get_hash_symbol->sql_functions_map
get_hash_symbol->symbols_map

/* sql/sql_lex.cc */
find_keyword->get_hash_symbol
is_keyword->get_hash_symbol
is_lex_native_function->get_hash_symbol

文件"gen_lex_hash.cc"注释中的树的示例:

+-----------+-+-+-+
|       len |1|2|3|
+-----------+-+-+-+
|first_char |0|0|a|
|last_char  |0|0|d|
|link       |0|0|+|
                 |
                 V
       +----------+-+-+-+--+
       |    1 char|a|b|c|d |
       +----------+-+-+-+--+
       |first_char|d|0|0|0 |
       |last_char |n|0|0|-1|
       |link      |+|0|0|+ |
                   |     |
                   |     V
                   |  symbols[2] ( "DAY" )
                   V
+----------+--+-+-+-+-+-+-+-+-+-+--+
|    2 char|d |e|f|j|h|i|j|k|l|m|n |
+----------+--+-+-+-+-+-+-+-+-+-+--+
|first_char|0 |0|0|0|0|0|0|0|0|0|0 |
|last_char |-1|0|0|0|0|0|0|0|0|0|-1|
|link      |+ |0|0|0|0|0|0|0|0|0|+ |
            |                    |
            V                    V
         symbols[0] ( "ADD" )  symbols[1] ( "AND" )

如果你还记得Trie树,理解起来会容易一点。下面是不同的输入数组对应的树。
i=0

+-----------+-+--+
|       len |1| 2|
+-----------+-+--+
|first_char |0|-1|
|last_char  |0| 0|
|char_tails |0| x|
|ithis      |0| 0|
|iresult    |0| 0|
                |
               &&

static SYMBOL symbols[] = {
  { "&&",   SYM(AND_AND_SYM)},

static uchar symbols_map[8]= {
0,   0,   1, 0,                    0,   0,   0, 0,                    };

i=1

+-----------+--+--+
|       len | 1| 2|
+-----------+--+--+
|first_char |-1|-1|
|last_char  | 0| 0|
|char_tails | x| x|
|ithis      | 0| 0|
|iresult    | 1| 0|
              |  |
             
static SYMBOL symbols[] = {
  { "&&",   SYM(AND_AND_SYM)},
  { "
static uchar symbols_map[8]= {
0,   0,   1, 0,                    0,   0,   0, 0,                    };
             
i=2

+-----------+--+--+
|       len | 1| 2|
+-----------+--+--+
|first_char |-1| &|
|last_char  | 0| |char_tails | x| ^|
|ithis      | 0| 0|
|iresult    | 1| x|
              |  |
                               |          
       +----------+--+--+   +--+
       |    1 char| &|  |...|        +----------+--+--+   +--+
       |first_char|-1| 0|   |-1|
       |last_char | 0| 0|   | 0|
       |char_tails| 0| 0|   | x|
       |ithis     | 0| 0|   | 0|
       |iresult   | 0| 0|   | 2|
                   |          |
                   &&       
static SYMBOL symbols[] = {
  { "&&",   SYM(AND_AND_SYM)},
  { "   { "
static uchar symbols_map[100]= {
0,   0,   1, 0,
'&', ' 0,   0,   0, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   3, 0,
0,   0,   2, 0,
};

i=3

+-----------+--+--+
|       len | 1| 2|
+-----------+--+--+
|first_char |-1| &|
|last_char  | 0| |char_tails | x| ^|
|ithis      | 0| 0|
|iresult    | 1| x|
              |  |
                               |          
       +----------+--+--+   +--+
       |    1 char| &|  |...|        +----------+--+--+   +--+
       |first_char|-1| 0|   |-1|
       |last_char | 0| 0|   | 0|
       |char_tails| 0| 0|   | x|
       |ithis     | 0| 0|   | 0|
       |iresult   | 0| 0|   | p|
                   |          |
                   &&         |
                              |
                   +----------+--+--+
                   |    2 char| =| >|
                   +----------+--+--+
                   |first_char|-1|-1|
                   |last_char | 0| 0|
                   |char_tails| x| x|
                   |ithis     | 0| 0|
                   |iresult   | 2| 3|
                                |  |
                              
                              
static SYMBOL symbols[] = {
  { "&&",   SYM(AND_AND_SYM)},
  { "   { "   { "",   SYM(NE)},

static uchar symbols_map[108]= {
0,   0,   1, 0,
'&', ' 0,   0,   0, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
0,   0,   4, 0,
'=', '>', 25, 0,
0,   0,   2, 0,
0,   0,   3, 0,
};
                              
可以看到,数组表示中存在一定的空间浪费。要是不怕麻烦,我们还可以去榨出一点油水来。
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Roblox: Bubble Gum Simulator Infinity - How To Get And Use Royal Keys
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Nordhold: Fusion System, Explained
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Mandragora: Whispers Of The Witch Tree - How To Unlock The Grappling Hook
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Java Tutorial
1666
14
PHP Tutorial
1273
29
C# Tutorial
1252
24
Laravel Introduction Example Laravel Introduction Example Apr 18, 2025 pm 12:45 PM

Laravel is a PHP framework for easy building of web applications. It provides a range of powerful features including: Installation: Install the Laravel CLI globally with Composer and create applications in the project directory. Routing: Define the relationship between the URL and the handler in routes/web.php. View: Create a view in resources/views to render the application's interface. Database Integration: Provides out-of-the-box integration with databases such as MySQL and uses migration to create and modify tables. Model and Controller: The model represents the database entity and the controller processes HTTP requests.

MySQL and phpMyAdmin: Core Features and Functions MySQL and phpMyAdmin: Core Features and Functions Apr 22, 2025 am 12:12 AM

MySQL and phpMyAdmin are powerful database management tools. 1) MySQL is used to create databases and tables, and to execute DML and SQL queries. 2) phpMyAdmin provides an intuitive interface for database management, table structure management, data operations and user permission management.

MySQL vs. Other Programming Languages: A Comparison MySQL vs. Other Programming Languages: A Comparison Apr 19, 2025 am 12:22 AM

Compared with other programming languages, MySQL is mainly used to store and manage data, while other languages ​​such as Python, Java, and C are used for logical processing and application development. MySQL is known for its high performance, scalability and cross-platform support, suitable for data management needs, while other languages ​​have advantages in their respective fields such as data analytics, enterprise applications, and system programming.

Solve database connection problem: a practical case of using minii/db library Solve database connection problem: a practical case of using minii/db library Apr 18, 2025 am 07:09 AM

I encountered a tricky problem when developing a small application: the need to quickly integrate a lightweight database operation library. After trying multiple libraries, I found that they either have too much functionality or are not very compatible. Eventually, I found minii/db, a simplified version based on Yii2 that solved my problem perfectly.

Laravel framework installation method Laravel framework installation method Apr 18, 2025 pm 12:54 PM

Article summary: This article provides detailed step-by-step instructions to guide readers on how to easily install the Laravel framework. Laravel is a powerful PHP framework that speeds up the development process of web applications. This tutorial covers the installation process from system requirements to configuring databases and setting up routing. By following these steps, readers can quickly and efficiently lay a solid foundation for their Laravel project.

Solve MySQL mode problem: The experience of using the TheliaMySQLModesChecker module Solve MySQL mode problem: The experience of using the TheliaMySQLModesChecker module Apr 18, 2025 am 08:42 AM

When developing an e-commerce website using Thelia, I encountered a tricky problem: MySQL mode is not set properly, causing some features to not function properly. After some exploration, I found a module called TheliaMySQLModesChecker, which is able to automatically fix the MySQL pattern required by Thelia, completely solving my troubles.

MySQL: Structured Data and Relational Databases MySQL: Structured Data and Relational Databases Apr 18, 2025 am 12:22 AM

MySQL efficiently manages structured data through table structure and SQL query, and implements inter-table relationships through foreign keys. 1. Define the data format and type when creating a table. 2. Use foreign keys to establish relationships between tables. 3. Improve performance through indexing and query optimization. 4. Regularly backup and monitor databases to ensure data security and performance optimization.

MySQL: Key Features and Capabilities Explained MySQL: Key Features and Capabilities Explained Apr 18, 2025 am 12:17 AM

MySQL is an open source relational database management system that is widely used in Web development. Its key features include: 1. Supports multiple storage engines, such as InnoDB and MyISAM, suitable for different scenarios; 2. Provides master-slave replication functions to facilitate load balancing and data backup; 3. Improve query efficiency through query optimization and index use.

See all articles