regular expression
1. Regular expression
1. Type of matching characters
[a-z]: lowercase letters
[A-Z]: Uppercase letters
[a-Z]: Small or uppercase letters
[0-9]: Numbers
[a-zA-Z0-9]: Matches a character that is a letter or number
. : Matches any character, except spaces
[0-f]: Hexadecimal number
abc | def: abc or def
a (bc | de) f: abcf or adef
\<: The first word is usually separated by spaces or special characters, and the continuous string is regarded as the word
\>: Word ending
[^expression]: All characters except lowercase letters, and so on.
2, followed by the following symbols to control the number of matches
The left side of such symbols must have the expression of the first point above
Expression*: 0 or n characters
Expression+: 1 or n characters
Expression?: 0 or 1 characters
Expression {n}: n characters
Expression {n:m}: n to m Characters
Expression {n,}: at least n characters
[Example] [a-z]* means matching 0 Or multiple lowercase letters
3. Control the matching characters at the beginning and end
^ Expression: The head matches
Expression$: The tail matches
2. Three major Linux text processing tools
1. egrep filtering tool
Extended version of grep, you can use regular expressions
Syntax:
egrep - option 'regular expression' file name
Options:
-n: Display line number
- ##-o: Display only matching content
- -q: Silent mode, no output, you have to use $? to judge whether the execution is successful, that is, whether the desired content is filtered
- - l: If the match is successful, only the file name will be printed. If it fails, it will not be printed. Usually -rl is used together, grep -rl 'root' /etc
- -A: If it matches If successful, the matching line and the following n lines will be printed together
- -B: If the match is successful, the matching line and the first n lines will be printed out
- -C: If the match is successful, print out the matching line and n lines before and after it
- --color
- -c: If the match is successful, print out the number of matched lines
- -i: Ignore case
- - v: Negate, do not match
- -w: Match the word
Grammar:
Syntax 1: sed - option 'numeric positioning + command' file nameOption:
- -n: Silent mode, no output
- -e: Multiple edits, this is not very clear
- -i: Direct modification File content instead of output
- -r: Extended mode, you can use regular expressions
- -f: Specify the file name, the action Write in a new file
Positioning:
① Numeric positioning (input line number positioning)
- 1: Single line
- 1,3: Range from the first line to the third line
- 2 ,+4: Several lines after the matching line
- 4,~3: From the fourth line to the next multiple of 3
- 2 ~3: Every three lines starting from the second line
- $: The last line
- 1!: Lines other than the first line
【Example】sed -n '1p' /etc/passwd
②Regular expression positioning
- Regular expressions must be wrapped with //
- Expanding regular expressions requires the -r parameter or escaping
- Replace sub-patterns that can use regular expressions, that is, parentheses (), \1 and \2 can represent sub-patterns
[Example] sed -r 's/ (.)(.)/\2\1/ file1 means to replace the first and second parts of the match
*Greedy option: fill in g, which means to replace all the matching parts in one line Matching item replacement
Command:
- a: Append,
- c ∶ Change change,
- d ∶ Delete delete,
- #i ∶ Insert, i can be followed by strings, and these strings Will appear on a new line (the current previous line)
- p: print print
- s: replace substitute, you can replace it directly work. Usually this s action can be paired with a regular expression. For example, 1,20s/old/new/g
*s command special instructions:
Use {Command 1: Command 2: Command 3} Multiple commands can be addedsCommand syntax: sed -r 'Replacement command s/regular expression/replacement content/greedy option g' File name3, awk text analysis toolComposed of commands, regular expressions (need to be surrounded by //), comparisons and relational operationsUse the -F parameter in option to define the interval symbolUse the order of $1, $2, $3, etc. to represent the different fields in each column separated by spacers in each row of files. The NF variable represents the number of fields in the current record.
Syntax
awk - Option parameters 'Logical judgment {command variable 1, variable 2, variable 3}' File name
Option
-F Define field separator , the default delimiter is consecutive spaces or tabs
-v. Define variables and assign values. You can also use the borrowed method to introduce
AWK variable
NR The number of current records (statistics after all files are connected)
FNR The number of current records (only statistics for the current file, not all)
FS field separator defaults to consecutive spaces or tabs, and multiple different symbols can be used for separation. Symbol -F[:/]
OFS The default separator for output characters is a space
[OFS example]
# awk -F: 'OFS="=====" {print $1,$2}' /etc/passwd
root===== x
NF The number of fields in the currently read row
ORS The output record separator defaults to newline
【ORS example】
# awk -F: 'ORS="=====" {print $1,$2}' /etc/ passwd
root x=====bin x=====
FILENAME Current file name
[Example 1] Using AWK variables
# awk '{print NR,FNR,$1}' file1 file2
1 1 aaaaa
2 2 bbbbb
3 3 ccccc
4 1 dddddd
5 2 eeeeee
6 3 ffffff
#[Example 2]How to quote shell variables
# a=root
# awk -v var=$a -F: '$1 == var {print $0}' /etc/passwd
Or split the entire command and pass it to expose the shell variables,
# awk -F: '$1 == "'$a'" {print $0}' /etc/passwd
# a=NF
# awk -F: '{print $'$a'}' /etc/passwd
Logical operations (can directly reference fields for operations)
= += -= / = *=: Assignment
&& || !: Logical and logical or logical non-
~ !~: Match regular or not match, Regular expressions need to be surrounded by /regular/
- ##< <= > >= != ==: relationship, when comparing strings, the strings must be enclosed in double quotes
- $: Field references need to be added with $, while variable references are directly taken from variable names
- + - * / % ++ --: Operation Symbol
Escape sequence
- \\ \self
- \$ escape$
- \t tab character
- \b backspace character
- \r Carriage return character
- \n Line feed character
- \c Cancel line feed
Please correct me if there are any errors. For more details, please refer to:
The above is the detailed content of regular expression. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

VS Code system requirements: Operating system: Windows 10 and above, macOS 10.12 and above, Linux distribution processor: minimum 1.6 GHz, recommended 2.0 GHz and above memory: minimum 512 MB, recommended 4 GB and above storage space: minimum 250 MB, recommended 1 GB and above other requirements: stable network connection, Xorg/Wayland (Linux)

The five basic components of the Linux system are: 1. Kernel, 2. System library, 3. System utilities, 4. Graphical user interface, 5. Applications. The kernel manages hardware resources, the system library provides precompiled functions, system utilities are used for system management, the GUI provides visual interaction, and applications use these components to implement functions.

Although Notepad cannot run Java code directly, it can be achieved by using other tools: using the command line compiler (javac) to generate a bytecode file (filename.class). Use the Java interpreter (java) to interpret bytecode, execute the code, and output the result.

vscode built-in terminal is a development tool that allows running commands and scripts within the editor to simplify the development process. How to use vscode terminal: Open the terminal with the shortcut key (Ctrl/Cmd). Enter a command or run the script. Use hotkeys (such as Ctrl L to clear the terminal). Change the working directory (such as the cd command). Advanced features include debug mode, automatic code snippet completion, and interactive command history.

To view the Git repository address, perform the following steps: 1. Open the command line and navigate to the repository directory; 2. Run the "git remote -v" command; 3. View the repository name in the output and its corresponding address.

The reasons for the installation of VS Code extensions may be: network instability, insufficient permissions, system compatibility issues, VS Code version is too old, antivirus software or firewall interference. By checking network connections, permissions, log files, updating VS Code, disabling security software, and restarting VS Code or computers, you can gradually troubleshoot and resolve issues.

Writing code in Visual Studio Code (VSCode) is simple and easy to use. Just install VSCode, create a project, select a language, create a file, write code, save and run it. The advantages of VSCode include cross-platform, free and open source, powerful features, rich extensions, and lightweight and fast.

VS Code is available on Mac. It has powerful extensions, Git integration, terminal and debugger, and also offers a wealth of setup options. However, for particularly large projects or highly professional development, VS Code may have performance or functional limitations.
