golang zip Chinese garbled code
With the continuous popularity and application of Golang in the field of web development, Zip compressed files have become an essential functional module. However, some developers encountered a common problem when using Golang's Zip package for file compression - Chinese file names appear garbled.
This is a very troublesome problem, because it not only makes the originally beautiful file name look nondescript, but may also cause a series of other errors. Below we will explore the causes and solutions to this problem.
Cause of the problem
Zip format is a binary format that contains the file name, file directory, compression method and compressed data. Among this information, the file name is a very critical part, because it determines the name and storage path of the file after the user decompresses it.
However, different file systems and encoding formats may result in different parsing results for the same file name. For example, on Windows systems, the default encoding used for file names is GBK, while on UNIX/Linux systems, the file names use UTF-8. If we do not handle these different encoding methods when using the Zip package in Golang for compression, the file name may be treated as garbled characters.
Solution
We have a variety of solutions for the problem of garbled Chinese file names. Below we will introduce some of the more feasible methods.
Method 1: GB18030 encoding conversion
Because the Zip format uses CP437 encoding to represent the file name, we can convert the file name from GB18030 encoding to CP437 encoding, so as to ensure that File names can be correctly parsed in any environment. In Go, you can use the golang.org/x/text/encoding/simplifiedchinese
package to convert between GB18030 encoding and CP437 encoding.
import ( "golang.org/x/text/encoding/simplifiedchinese" "golang.org/x/text/transform" ) func GbkToUtf8(data []byte) ([]byte, error) { return transform.NewReader(bytes.NewReader(data), simplifiedchinese.GB18030.NewDecoder()).ReadAll() } func Utf8ToGbk(data []byte) ([]byte, error) { return transform.NewReader(bytes.NewReader(data), simplifiedchinese.GB18030.NewEncoder()).ReadAll() }
Method 2: Use reflect
We can also use the StructTag
option in the reflect
package to force the use of the specified encoding method. Specifically, you can add a zip
tag to the structure and a chinese-utf8
tag to the Tag. The sample code is as follows:
type File struct { Name string `zip:"filename=测试文件,chinese-utf8"` } func main() { zhName := "测试文件" utf8Name, _ := GbkToUtf8([]byte(zhName)) f := &File{Name: string(utf8Name)} // 压缩文件... }
Method 3: Use the name attribute in fileheader and FileInfo
In the Golang Zip package, we can also call fileheader
and ##name
attribute of #FileInfo to manually specify the correct encoding for each file.
import "archive/zip" func zipFiles(filePaths []string, dest string) error { // 创建文件 newZipFile, err := os.Create(dest) if err != nil { return err } defer newZipFile.Close() // 创建 ZIP writer 对象 zipWriter := zip.NewWriter(newZipFile) defer zipWriter.Close() // 遍历 filePaths,为每个文件设置正确的编码方式 for _, filePath := range filePaths { zipFile, err := os.Open(filePath) if err != nil { return err } defer zipFile.Close() // 解析文件名,并转换编码 zipFileInfo, _ := zip.FileInfoHeader(zipFile.Stat()) zipFileInfo.Name, _ = GbkToUtf8([]byte(zipFileInfo.Name)) // 创建 Zip 文件写入器 zipWriterNewFile, err := zipWriter.CreateHeader(zipFileInfo) if err != nil { return err } // 读取文件并写入 Zip 文件中 _, err = io.Copy(zipWriterNewFile, zipFile) if err != nil { return err } } return nil }
The above is the detailed content of golang zip Chinese garbled code. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

OpenSSL, as an open source library widely used in secure communications, provides encryption algorithms, keys and certificate management functions. However, there are some known security vulnerabilities in its historical version, some of which are extremely harmful. This article will focus on common vulnerabilities and response measures for OpenSSL in Debian systems. DebianOpenSSL known vulnerabilities: OpenSSL has experienced several serious vulnerabilities, such as: Heart Bleeding Vulnerability (CVE-2014-0160): This vulnerability affects OpenSSL 1.0.1 to 1.0.1f and 1.0.2 to 1.0.2 beta versions. An attacker can use this vulnerability to unauthorized read sensitive information on the server, including encryption keys, etc.

The library used for floating-point number operation in Go language introduces how to ensure the accuracy is...

Queue threading problem in Go crawler Colly explores the problem of using the Colly crawler library in Go language, developers often encounter problems with threads and request queues. �...

Backend learning path: The exploration journey from front-end to back-end As a back-end beginner who transforms from front-end development, you already have the foundation of nodejs,...

The difference between string printing in Go language: The difference in the effect of using Println and string() functions is in Go...

This article introduces a variety of methods and tools to monitor PostgreSQL databases under the Debian system, helping you to fully grasp database performance monitoring. 1. Use PostgreSQL to build-in monitoring view PostgreSQL itself provides multiple views for monitoring database activities: pg_stat_activity: displays database activities in real time, including connections, queries, transactions and other information. pg_stat_replication: Monitors replication status, especially suitable for stream replication clusters. pg_stat_database: Provides database statistics, such as database size, transaction commit/rollback times and other key indicators. 2. Use log analysis tool pgBadg

Under the BeegoORM framework, how to specify the database associated with the model? Many Beego projects require multiple databases to be operated simultaneously. When using Beego...

The problem of using RedisStream to implement message queues in Go language is using Go language and Redis...
