Table of Contents
Cause of the problem
Solution
Method 1: GB18030 encoding conversion
Method 2: Use reflect
Method 3: Use the name attribute in fileheader and FileInfo
Home Backend Development Golang golang zip Chinese garbled code

golang zip Chinese garbled code

May 14, 2023 pm 10:35 PM

With the continuous popularity and application of Golang in the field of web development, Zip compressed files have become an essential functional module. However, some developers encountered a common problem when using Golang's Zip package for file compression - Chinese file names appear garbled.

This is a very troublesome problem, because it not only makes the originally beautiful file name look nondescript, but may also cause a series of other errors. Below we will explore the causes and solutions to this problem.

Cause of the problem

Zip format is a binary format that contains the file name, file directory, compression method and compressed data. Among this information, the file name is a very critical part, because it determines the name and storage path of the file after the user decompresses it.

However, different file systems and encoding formats may result in different parsing results for the same file name. For example, on Windows systems, the default encoding used for file names is GBK, while on UNIX/Linux systems, the file names use UTF-8. If we do not handle these different encoding methods when using the Zip package in Golang for compression, the file name may be treated as garbled characters.

Solution

We have a variety of solutions for the problem of garbled Chinese file names. Below we will introduce some of the more feasible methods.

Method 1: GB18030 encoding conversion

Because the Zip format uses CP437 encoding to represent the file name, we can convert the file name from GB18030 encoding to CP437 encoding, so as to ensure that File names can be correctly parsed in any environment. In Go, you can use the golang.org/x/text/encoding/simplifiedchinese package to convert between GB18030 encoding and CP437 encoding.

import (
    "golang.org/x/text/encoding/simplifiedchinese"
    "golang.org/x/text/transform"
)

func GbkToUtf8(data []byte) ([]byte, error) {
    return transform.NewReader(bytes.NewReader(data), simplifiedchinese.GB18030.NewDecoder()).ReadAll()
}

func Utf8ToGbk(data []byte) ([]byte, error) {
    return transform.NewReader(bytes.NewReader(data), simplifiedchinese.GB18030.NewEncoder()).ReadAll()
}
Copy after login

Method 2: Use reflect

We can also use the StructTag option in the reflect package to force the use of the specified encoding method. Specifically, you can add a zip tag to the structure and a chinese-utf8 tag to the Tag. The sample code is as follows:

type File struct {
    Name string `zip:"filename=测试文件,chinese-utf8"`
}

func main() {
    zhName := "测试文件"
    utf8Name, _ := GbkToUtf8([]byte(zhName))
    f := &File{Name: string(utf8Name)}
    // 压缩文件...
}
Copy after login

Method 3: Use the name attribute in fileheader and FileInfo

In the Golang Zip package, we can also call fileheader and ##name attribute of #FileInfo to manually specify the correct encoding for each file.

import "archive/zip"

func zipFiles(filePaths []string, dest string) error {
    // 创建文件
    newZipFile, err := os.Create(dest)
    if err != nil {
        return err
    }
    defer newZipFile.Close()

    // 创建 ZIP writer 对象
    zipWriter := zip.NewWriter(newZipFile)
    defer zipWriter.Close()

    // 遍历 filePaths,为每个文件设置正确的编码方式
    for _, filePath := range filePaths {
        zipFile, err := os.Open(filePath)
        if err != nil {
            return err
        }
        defer zipFile.Close()

        // 解析文件名,并转换编码
        zipFileInfo, _ := zip.FileInfoHeader(zipFile.Stat())
        zipFileInfo.Name, _ = GbkToUtf8([]byte(zipFileInfo.Name))

        // 创建 Zip 文件写入器
        zipWriterNewFile, err := zipWriter.CreateHeader(zipFileInfo)
        if err != nil {
            return err
        }

        // 读取文件并写入 Zip 文件中
        _, err = io.Copy(zipWriterNewFile, zipFile)
        if err != nil {
            return err
        }
    }

    return nil
}
Copy after login
Conclusion

Golang’s Zip package provides a variety of solutions to the problem of garbled Chinese file names. If the encoding format of the file name is not uniform, it is easy for the file name to be garbled. We can easily avoid this problem by simply choosing the right solution according to our needs.

The above is the detailed content of golang zip Chinese garbled code. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

What are the vulnerabilities of Debian OpenSSL What are the vulnerabilities of Debian OpenSSL Apr 02, 2025 am 07:30 AM

OpenSSL, as an open source library widely used in secure communications, provides encryption algorithms, keys and certificate management functions. However, there are some known security vulnerabilities in its historical version, some of which are extremely harmful. This article will focus on common vulnerabilities and response measures for OpenSSL in Debian systems. DebianOpenSSL known vulnerabilities: OpenSSL has experienced several serious vulnerabilities, such as: Heart Bleeding Vulnerability (CVE-2014-0160): This vulnerability affects OpenSSL 1.0.1 to 1.0.1f and 1.0.2 to 1.0.2 beta versions. An attacker can use this vulnerability to unauthorized read sensitive information on the server, including encryption keys, etc.

What libraries are used for floating point number operations in Go? What libraries are used for floating point number operations in Go? Apr 02, 2025 pm 02:06 PM

The library used for floating-point number operation in Go language introduces how to ensure the accuracy is...

What is the problem with Queue thread in Go's crawler Colly? What is the problem with Queue thread in Go's crawler Colly? Apr 02, 2025 pm 02:09 PM

Queue threading problem in Go crawler Colly explores the problem of using the Colly crawler library in Go language, developers often encounter problems with threads and request queues. �...

Transforming from front-end to back-end development, is it more promising to learn Java or Golang? Transforming from front-end to back-end development, is it more promising to learn Java or Golang? Apr 02, 2025 am 09:12 AM

Backend learning path: The exploration journey from front-end to back-end As a back-end beginner who transforms from front-end development, you already have the foundation of nodejs,...

In Go, why does printing strings with Println and string() functions have different effects? In Go, why does printing strings with Println and string() functions have different effects? Apr 02, 2025 pm 02:03 PM

The difference between string printing in Go language: The difference in the effect of using Println and string() functions is in Go...

PostgreSQL monitoring method under Debian PostgreSQL monitoring method under Debian Apr 02, 2025 am 07:27 AM

This article introduces a variety of methods and tools to monitor PostgreSQL databases under the Debian system, helping you to fully grasp database performance monitoring. 1. Use PostgreSQL to build-in monitoring view PostgreSQL itself provides multiple views for monitoring database activities: pg_stat_activity: displays database activities in real time, including connections, queries, transactions and other information. pg_stat_replication: Monitors replication status, especially suitable for stream replication clusters. pg_stat_database: Provides database statistics, such as database size, transaction commit/rollback times and other key indicators. 2. Use log analysis tool pgBadg

How to specify the database associated with the model in Beego ORM? How to specify the database associated with the model in Beego ORM? Apr 02, 2025 pm 03:54 PM

Under the BeegoORM framework, how to specify the database associated with the model? Many Beego projects require multiple databases to be operated simultaneously. When using Beego...

How to solve the user_id type conversion problem when using Redis Stream to implement message queues in Go language? How to solve the user_id type conversion problem when using Redis Stream to implement message queues in Go language? Apr 02, 2025 pm 04:54 PM

The problem of using RedisStream to implement message queues in Go language is using Go language and Redis...

See all articles