
Under what circumstances is Java much slower than C++?

Nov 30, 2016 am 09:58 AM
c++ java

Question: Under what circumstances is Java much slower than C++?

Answer: Ben Maurer:

In order to answer this question, we need to first divide the problem into several possible causes of slowness:

Garbage collector. This is a "double-edged sword." If your program follows the "most objects die in the young generation" model, the garbage collector is very beneficial (less fragmentation, better cache locality). However, if the program does not follow this model, the JVM will spend a lot of resources reclaiming heap memory.

Large objects. In Java, every object has a vtable pointer, whereas a POD struct in C++ carries no such overhead. In addition, every Java object can be locked; depending on the JVM implementation, this may require an extra field in the object. Larger objects == fewer objects in cache == slower. (On the other hand, Java 7 offers compressed pointers on 64-bit platforms, which helps with part of this problem.)

Lack of inline objects. In Java, every class instance is accessed through a pointer. In C++, objects can be allocated inside other objects or on the stack. This improves cache locality and reduces the overhead of dynamic memory allocation.

Platform abstraction. In Java, JNI calls and marshalling objects into native code carry a non-trivial overhead. If you need to call down into C++ code frequently, this overhead adds up.

Forced inefficient abstractions. For example, if you write an XML parser in Java and can only use String objects (rather than char[]), it will be slow because of the extra allocations required.

More virtual function calls. In the JVM, almost all function calls are virtual. The JVM tries to avoid virtual calls, but in many cases it cannot. This hinders inlining and makes the code slower.

Lack of advanced compiler features and of the ability to drop down to assembly. Code that benefits from hand-written assembly may not perform well in Java.
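The "lack of inline objects" point can be illustrated with a minimal sketch. The class `LayoutSketch` and its `Point` type are hypothetical names introduced only for this example. An array of `Point` references stores each element as a separate heap object, while a flat `double[]` keeps the same data contiguous, roughly the way a C++ array of structs would. The sketch only shows the two layouts computing the same result; it makes no claim about how large the speed difference is on any particular JVM.

```java
// Hypothetical example: the same data stored as separate objects vs. one flat array.
final class LayoutSketch {
    /** Each Point is its own heap object, reached through a reference. */
    static final class Point {
        final double x;
        final double y;

        Point(double x, double y) {
            this.x = x;
            this.y = y;
        }
    }

    /** Array of references: every element access follows a pointer. */
    static double sumObjects(Point[] points) {
        double sum = 0.0;
        for (Point p : points) {
            sum += p.x + p.y;
        }
        return sum;
    }

    /** Flat layout: x at index 2*i, y at index 2*i + 1, stored contiguously. */
    static double sumFlat(double[] xy) {
        double sum = 0.0;
        for (int i = 0; i < xy.length; i += 2) {
            sum += xy[i] + xy[i + 1];
        }
        return sum;
    }

    public static void main(String[] args) {
        int n = 1_000;
        Point[] points = new Point[n];
        double[] xy = new double[2 * n];
        for (int i = 0; i < n; i++) {
            points[i] = new Point(i, 2 * i);
            xy[2 * i] = i;
            xy[2 * i + 1] = 2 * i;
        }
        // Both layouts hold the same values, so the sums agree.
        System.out.println(sumObjects(points) == sumFlat(xy));
    }
}
```

The flat layout gives up the convenience of a real type in exchange for locality, which is the trade-off a Java programmer faces when the working set no longer fits in cache.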

In my opinion, the biggest problem is garbage collection: programs that force many full GCs on a large heap are one of the most common causes of a big gap between Java and C++. In addition, if the program's working set does not fit in the L2 cache, problems such as large objects and the lack of inline objects can also make a huge difference. Forced inefficient abstractions and platform abstraction can also cause slowdowns, but this usually only happens in very low-level code and is usually not a big problem if you use a well-written Java library.

Answer: Todd Lipcon:

I basically agree with Ben Maurer's (hey Ben!) answer with a few minor differences:

In recent JVMs, escape analysis can effectively turn a heap allocation into a stack allocation when the allocated object never escapes (a) the local function or (b) the local thread. That is, the allocation can often be performed on the stack itself, or at least without any locking. In both cases it is a simple "bump the pointer" allocation, which is equivalent to stack allocation in C.
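As a rough illustration of what "escaping" means here, consider the following sketch. `EscapeSketch`, `Vec2`, and both methods are hypothetical names made up for this example. In `nonEscaping`, the temporary objects are never stored anywhere visible outside the method, so a JIT compiler with escape analysis is free to avoid heap-allocating them (whether it actually does so depends on the JVM and on inlining decisions). In `escaping`, the object is published through a static field, so it must live on the heap.

```java
// Hypothetical example: one allocation that stays local and one that escapes.
final class EscapeSketch {
    /** Small value class used only as a temporary. */
    static final class Vec2 {
        final double x;
        final double y;

        Vec2(double x, double y) {
            this.x = x;
            this.y = y;
        }

        Vec2 plus(Vec2 other) {
            return new Vec2(x + other.x, y + other.y);
        }

        double lengthSquared() {
            return x * x + y * y;
        }
    }

    /** A static field is reachable from other methods and threads. */
    static Vec2 lastResult;

    /** The Vec2 temporaries are never visible outside this method. */
    static double nonEscaping(double ax, double ay, double bx, double by) {
        Vec2 sum = new Vec2(ax, ay).plus(new Vec2(bx, by));
        return sum.lengthSquared();
    }

    /** Storing the object in a static field makes it escape. */
    static double escaping(double ax, double ay, double bx, double by) {
        Vec2 sum = new Vec2(ax, ay).plus(new Vec2(bx, by));
        lastResult = sum;
        return sum.lengthSquared();
    }

    public static void main(String[] args) {
        System.out.println(nonEscaping(1, 2, 2, 2));
        System.out.println(escaping(1, 2, 2, 2));
    }
}
```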

Translator's Note:

Escape analysis is a compiler optimization technique that analyzes the dynamic scope of pointers. In layman's terms, when a pointer to an object can be referenced by multiple methods or threads, we say that the pointer escapes.

Bump the pointer (literally "pointer collision"): assume that the memory in the Java heap is perfectly regular, with all used memory on one side, all free memory on the other side, and a pointer in the middle marking the dividing point. Allocating memory then simply means moving that pointer toward the free space by a distance equal to the size of the object. This allocation method is called "bump the pointer".
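The description above can be sketched as a toy allocator. This is only an illustration under simplifying assumptions, not how any JVM is implemented: `BumpAllocator` is a hypothetical class, a `byte[]` stands in for the heap region, and offsets into that array stand in for addresses.

```java
// Hypothetical toy allocator: used memory below `top`, free memory above it.
final class BumpAllocator {
    private final byte[] heap; // stand-in for a contiguous heap region
    private int top = 0;       // the dividing pointer between used and free memory

    BumpAllocator(int capacity) {
        this.heap = new byte[capacity];
    }

    /**
     * Reserves `size` bytes by moving the pointer forward.
     * Returns the start offset of the block, or -1 if it does not fit.
     */
    int allocate(int size) {
        if (size < 0 || size > heap.length - top) {
            return -1;
        }
        int start = top;
        top += size; // the "bump"
        return start;
    }

    /** Number of bytes handed out so far. */
    int used() {
        return top;
    }

    public static void main(String[] args) {
        BumpAllocator allocator = new BumpAllocator(64);
        System.out.println(allocator.allocate(16));
        System.out.println(allocator.allocate(24));
        System.out.println(allocator.allocate(32)); // does not fit in the remaining 24 bytes
    }
}
```

An allocation is one bounds check and one addition, which is why this scheme is so cheap. Note that nothing here frees individual blocks; in a real collector the space is reclaimed by the garbage collector rather than by the allocator.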

Even without escape analysis, young-generation allocation is done by bumping a pointer inside a thread-local allocation buffer (TLAB), so no synchronization is required. As a result, allocating small objects in Java is sometimes faster than calling malloc() in C. Better malloc implementations such as Google's tcmalloc take a similar approach. However, because C cannot relocate objects in memory once they have been allocated, such allocators are limited in some respects.

Although inlining and virtual functions are a concern, Java can actually do better than C in some cases. In particular, C cannot inline across dynamic linking, because inlining is done at compile time rather than at run time. Java can dynamically inline a function across class or library boundaries, even if the actual implementation of the class is not available at compile time. In many workloads this is more efficient than C++ virtual function calls, which always have to go through the virtual table. If an assumption that previously held stops holding (for example, because a new class has been loaded), the JIT compiler can intelligently undo the inlining.
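A small sketch of the kind of call site this paragraph is about. `InlineSketch`, `Shape`, and `Square` are hypothetical names. In the source, `s.area()` is an interface call, but as long as `Square` is the only implementation that has been loaded, a JIT compiler may treat the call as having a single target and inline it, falling back if another implementation appears later. Whether a given JVM actually does this for this code is an assumption, not something the sketch verifies.

```java
// Hypothetical example: a virtual call site with only one loaded implementation.
final class InlineSketch {
    interface Shape {
        double area();
    }

    /** The only Shape implementation in this program. */
    static final class Square implements Shape {
        final double side;

        Square(double side) {
            this.side = side;
        }

        @Override
        public double area() {
            return side * side;
        }
    }

    /** The call to area() is virtual in the source; a JIT may devirtualize and inline it. */
    static double totalArea(Shape[] shapes) {
        double total = 0.0;
        for (Shape s : shapes) {
            total += s.area();
        }
        return total;
    }

    public static void main(String[] args) {
        Shape[] shapes = { new Square(1), new Square(2), new Square(3) };
        System.out.println(totalArea(shapes));
    }
}
```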

Newer versions of GCC provide some optimizations in this area, called "whole-program optimization" or "link-time optimization", which allow inlining across object files within a project. However, they still cannot inline across dynamic linking (for example, calls into zlib). Many large projects work around this by copying library functionality into their own code.


