Home Backend Development PHP Tutorial Million-level PHP website architecture toolbox_PHP tutorial

Million-level PHP website architecture toolbox_PHP tutorial

Jul 21, 2016 pm 02:51 PM
facebook php world learn Backstage exist toolbox most Architecture million of site website

After learning about the world’s largest PHP site, Facebook’s backend technology, today we will learn about the website architecture of a million-level PHP site: Poppen.de. Poppen.de is a social networking site in Germany. It is a small website compared to Facebook and Flickr, but it has a good architecture and integrates many technologies, such as Nigix, MySql, CouchDB, Erlang, Memcached, RabbitMQ, PHP, Graphite, Red5 and Tsung.

Poppen.de currently has 2 million registered users, 20,000 concurrent users, 200,000 private messages per day, and 250,000 logins per day. The project team has 11 developers, two designers, and two system administrators. The site's business model uses a freemium model, and users can use functions such as searching for users, sending messages to friends, and uploading pictures and videos.

If users want to enjoy unlimited sending messages and uploading pictures, they have to pay for different types of membership services according to their needs. The same strategy is used for video chat and other services on the website.

Nginx

All services of Poppen.de are based on Nginx service. The front-end has two Nginx servers serving a load of 150,000 requests per minute at peak times. Each machine is four years old and has only one CPU and 3GB of RAM. Poppen.de has three independent image servers, and three Nginx servers provide *.bilder.poppen.de with 80,000 request services per minute.

A cool design in the Nginx architecture is that many requests are handled by Memcached, so requests get content from the cache without directly accessing the PHP machine. For example, user profile (user profile) is content that requires intensive processing on the website. If all user profile pages are cached on Memcached, then the request will directly obtain the content from Memcached. Poppen.de's Memcached can handle 8,000 requests per minute.

There are three Nginx image servers in the architecture to provide local image caching, and users upload images to a central file server. When an image is requested from one of the three Nginx, if the image does not exist locally on the server, it will be downloaded from the central file server to the server for caching and service. This load-balanced distributed image server architecture design can reduce the load on primary storage devices.

PHP-FPM

This website runs on PHP-FPM. There are a total of 28 PHP machines with dual CPUs and 6GB of memory, each running 100 PHP-FPM worker threads. Using PHP5.3.x with APC enabled. PHP5.3 can reduce CPU and memory usage by more than 30%.

 The program code is developed based on the Symfony1.2 framework. One is that external resources can be used, and the other is that it can improve project development progress, and at the same time, it can make it easier for new developers to join the team on a well-known framework. Although nothing is perfect, you can get a lot of benefits from the Symfony framework, allowing the team to focus more on Poppen.de's business development.

Website performance optimization uses XHProf, which is a class library open sourced by Facebook. This framework is very easy to personalize and configure, and can cache most expensive server calculations.

 MySQL

MySQL is the main RDBMS for the website. The website has several MySql servers: a 4CPU, 32GB server stores user-related information, such as basic information, photo description information, etc. This machine has been used for 4 years, and the next step is to replace it with a shared cluster. The design is still based on this system to simplify the data access code. Data partitioning is based on user ID, because most of the information in the website is user-centered, such as photos, videos, messages, etc.

There are three servers providing user forum services based on a master-slave-slave configuration architecture. A slave server is responsible for storing custom messages on the website, and there are currently 250 million messages. The other four machines are in a master-slave configuration. In addition, four machines are configured into an NDB cluster to specifically serve intensive write operation data, such as user access statistics.

The data table design should try to avoid association operations and cache as much data as possible. Of course, the structural specifications of the database have been completely destroyed. Therefore, to make searching easier, database design creates data mining tables. Most of the tables are MyISAM-type tables, which can provide fast search. The problem now is that more and more tables have been fully locked. Poppen.de is considering migrating to the XtraDB storage engine.

Memcached

There are quite a lot of Memcached applications in the website architecture, with more than 45GB of cache and 51 nodes. Session, view cache, function execution cache, etc. are cached. There is a system in the architecture that automatically updates the data to the cache when records are modified. Possible solutions to improve cache updates in the future are to use the new Redis Hash API or MongoDB.

RabbitMQ

Started using RabbitMQ in the architecture in mid-2009. This is a good messaging solution that is easy to deploy and centralize into this architecture, running two RabbitMQ servers behind LVS. In the last month, more things have been integrated into the queue, meaning that at one time there were 28 PHP servers handling 500,000 requests per day. Send logs, email notifications, system messages, image uploads, and more to this queue.

Use the fastcgi_finish_request() function in PHP-FPM to integrate queue messages and send messages to the queue asynchronously. This function is called when the system needs to send an HTML or JSON format response to the user, so that the user does not have to wait for the PHP script to clean up.

This system can improve architectural resource management. For example, during peak periods the service can handle 1,000 login requests per minute. This means that there are 1000 concurrent updates to the user table to save the user's login time. Thanks to the queuing mechanism, these queries can be run in reverse order. If you need to increase the processing speed, you only need to add more queue processors, and you can even add more servers to the cluster without modifying any configuration or deploying new nodes.

CouchDB

Log storage CouchDB runs on a machine. Log query/grouping can be done on this machine based on module/behavior, or based on error type, etc. This is very useful for locating problems. Before using the log aggregation service CouchDB, I had to log in to the PHP servers one by one to try to analyze the logs and locate the problem, which was very troublesome. Now all the logs are concentrated in the queue and saved in CouchDB, so that problem inspection and analysis can be carried out centrally.

Graphite

The website uses Graphite to collect real-time website information and statistics. From requesting every module/behavior to Memcached hits and misses, RabbitMQ status monitoring, Unix loads and more. The Graphite service has an average of 4,800 update operations per minute. Practice has proven to be very useful for monitoring what is happening on the website, and its simple text protocol and drawing functions can be easily used in a plug-and-play manner on any system that needs to be monitored.

One cool thing is using Graphite to monitor two versions of the website at the same time. A new version of the Symfony framework was deployed in January, with the previous code deployed as a backup. This means the website may face performance issues. Therefore, Graphite can be used to compare the two versions online.

Found that the Unix load table on the new version was higher, so I used XHProf to perform performance analysis on the two versions to find out the problem.

Red5

The website also provides two types of video services for users, one is videos uploaded by users themselves, and the other is video chat, where users interact and share videos. By mid-2009, it will provide users with 17TB of traffic services every month.

 Tsung

Tsung is a distributed benchmark analysis tool written in Erlang. On the Poppen.de website, it is mainly used for HTTP benchmark analysis and comparative analysis of MySQL and other storage systems (XtraDB). A system was used to record the traffic of the main MySQL server and then converted it into Tsung's baseline session. The traffic is then replayed, and Tsung generates thousands of concurrent users accessing the laboratory's servers. This allows the experimental environment to be very close to the real scene.

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/371668.htmlTechArticleAfter learning about the world’s largest PHP site, Facebook’s backend technology, today we will learn about a million-level PHP Website architecture of the site: Poppen.de. Poppen.de is a social networking site in Germany...
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Java Tutorial
1663
14
PHP Tutorial
1263
29
C# Tutorial
1236
24
PHP and Python: Comparing Two Popular Programming Languages PHP and Python: Comparing Two Popular Programming Languages Apr 14, 2025 am 12:13 AM

PHP and Python each have their own advantages, and choose according to project requirements. 1.PHP is suitable for web development, especially for rapid development and maintenance of websites. 2. Python is suitable for data science, machine learning and artificial intelligence, with concise syntax and suitable for beginners.

PHP in Action: Real-World Examples and Applications PHP in Action: Real-World Examples and Applications Apr 14, 2025 am 12:19 AM

PHP is widely used in e-commerce, content management systems and API development. 1) E-commerce: used for shopping cart function and payment processing. 2) Content management system: used for dynamic content generation and user management. 3) API development: used for RESTful API development and API security. Through performance optimization and best practices, the efficiency and maintainability of PHP applications are improved.

PHP: A Key Language for Web Development PHP: A Key Language for Web Development Apr 13, 2025 am 12:08 AM

PHP is a scripting language widely used on the server side, especially suitable for web development. 1.PHP can embed HTML, process HTTP requests and responses, and supports a variety of databases. 2.PHP is used to generate dynamic web content, process form data, access databases, etc., with strong community support and open source resources. 3. PHP is an interpreted language, and the execution process includes lexical analysis, grammatical analysis, compilation and execution. 4.PHP can be combined with MySQL for advanced applications such as user registration systems. 5. When debugging PHP, you can use functions such as error_reporting() and var_dump(). 6. Optimize PHP code to use caching mechanisms, optimize database queries and use built-in functions. 7

PHP vs. Python: Understanding the Differences PHP vs. Python: Understanding the Differences Apr 11, 2025 am 12:15 AM

PHP and Python each have their own advantages, and the choice should be based on project requirements. 1.PHP is suitable for web development, with simple syntax and high execution efficiency. 2. Python is suitable for data science and machine learning, with concise syntax and rich libraries.

The Enduring Relevance of PHP: Is It Still Alive? The Enduring Relevance of PHP: Is It Still Alive? Apr 14, 2025 am 12:12 AM

PHP is still dynamic and still occupies an important position in the field of modern programming. 1) PHP's simplicity and powerful community support make it widely used in web development; 2) Its flexibility and stability make it outstanding in handling web forms, database operations and file processing; 3) PHP is constantly evolving and optimizing, suitable for beginners and experienced developers.

PHP vs. Other Languages: A Comparison PHP vs. Other Languages: A Comparison Apr 13, 2025 am 12:19 AM

PHP is suitable for web development, especially in rapid development and processing dynamic content, but is not good at data science and enterprise-level applications. Compared with Python, PHP has more advantages in web development, but is not as good as Python in the field of data science; compared with Java, PHP performs worse in enterprise-level applications, but is more flexible in web development; compared with JavaScript, PHP is more concise in back-end development, but is not as good as JavaScript in front-end development.

PHP and Python: Code Examples and Comparison PHP and Python: Code Examples and Comparison Apr 15, 2025 am 12:07 AM

PHP and Python have their own advantages and disadvantages, and the choice depends on project needs and personal preferences. 1.PHP is suitable for rapid development and maintenance of large-scale web applications. 2. Python dominates the field of data science and machine learning.

What is Cross-Site Request Forgery (CSRF) and how do you implement CSRF protection in PHP? What is Cross-Site Request Forgery (CSRF) and how do you implement CSRF protection in PHP? Apr 07, 2025 am 12:02 AM

In PHP, you can effectively prevent CSRF attacks by using unpredictable tokens. Specific methods include: 1. Generate and embed CSRF tokens in the form; 2. Verify the validity of the token when processing the request.

See all articles