


SMPLer-X: Topping seven major benchmarks with the first large human motion capture model
Although research on expressive human pose and shape estimation (EHPS) has made great progress, state-of-the-art methods are still constrained by limited training datasets.
Recently, researchers from Nanyang Technological University's S-Lab, SenseTime, Shanghai Artificial Intelligence Laboratory, the University of Tokyo and the IDEA Research Institute proposed SMPLer-X, the first large motion capture model for the human pose and shape estimation task. The study used up to 4.5 million instances from different data sources to train the model, achieving the best performance on 7 key benchmarks.
SMPLer-X not only captures body movements, but also outputs facial and hand movements and estimates body shape.
Paper link: https://arxiv.org/abs/2309.17448
Project homepage: https://caizhongang.github.io/projects/SMPLer-X/
With rich data and a very large model, SMPLer-X performs strongly across a range of tests and benchmarks, and generalizes well even to unseen environments:
1. In terms of data scaling, the researchers conducted a comprehensive evaluation and analysis of 32 3D human datasets to provide a reference for model training.
2. In terms of model scaling, the researchers used large vision models to study how increasing the number of model parameters improves performance.
3. Through fine-tuning strategies, the general-purpose SMPLer-X model can be turned into a specialized model, allowing it to achieve further performance gains.
In summary, SMPLer-X explores both data scaling and model scaling (see Figure 1). The researchers ranked 32 academic datasets and trained on all 4.5 million of their instances, achieving the best performance on 7 key benchmarks, including AGORA, UBody, EgoBody and EHF.
Figure 1: Increasing the amount of data and the number of model parameters both effectively reduce the mean primary error (MPE) on the key benchmarks (AGORA, UBody, EgoBody, 3DPW and EHF).
A generalization study on existing 3D human datasets
The researchers ranked 32 academic datasets. To measure the performance of each dataset, a model was trained on that dataset alone and then evaluated on five evaluation datasets: AGORA, UBody, EgoBody, 3DPW and EHF.
The mean primary error (MPE) is also reported in the table to allow simple comparisons between datasets.
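The ranking procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it assumes MPE is the plain arithmetic mean of each benchmark's primary error, and the dataset names and error values are made-up placeholders, not results from the paper.

```python
from statistics import mean

# The five evaluation benchmarks named in the article.
BENCHMARKS = ("AGORA", "UBody", "EgoBody", "3DPW", "EHF")


def mean_primary_error(primary_errors: dict[str, float]) -> float:
    """Average the primary error over the five benchmarks.

    Assumption: MPE is an unweighted arithmetic mean.
    """
    missing = [b for b in BENCHMARKS if b not in primary_errors]
    if missing:
        raise ValueError(f"missing benchmark results: {missing}")
    return mean(primary_errors[b] for b in BENCHMARKS)


def rank_by_mpe(results: dict[str, dict[str, float]]) -> list[tuple[str, float]]:
    """Sort training datasets by ascending MPE (lower error ranks higher)."""
    scored = [(name, mean_primary_error(errs)) for name, errs in results.items()]
    return sorted(scored, key=lambda item: item[1])


# Hypothetical placeholder values, NOT numbers from the paper.
example_results = {
    "dataset_b": {"AGORA": 150.0, "UBody": 120.0, "EgoBody": 130.0, "3DPW": 140.0, "EHF": 110.0},
    "dataset_a": {"AGORA": 100.0, "UBody": 80.0, "EgoBody": 90.0, "3DPW": 110.0, "EHF": 70.0},
}

ranking = rank_by_mpe(example_results)
for name, mpe in ranking:
    print(f"{name}: MPE = {mpe:.1f}")
```

Collapsing the five benchmark scores into one number makes the datasets directly comparable, at the cost of hiding which benchmark a given dataset helps most.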
Insights from the dataset generalization study
From the analysis of a large number of datasets (see Figure 3), the following four conclusions can be drawn:
1. Regarding the size of a single dataset, datasets on the order of 100,000 instances offer the best cost-effectiveness for model training.
2. Regarding the collection scenario, in-the-wild datasets work best. If data can only be collected indoors, avoid relying on a single scene in order to improve training results.
3. Regarding how datasets are obtained, two of the top three datasets are synthetic. In recent years, synthetic datasets have shown strong performance.
4. Regarding dataset annotation, pseudo-labels also play a very important role in training.
Training and fine-tuning of large motion capture models
Today's state-of-the-art methods are usually trained on only a few datasets (for example, MSCOCO, MPII and Human3.6M), whereas this paper studies the use of many more.
Giving priority to higher-ranked datasets, the researchers used four training sets of different sizes: 5, 10, 20 and 32 datasets, with total sizes of 750,000, 1.5 million, 3 million and 4.5 million instances respectively.
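One way to picture this setup is as nested top-k selections from the ranked list. The sketch below assumes each training set is simply the k highest-ranked datasets; the article only says higher-ranked datasets are preferred, so this is an assumption. The dataset names are hypothetical placeholders, and the totals are the rounded figures quoted above.

```python
def top_k_datasets(ranked: list[str], k: int) -> list[str]:
    """Return the k highest-ranked datasets (the list is assumed sorted best-first)."""
    if not 0 < k <= len(ranked):
        raise ValueError(f"k must be between 1 and {len(ranked)}, got {k}")
    return ranked[:k]


# Placeholder names standing in for the 32 ranked academic datasets.
RANKED = [f"dataset_{i:02d}" for i in range(1, 33)]

# Training-set sizes and rounded instance totals quoted in the article.
SUBSET_SIZES = (5, 10, 20, 32)
TOTAL_INSTANCES = {5: 750_000, 10: 1_500_000, 20: 3_000_000, 32: 4_500_000}

training_sets = {k: top_k_datasets(RANKED, k) for k in SUBSET_SIZES}

# Average instances per dataset, approximate because the totals are rounded.
avg_per_dataset = {k: TOTAL_INSTANCES[k] / k for k in SUBSET_SIZES}

for k in SUBSET_SIZES:
    print(f"top {k:>2}: {TOTAL_INSTANCES[k]:>9,} instances, "
          f"about {avg_per_dataset[k]:,.0f} per dataset")
```

With these rounded totals, the average works out to roughly 140,000 to 150,000 instances per dataset at every scale, which is in line with the earlier observation about datasets on the order of 100,000 instances.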
In addition, the researchers demonstrated low-cost fine-tuning strategies for adapting the general-purpose large model to specific scenarios.
The tables above show some of the main tests: the AGORA test set (Table 3), the AGORA validation set (Table 4), EHF (Table 5), UBody (Table 6) and EgoBody-EgoSet (Table 7).
The researchers also evaluated how well the large motion capture model generalizes on two further test sets, ARCTIC and DNA-Rendering.
The researchers hope that SMPLer-X can offer inspiration beyond algorithm design and give the academic community a powerful large model for whole-body human motion capture.
The code and pre-trained models have been open-sourced on the project homepage. Visit https://caizhongang.github.io/projects/SMPLer-X/ for more details.
Result display