Home Technology peripherals AI Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture

Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture

Sep 18, 2023 pm 07:37 PM
ar/vr Microsoft patent body posture capture

(Nweon September 18, 2023) In order to accurately represent the real-world posture of a human user, relatively detailed information about the position and orientation of the user's body parts is usually required, but this information is not always available. For example, when using a headset to provide a virtual reality experience, the system may only be able to obtain spatial information related to the user's head and hands. However, in most cases this is not sufficient to accurately reproduce the real pose of a human user

So in the patent application called "Pose prediction for articulated object", Microsoft proposed a technology to predict the posture of articulated objects. In particular, the machine learning model receives the spatial information of n different joints of the articulated object, where n joints are smaller than all the joints of the articulated object.

In the case of a human user, the n joints may include the human user's head joint and/or one or two wrist joints, which are associated with spatial information detailing the parameters of the user's head and/or hands

The machine learning model has been trained to receive input spatial information for n m joints of an articulated object, where m is greater than or equal to 1. For example, during initial training, a machine learning model receives input data corresponding to nearly all joints of an articulated object. The n m joints may include each joint of the articulated object.

In other examples, there may be n m joints where there are less than all joints of an articulated object. During the training process, the data input to the machine learning model may be gradually hidden. You can use predefined values ​​to replace the corresponding input data of a specific node in m nodes, or simply omit

In other words, a machine learning model is trained to accurately predict the pose of an articulated object based on progressively less information about the position/orientation of the various movable parts of the articulated object.

Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture

Using this approach, machine learning models are able to accurately predict the pose of articulated objects at runtime and require only sparse input data. Microsoft notes that this technology can accurately reproduce the real-world pose of articulated objects for human users without requiring a large amount of information about the orientation of each joint.

In other words, inventions can provide technical advantages that improve human-computer interaction by more accurately reproducing the real-world gestures of human users. These technical benefits include improving the immersion of virtual reality experiences and improving the accuracy of gesture recognition systems

In addition, the described technology can reduce the consumption of computing resources while accurately reproducing the real posture of human users by reducing the amount of data that must be collected as input to the posture prediction process.

Example method 200 shows Figure 2 for predicting the pose of an articulated object

Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture

At point 202, receive the spatial information of n joints, which are used for articulated objects. The system receives the spatial information of n joints of the articulated object, which contains fewer joints than all the joints of the articulated object. Representing the spatial information of a joint as the position and orientation of six degrees of freedom connecting body parts, this can be used to infer the state of the joint

As an example, the n joints may include head joints of the human body, and the spatial information of the head joints may describe the parameters of the human head in detail. In addition, the n joints may include one or more wrist joints of the human body, and the spatial information of the one or more wrist joints may describe in detail the parameters of one or more hands of the human body.

Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture

Figure 3 shows human users. The human user has a head 300 and two hands 302A and 302B. The computing system may receive spatial information for one or more joints of a human user, which may include head and/or wrist joints.

The spatial information of the n joints of the articulated object can be derived from the positioning data output by one or more sensors. Sensors may be integrated into one or more devices held or worn by corresponding body parts of a human user.

For example, sensors may include one or more inertial measurement units integrated into a head-mounted display device and/or a handheld controller. As another example, a sensor may include one or more cameras.

Figure 3 schematically illustrates different types of sensors where the output from the sensors may include or be used to derive spatial information. Specifically, a human user wears a head mounted display device 304 on his or her head 300 .

Additionally, the human user holds position sensors 306A and 306B, which may be configured to detect and report motion of the user's hands to the headset 304 and/or another computing system configured to receive spatial information.

In Figure 2, we are back to the 204 situation. We pass the spatial information of n joints to the previously trained machine learning model. This model receives spatial information of n m joints as input, where the value of m is greater than or equal to 1. In other words, compared to the previous training model, this machine learning model receives less joint space information

In 206, a pose prediction of the joint object is received as output from the machine learning model, the prediction is based on at least the spatial information of the n joints and does not contain the spatial information of their joints. In other words, even if the spatial information of m joints is not provided, the machine learning model can predict the complete posture of the joint object.

Schematic 4 shows an example machine learning model 400 to illustrate this process

Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture

In Figure 4, the machine learning model receives spatial information 402, corresponding to three different joints J1, J2, and J3. The spatial information of the joint may take the form of any suitable computer data that specifies or can be used to derive the position and/or orientation of the body part connected to the joint.

For example, the spatial information may directly specify the position and orientation of a body part, and/or the spatial information may specify one or more rotations of a joint relative to one or more rotation axes. In Figure 4, joints J1, J2, J3 correspond to a human user's head joint 404A and two wrist joints 404B/404C, as shown by the shaded circles superimposed on the user's body.

In this example, the n joints include three joints, corresponding to the head and wrist joints of the human body. Based on the input spatial information 402, the machine learning model outputs a predicted pose 406 of the articulated object.

In addition, the machine learning model can output predicted spatial information corresponding to the joints represented by the virtual hinge. Human users can be represented by avatars with cartoonish or non-human proportions. For example, the predicted spatial information may correspond to joints represented by SMPL.

In other words, the joints of the virtual representation of the articulated representation do not have to have a 1:1 correspondence with the joints of the articulated object. Therefore, the spatial information output predicted by the machine learning model may be for joints that do not directly correspond to the n m joints of the articulated object. For example, a virtual representation may have fewer spinal joints than an articulated object.

Machine learning models can be trained in any suitable way. In one embodiment, the machine learning model may have been previously trained using training input data with ground truth labels for articulated objects.

In other words, the training spatial information of the joints of the articulated object can be provided to the machine learning model and marked as the ground truth label specifying the actual pose of the articulated object corresponding to the spatial information.

As mentioned above, a machine learning model can be trained to receive spatial information of n m joints as input. This involves, in the first training iteration, providing the machine learning model with training input data for all n m joints. In a subsequent series of training iterations, the training input data of m joints can be gradually masked.

For example, in the second training iteration, the first joint among the m joints can be masked, where the spatial information of the joint in the training data set is replaced with a predefined value representing the masked joint, or simply omitted.

As an example. In the third training iteration, the second of the m joints can be masked, and so on, until all m joints are masked, and only the spatial information of n joints is provided to the machine learning model.

This process is illustrated in Figures 5a-5d. Specifically, in Figure 5A, machine learning model 400 is provided with a training input data set. In this embodiment, the training input data includes spatial information corresponding to a plurality of different postures of the articulated object, including the first posture 502A and the second posture 502B.

Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture

In Figure 5A, we provide the spatial information of n m joints for the articulated object of the machine learning model. In this simplified representation of the human body, each circle representing a joint is represented by a white fill pattern. However, in Figure 5B we have shielded 504A as shown with a black fill pattern representing the circle

of connector 504A

In other words, Figure 5A represents the initial iteration of the training process, in which the spatial information of all n m joints is provided to the machine learning model. Figure 5B shows the second iteration of the training process, in which the first joint 504A

of the m joints is masked.

Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture

In Figure 5C, the second joint 504B among the m joints represented by the hinge is blocked. Similarly, in Figure 5D, the third joint among the m joints is occluded. Multiple training iterations can be continued until the spatial information of each of the m joints is masked, and only the spatial information of n joints is provided to the machine learning model.

In the above scenario, we describe the situation where the articulated object is the whole body of the human body. However, articulated objects can also take other forms

Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture

As shown in Figure 7, the articulated object is the human hand, not the entire human body. Specifically, Figure 7 shows an example machine learning model 700.

The machine learning model 700 receives spatial information for joints J1, J2, and J3, which correspond to the three joints 704A-C of an articulated object, in this example taking the form of a human hand 706.

In this case, specifically, the n joints include one or more finger joints of the human hand. The spatial information of one or more finger joints details the parameters of one or more fingers or finger segments of the human hand. For example, spatial information may specify the position/orientation of the fingers of the hand, and/or the rotation applied to the joints of the hand

Any suitable method may be used to collect joint space information, such as via position sensor 708. For example, a position sensor could take the form of a camera configured to image the hand. As another example, a position sensor may include an appropriate radio frequency antenna configured to expose the hand surface to an electromagnetic field and evaluate the effect of movement and proximity of conductive human skin on the electromagnetic field impedance at the antenna

According to the input spatial information 702, the machine learning model will output a set of predicted spatial information 710. Spatial information 710 may be used to construct the predicted pose of the articulated object. As mentioned earlier, this spatial information can represent the position and orientation of body parts of an articulated object

Related Patents: Microsoft Patent | Pose prediction for articulated object

Microsoft originally submitted a patent application called "Pose prediction for articulated object" in June 2022, and the application was recently published by the US Patent and Trademark Office

The above is the detailed content of Microsoft proposes patented technology for predicting the posture of articulated objects for AR/VR body posture capture. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Apple AIGC patent: AR/VR virtual scenes can be generated through voice commands Apple AIGC patent: AR/VR virtual scenes can be generated through voice commands Jul 16, 2023 pm 05:49 PM

In February last year, Meta demonstrated a virtual world development function based on AIGC gameplay: BuilderBot. It is characterized by the ability to generate corresponding elements in the virtual scene by recognizing voice commands, which can simplify the difficulty of VR scene generation. In a recent Apple patent published by the USPTO, a similar concept was also pointed out. For example, users can command the Siri voice assistant to add virtual furniture to the physical scene, change the color of the furniture, etc. According to Qingting.com, the patent number is US20230206912A1, which mainly describes a series of new Siri voice commands that can be used to control word processors (enter text, modify fonts, etc.), XR scenes, etc. Although it is not specifically designed for XR, it is patented

Configuring Linux systems to support AR/VR and augmented reality development Configuring Linux systems to support AR/VR and augmented reality development Jul 05, 2023 am 11:17 AM

Configuring Linux systems to support AR/VR and augmented reality development Introduction: With the development of augmented reality (AR) and virtual reality (VR) technologies, more and more developers are beginning to pay attention to these emerging areas. For AR/VR and augmented reality development, it is very important to configure your Linux system correctly. This article will introduce you to how to configure a Linux system to support AR/VR and augmented reality development, and provide corresponding code examples. Step 1: Install the Necessary Software Before starting, make sure your Linux system has

AR/VR solution provider Vection acquires VR retail solution provider Invrsion AR/VR solution provider Vection acquires VR retail solution provider Invrsion Jun 09, 2023 pm 10:19 PM

According to news on June 8, AR/VR solution company Vection announced the acquisition of VR smart retail solution provider Invrsion, aiming to accelerate XR solutions and market opportunities in the fashion and retail fields. It is reported that after the acquisition is completed, Invrsion’s technology will be integrated with Vection’s INTEGRATEDXR suite to better meet customer needs. According to Qingting.com, Invrsion was founded in 2015. It previously focused on 3D product digitalization business, providing customers with interactive simulation solutions based on 6DoFVR. Application scenarios include virtual exhibition halls, 3D e-commerce, industry, real estate, etc. On the client side, the company has served Walgreens, Ferrero, Nestlé, Coca-Cola, D

Microsoft patent proposes monitoring light source solution for AR/VR light projection calibration interference Microsoft patent proposes monitoring light source solution for AR/VR light projection calibration interference Aug 25, 2023 pm 01:01 PM

(Nweon August 24, 2023) XR equipment usually includes a left projector coupled to the left beam path and a right projector coupled to the right beam path. The left projector is configured to generate a left image and then propagate the left image through the left beam path to the user's left eye. The correct projector is configured to generate the correct image, which then travels through the correct beam path to the user's right eye. The structure of this XR device may change due to regular use, temperature changes, and/or shock. When the structure of the R device changes, the image may lose its visual axis and become incorrectly aligned. This problem may become severe in glasses-shaped devices. Have XR equipment configured to project a calibration image and use the calibration image to determine if the display is correct

Dig hundreds of AR/VR patents and explore all aspects of Apple XR Dig hundreds of AR/VR patents and explore all aspects of Apple XR Jun 03, 2023 pm 11:40 PM

Recently, everyone has also seen a lot of news about Apple XR. Instead of looking at various revelations, it is better to start with patents and see what different ways Apple XR headset may play. Therefore, we selected from Apple’s hundreds of AR/VR patents in recent years and compiled the ones that may directly determine and affect the XR product experience. Of course, these patents will not be concentrated on one product, and we cannot tell you exactly what Apple XR looks like, but we can still get a glimpse of it through the patents. When sorting out Apple's AR/VR patents in recent years, we mainly found the following characteristics: 1) Some AR/VR patents come from previously acquired companies, such as PrimeSense, Metaio, NextVR, SMI, Spaces, etc.; 2) Structured light

The future of the design industry in the era of artificial intelligence The future of the design industry in the era of artificial intelligence Apr 10, 2023 am 09:51 AM

Will artificial intelligence (AI) take over design work? Will it replace designers in the future? When artificial intelligence is mentioned, it is immediately portrayed as a replacement for humans. While there is no doubt that artificial intelligence will change the status quo of design work, the idea that this intelligent technology will replace humans is not entirely accurate. As technology develops and the economy changes, it is natural for business processes to change, and the design work process is also affected by this. As we learn how artificial intelligence will profoundly change the design process, (designers) should not view artificial intelligence as a threat, but should focus on the opportunities that artificial intelligence brings to the design field, its impact on design practices and design principles. impact, and how the work of designers will change. The role of AI in two design contexts To understand the role of AI in design

The United States opens the 6GHz frequency band for AR/VR, and Meta, Apple, and Google have expressed important measures The United States opens the 6GHz frequency band for AR/VR, and Meta, Apple, and Google have expressed important measures Oct 20, 2023 am 11:25 AM

(Nweon, October 20, 2023) According to CNBC, the US Federal Communications Commission (FCC) recently agreed to open the 6ghz frequency band to "a new class of extremely low-power devices", such as wearable devices, and this move will Helping pave the way for new applications for augmented reality and virtual reality wearables. The FCC stated in its statement that it expects opening this band will "stimulate the development of an ecosystem of cutting-edge applications, including wearable technology, augmented reality and virtual reality, that will help enterprises, increase learning opportunities, and advance health care opportunities." , and bring new entertainment experiences." Meta was the first to express excitement about this news. The company’s Kevin Martin said in a statement

Is Apple exhausted? Former marketing executive is not optimistic about Apple AR/VR headsets. Is Apple exhausted? Former marketing executive is not optimistic about Apple AR/VR headsets. May 26, 2023 pm 11:39 PM

The home of Apple fans, professional Apple mobile phone technology research for ten years! Apple Experts Around You ~ Apple will launch an AR/VR headset called “Reality Pro” in the near future. This news is no longer a secret, but as the release date approaches, more and more people seem to be interested in this Mixed reality products are not promising. This includes Apple insiders, and even Apple CEO Tim Cook has a less than positive attitude towards the project. Although Palmer Luckey, the founder of the VR headset company Oculus, praised the product after experiencing the Apple headset, he still cannot avoid other skeptical voices. Recently, former Apple marketing executive Michael Guttenberg (Mic

See all articles