Make3D
Pioneering Monocular 3D Reconstruction and Depth Estimation Framework

Compositional 3D-Aware Human Generation for High-Resolution Photorealistic Avatars
EVA3D is a pioneering technical framework designed for compositional 3D-aware human generation from unstructured 2D image collections. Unlike traditional global Neural Radiance Fields (NeRFs) that struggle with the complex articulations of the human body, EVA3D utilizes a part-based architecture. It decomposes the human figure into manageable, learnable components—such as limbs and torso—within a canonical space, guided by the SMPL (Skinned Multi-Person Linear) body model. This approach allows for unprecedented control over camera viewpoints and body poses while maintaining high-resolution outputs at 512x512 pixels. By 2026, EVA3D has transitioned from a purely academic project to a foundational open-source tool for the metaverse, fashion e-commerce, and synthetic data industries. Its architecture supports efficient rendering and avoids the blurring artifacts common in earlier NeRF-based human synthesizers. The model is particularly optimized for scenarios requiring diverse human assets where 3D scanning is cost-prohibitive, offering a scalable alternative for generating hyper-realistic digital humans that are fully compatible with standard animation pipelines.
Divides the human body into locally learned radiance fields for each body part.
Pioneering Monocular 3D Reconstruction and Depth Estimation Framework
Photorealistic 3D scene reconstruction and cinematic rendering via Neural Radiance Fields.
Verified feedback from the global deployment network.
Post queries, share implementation strategies, and help other users.
Integrates the Skinned Multi-Person Linear model to provide structural priors for 3D human shape.
Uses a progressive growing training strategy and part-aware sampling to achieve 512x512 resolution.
Allows users to manipulate the SMPL parameters to change the generated human's pose.
Ensures that lighting and texture mapping remain stable as the camera orbits the model.
Utilizes Marching Cubes algorithm on the density volume to create 3D meshes.
A sampling strategy that focuses computation on visible body parts during the rendering process.
Generating unique, diverse 3D avatars for virtual environments without manual modeling.
Registry Updated:2/7/2026
Visualizing clothing on varied human body shapes in 3D space.
Lack of high-quality, annotated 3D human data for training pose estimators.