If you find TAR3D useful for your research or applications, please give us a star and cite this paper:
@inproceedings{zhang2025tar3d,
title={Tar3d: Creating high-quality 3d assets via next-part prediction},
author={Zhang, Xuying and Liu, Yutong and Li, Yangguang and Zhang, Renrui and Liu, Yufei and Wang, Kai and Ouyang, Wanli and Xiong, Zhiwei and Gao, Peng and Hou, Qibin and Cheng, Ming-Ming},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
pages={5134--5145},
year={2025}
}
⚙️ Setup
1. Dependencies and Installation
We recommend using Python>=3.10, PyTorch>=2.1.0, and CUDA>=12.1.
conda create --name tar3d python=3.10
conda activate tar3d
pip install -U pip
# Ensure Ninja is installed
conda install Ninja
# Install the correct version of CUDA
conda install cuda -c nvidia/label/cuda-12.1.0
# Install PyTorch and xformers
# You may need to install another xformers version if you use a different PyTorch version
pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu121
pip install xformers==0.0.22.post7
# For Linux users: Install Triton
pip install triton
# Install other requirements
pip install -r requirements.txt
In practice, we first train the encoder and decoder of our VQ-VAE according to the scheme of VAE. Then, we add the vector quantization codebook and fine-tune the entire VQ-VAE.
🔥
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction[Paper] [Project Page] [Jittor Version] [Demo]
🚩 Todo List
If you find TAR3D useful for your research or applications, please give us a star and cite this paper:
⚙️ Setup
1. Dependencies and Installation
We recommend using
Python>=3.10,PyTorch>=2.1.0, andCUDA>=12.1.2. Downloading Datasets
3. Downloading Checkpoints
We are currently unable to access the ckpts stored on the aliyun space used during the internship.
We will retrain a version as soon as possible.
⚡ Quick Start
1. Reconstructing a 3D Geometry with 3D VQ-VAE
2. Conditional 3D Generation
💻 Training
1. Training 3D VQ-VAE
In practice, we first train the encoder and decoder of our VQ-VAE according to the scheme of VAE.
Then, we add the vector quantization codebook and fine-tune the entire VQ-VAE.
2. Training 3D GPT
💫 Evaluation
1. 2D Evaluation (PSNR, SSIM, Clip-Score, LPIPS)
2. 3D Evaluation (Chamfer Distance, F-Score)
🤗 Acknowledgements
We thank the authors of the following projects for their excellent contributions to 3D generative AI!