AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation

1 Tongji University, 2 Tsinghua University, 3 Shengshu, 4 Xi'an Research Institute of High-Tech, 5 Fudan University, 6 Zhejiang University

ECCV 2024

Overview

AnimatableDreamer turns a monocular video into a skeleton and a text-guided 3D model.

Abstract

Advances in 3D generation have facilitated sequential 3D model generation (a.k.a. 4D generation), yet its application to animatable objects with large motion remains scarce. Our work proposes AnimatableDreamer, a text-to-4D generation framework capable of generating diverse categories of non-rigid objects on skeletons extracted from a monocular video. At its core, AnimatableDreamer is equipped with our novel optimization design dubbed Canonical Score Distillation (CSD), which lifts 2D diffusion to temporally consistent 4D generation. CSD, designed from a score gradient perspective, generates a canonical model that is robust to warping across different articulations. Notably, it also enhances the authenticity of bones and skinning by integrating inductive priors from a diffusion model. Furthermore, with multi-view distillation, CSD infers invisible regions, thereby improving the fidelity of monocular non-rigid reconstruction. Extensive experiments demonstrate the capability of our method in generating highly flexible text-guided 3D models from monocular video, while also showing improved reconstruction performance over existing non-rigid reconstruction methods.
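
To make the idea of a canonical model warped by bones and skinning more concrete, the following is a minimal sketch of generic linear blend skinning in plain Python. It is not the authors' implementation, and the abstract does not specify the warping model actually used. All function names are hypothetical, and the distance-based softmax skinning weights are an assumption chosen only for illustration.

```python
import math

def rotation_z(angle):
    """Return a 3x3 rotation matrix about the z-axis as nested lists."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def apply_rigid(rotation, translation, point):
    """Apply the rigid map x -> R x + t to a single 3D point."""
    return [sum(rotation[i][j] * point[j] for j in range(3)) + translation[i]
            for i in range(3)]

def skinning_weights(point, bone_centers, sharpness=10.0):
    """Assumed weighting: softmax over negative squared distance to each bone."""
    logits = [-sharpness * sum((p - c) ** 2 for p, c in zip(point, center))
              for center in bone_centers]
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]

def warp_point(point, bone_centers, bone_transforms):
    """Warp one canonical point by blending per-bone rigid transforms."""
    weights = skinning_weights(point, bone_centers)
    warped = [0.0, 0.0, 0.0]
    for weight, (rotation, translation) in zip(weights, bone_transforms):
        moved = apply_rigid(rotation, translation, point)
        warped = [acc + weight * m for acc, m in zip(warped, moved)]
    return warped

# Two hypothetical bones: the first stays fixed, the second rotates 90 degrees.
centers = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]]
transforms = [
    (rotation_z(0.0), [0.0, 0.0, 0.0]),
    (rotation_z(math.pi / 2), [0.0, 0.0, 0.0]),
]
canonical_point = [1.0, 0.0, 0.0]
warped_point = warp_point(canonical_point, centers, transforms)
```

In this toy setup the canonical point sits on the second bone, so it follows that bone's rotation almost exactly. A point midway between the two bones would instead land between the two rigid results, which is the blending behaviour that skinning weights control.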


Interactable Skeletons


Generated Animatable Models

"A squirrel with red sweater."   (Squirrel)

"Squirtle."   (Squirrel)

"A fox."   (Squirrel)

"A bear with red hat."   (Cat Pikachu)

"Holstein."   (Cat Pikachu)

"A toy dinosaur."   (Penguin)

"A steampunk penguin."   (Penguin)

"Pine tree in snow."   (Manipulator)

"A cat with armour."   (Cat Pikachu)

"Doraemon."   (Penguin)

"Eagle with crown."   (Finch)

"Penguin."   (Bird)

"A cat with armour."   (Cat Coco)

Reconstructed Animatable Models

Ours   (Squirrel)

BANMo   (Squirrel)

Ours   (Cat Coco)

BANMo   (Cat Coco)

Ours   (Penguin)

BANMo   (Penguin)

Ours   (Cat Pikachu)

Ours   (Manipulator)

Ours   (Hand)

Ours   (Knight)

Ours   (Bird)

Ours   (Finch)

Monocular Videos

Squirrel

Cat Pikachu

Penguin

Cat Coco

Bird

Finch

Knight

Dog Shiba