Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dataset #22

Open
Tiandishihua opened this issue Apr 22, 2024 · 11 comments
Open

dataset #22

Tiandishihua opened this issue Apr 22, 2024 · 11 comments

Comments

@Tiandishihua
Copy link

Can I use RGB video from a monocular camera for training? We look forward to hearing from you, thank you.

@YuelangX
Copy link
Owner

Yes, you can follow the instruction to preprocess your monocular video for training. Actually, I tested on a set of monocular data. The rendering results from the side view look a lot worse.

freeview.mp4
source.mp4

@jeb0813
Copy link

jeb0813 commented Apr 23, 2024

Hi @YuelangX , for monocular data, how do you get the camera params(extrinsic, intrinsic).

@YuelangX
Copy link
Owner

You could refer to https://github.com/YuelangX/Multiview-3DMM-Fitting. The camera params are manually specified.

@NikoBele1
Copy link

@YuelangX I am trying to use this repo with my monocular data not from Nersemble dataset.

Do you know a way to generate the camera params for the dataset structure required in your repo Multiview-3DMM-Fitting ? As in, frame1.jpg - camera1.npz, frame2.jpg - camera2.npz, etc.

@NikoBele1
Copy link

oh my bad, i thought the camera params where required, i misread that the repository creates them.

@NikoBele1
Copy link

@YuelangX sorry to disturb you again, but i have a question. I am currently training on a monocular video (539 frames, preprocessed with Multiview-3DMM-Fitting) and am at 600 epochs. I used checkpoint 600 to do self-reenactment on my monocular video and this is the result i get:

intermediate_GHA_1_720

This does not look like Gaussian Splatting, i would expect to see larger splats everywhere. Problems i know: The camera params are not correct, since they are manual in Multiview-3DMM-Fitting/preprocess_monocular_video.py. And i noticed that there is no lowres landmark in myDataset/mySubject/landmarks/*/. after preprocess.

Do you have an idea what could be wrong here? I know training is not finished, but i would expect to see a different intermediate result.

@YuelangX
Copy link
Owner

@NikoBele1 This seems strange, are the results during training also like this?

@NikoBele1
Copy link

NikoBele1 commented May 22, 2024

@YuelangX thanks for replying. Tracking looks fine, besides the strange cropping. This is probably because of the manual camera parameters i used from Multiview-3DMM-Fitting/preprocess_monocular_video.py ?
413900

@jryebread
Copy link

jryebread commented Jun 20, 2024

@NikoBele1 hi did you figure out the fix to your issue for custom character? have same issue

@NikoBele1
Copy link

@jryebread what issue do you have? the cropping or the other weird reenactment?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants