Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TensorRT 10.5 Flux Dit BF16 precision #4215

Open
QZH-eng opened this issue Oct 21, 2024 · 0 comments
Open

TensorRT 10.5 Flux Dit BF16 precision #4215

QZH-eng opened this issue Oct 21, 2024 · 0 comments

Comments

@QZH-eng
Copy link

QZH-eng commented Oct 21, 2024

Description

When I used TensorRT 10.5 to infer Flux Dit on A800 using BF16 dataType, I found that there was a significant decrease in accuracy, while there was no significant decrease in accuracy when I used Pytorch BF16 to infer

Environment

TensorRT Version:

NVIDIA GPU: A800

NVIDIA Driver Version: 535.54.03

CUDA Version:12.2

CUDNN Version:

Operating System:

Python Version (if applicable):

Tensorflow Version (if applicable):

PyTorch Version (if applicable):

Baremetal or Container (if so, version):

Relevant Files

Model link:

Steps To Reproduce

Commands or scripts:

Have you tried the latest release?:

Can this model run on other frameworks? For example run ONNX model with ONNXRuntime (polygraphy run <model.onnx> --onnxrt):

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant