fix: preserve original dtype in Upsample instead of hardcoding bfloat16 by Mr-Neutr0n · Pull Request #233 · deepseek-ai/Janus

Mr-Neutr0n · 2026-02-11T14:12:47Z

Summary

The Upsample.forward method in janus/models/vq_model.py hardcodes a cast to torch.bfloat16 after interpolation when the input is not float32:

x = F.interpolate(x.to(torch.float), scale_factor=2.0, mode="nearest").to(
    torch.bfloat16
)

This causes a problem when the input tensor is float16 (common on GPUs without native bfloat16 support, e.g. older NVIDIA architectures). The tensor is silently converted from float16 to bfloat16 after interpolation, leading to dtype mismatches with downstream convolution layers that still expect float16 weights.

Fix

Store the original dtype before casting to float32 for interpolation, then cast back to it afterward:

orig_dtype = x.dtype
x = F.interpolate(x.to(torch.float), scale_factor=2.0, mode="nearest").to(
    orig_dtype
)

This preserves whatever dtype the input originally had (float16, bfloat16, etc.) instead of always forcing bfloat16.

Test plan

Verified that the fix correctly preserves float16 input dtype through the upsample operation
Verified that bfloat16 inputs continue to work as before (orig_dtype would be bfloat16, matching the previous behavior)
No functional change for float32 inputs (the else branch is unchanged)

fix: preserve original dtype in Upsample instead of hardcoding bfloat16

0e8fa03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: preserve original dtype in Upsample instead of hardcoding bfloat16#233

fix: preserve original dtype in Upsample instead of hardcoding bfloat16#233
Mr-Neutr0n wants to merge 1 commit into
deepseek-ai:mainfrom
Mr-Neutr0n:fix/upsample-preserve-dtype

Mr-Neutr0n commented Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Mr-Neutr0n commented Feb 11, 2026

Summary

Fix

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant