Skip to content

Commit

Permalink
Cleanup index types
Browse files Browse the repository at this point in the history
  • Loading branch information
cyyever committed May 11, 2024
1 parent 1a37190 commit ac6f63a
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions torch/csrc/distributed/c10d/quantization/quantization_gpu.cu
Expand Up @@ -76,8 +76,8 @@ at::Tensor _float_to_bfloat16_cuda(const at::Tensor& input) {
return output;
}

constexpr int threads_per_block = 256;
const int blockDim_x = std::min(output_columns, threads_per_block);
constexpr size_t threads_per_block = 256;
const auto blockDim_x = std::min(output_columns, threads_per_block);
dim3 blockDim(blockDim_x, threads_per_block / blockDim_x);
const auto gridDim_x = (output_columns + blockDim.x - 1) / blockDim.x;
const auto gridDim_y =
Expand Down

0 comments on commit ac6f63a

Please sign in to comment.