
Support for uint16, uint32, and uint64 #58734

Open
Tracked by #58743
pmeier opened this issue May 21, 2021 · 22 comments
Labels
ezyang's list Stuff ezyang doesn't want to lose feature A request for a proper, new feature. module: python array api Issues related to the Python Array API oncall: pt2 triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments

@pmeier
Collaborator

pmeier commented May 21, 2021

The array API specification stipulates the data types that we need to support to be compliant. Currently we are missing support for uint16, uint32, and uint64.

cc @mruberry @rgommers @asmeurer @leofang @AnirudhDagar @asi1024 @emcastillo @kmaehashi @ezyang @msaroufim @wconstab @bdhirsh @anijain2305 @zou3519 @gchanan @soumith @ngimel

@pmeier pmeier added the module: python array api Issues related to the Python Array API label May 21, 2021
@H-Huang H-Huang added the triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module label May 22, 2021
@rgommers
Collaborator

Someone just asked about other uint types on Slack. And from pytorch/vision#4326 (comment):

This is because PyTorch doesn't (yet) support uint16, and is also a problem when reading PNG images of type uint16.

My guess beforehand had been that 16-bit image support makes uint16 the most interesting of the missing dtypes.

I don't think there are any plans to work on this issue currently, unless more demand materializes.

@NicolasHug
Member

NicolasHug commented Nov 13, 2021

There are no plans to work on this issue currently I think, unless more demand materializes.

I'll add one :)

Another tangible need for uint16 support in torchvision is pytorch/vision#4731

We added support for native 16-bit PNG decoding in torchvision, but we can't make this API public for now because we output int32 tensors, which wouldn't be compatible with the rest of our transforms.
It would be great if we could make it public, because Pillow's 16-bit PNG support is fairly limited.
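
For context, a minimal sketch of the kind of workaround this forces today (not from the thread; "depth.png" is a placeholder path, and it assumes Pillow decodes the file into a 16-bit mode that NumPy exposes as uint16):

```python
import numpy as np
import torch
from PIL import Image

# Decode a 16-bit PNG with Pillow, then widen to int32 before handing the
# data to torch, since torch.uint16 is not available. 16-bit values always
# fit in int32 without loss, but the tensor is twice as large.
arr = np.asarray(Image.open("depth.png"))        # typically dtype uint16
tensor = torch.from_numpy(arr.astype(np.int32))  # widened copy
print(tensor.dtype, int(tensor.max()))
```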

@kernelmethod

kernelmethod commented Feb 20, 2022

Bumping this.

My research collaborators and I are working on some cryptographic applications where we could really use uint32 / uint64. Some operations on Z_{2^64} that we'd like to calculate with secure multi-party computation, e.g. comparison, would be a lot more straightforward to implement with unsigned integers as the underlying dtype.
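
One common stopgap for the comparison case (a sketch, not from the thread): keep the uint64 bit patterns in int64 tensors and flip the sign bit before a signed comparison, which recovers the unsigned ordering.

```python
import torch

SIGN_BIT = -(2**63)  # bit pattern 0x8000_0000_0000_0000 as an int64 scalar

def unsigned_lt(a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    """Unsigned a < b for int64 tensors that hold uint64 bit patterns."""
    return (a ^ SIGN_BIT) < (b ^ SIGN_BIT)

a = torch.tensor([1, -1])  # bit patterns of 1 and 2**64 - 1
b = torch.tensor([2,  2])
print(unsigned_lt(a, b))   # tensor([ True, False])
```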

@a-gn

a-gn commented Jan 18, 2023

We have a GIS pipeline that uses image transforms from multiple projects, some of which only support uint, but uint8 leads to too much loss of color information. We could really use uint16 support.

@neelnanda-io

I would appreciate uint16 support! I'm trying to do NLP stuff with a large dataset of tokens between 0 and 51000, and it's annoying to consume double the storage to keep them as int32s (I'm currently storing them as uint16 via HuggingFace, but I need to load them as NumPy and manually convert them)
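
For reference, the manual conversion described here looks roughly like this ("tokens.bin" is a hypothetical file name; token ids below 65536 are stored as uint16 on disk and only widened in memory):

```python
import numpy as np
import torch

tokens_u16 = np.fromfile("tokens.bin", dtype=np.uint16)  # compact on disk
tokens = torch.from_numpy(tokens_u16.astype(np.int64))   # widen for use as indices
```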

@oliver-batchelor

I'm doing work on HDR imaging and we read images from the camera as 16-bit unsigned. It's possible to work around it by using other frameworks but it would be really useful.

@VladShtompel

I'm doing work on HDR imaging and we read images from the camera as 16-bit unsigned. It's possible to work around it by using other frameworks but it would be really useful.

This is exactly the issue my team and I are facing right now.

@StrongChris

I'm working with DICOM data that is often 10-bit or even 14-bit unsigned. A uint16 dtype would be very nice for these! My work is focused on speed, so being able to use the smallest possible datatype would be much appreciated.

@ezyang
Contributor

ezyang commented Apr 22, 2023

We should add these dtypes, and then build out support via PT2. We probably aren't going to add kernels for everything but Triton makes it very easy to JIT compile these operations.

@NicolasHug
Member

@ezyang would Triton be able to enable CPU support?

@ezyang
Contributor

ezyang commented Apr 24, 2023

Not Triton per se, but we have a CPU inductor backend, so the answer is yes!
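
A minimal sketch of this PT2 route, assuming a PyTorch build (2.3 or later) where torch.uint16 exists and the compiled op is supported by the inductor CPU backend; eager kernel coverage for these dtypes remains limited:

```python
import torch

@torch.compile
def brighten(x):
    # Inductor JIT-compiles this elementwise op for the input dtype,
    # so no hand-written uint16 eager kernel is required.
    return x + 256

img = torch.tensor([0, 1024, 65000], dtype=torch.uint16)
print(brighten(img))
```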

@soulitzer
Contributor

From triage review: We still need some limited eager support, e.g. factory functions and conversion functions. Autocast also needs some consideration (maybe not too bad?).

@vadimkantorov
Contributor

Also, if I understand correctly, bit ops are only well-defined/standardized on CPUs for unsigned dtypes: #105465
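
The signed/unsigned difference is easy to see with the dtypes that already exist; the same bit pattern shifts differently depending on signedness (illustrative sketch):

```python
import torch

signed   = torch.tensor([-8],  dtype=torch.int8)   # bit pattern 0xF8
unsigned = torch.tensor([248], dtype=torch.uint8)  # same bit pattern

print(signed >> 1)    # arithmetic shift, typically tensor([-4], dtype=torch.int8)
print(unsigned >> 1)  # logical shift, always tensor([124], dtype=torch.uint8)
```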

@vadimkantorov
Contributor

uint16 would also be useful for interop with OpenCV (the CV_16U dtype).

@tchaton

tchaton commented Nov 22, 2023

Hey, torch.uint16 would be good for encoding text into tokens, to reduce the memory footprint compared to uint32 when the vocab isn't too big.

@smdrnks

smdrnks commented Nov 22, 2023

+1, I also have a language modelling use case where uint16 could save quite a bit of memory. Would be great to have this.

@DrDryg

DrDryg commented Dec 7, 2023

I would also appreciate support for unsigned integer types. I'm developing image-processing software with libtorch as a backend, and unsigned integer support would be very useful, in particular uint16, but uint32 and uint64 would be nice too.

@penguinwu penguinwu added the feature A request for a proper, new feature. label Dec 12, 2023
@vadimkantorov
Contributor

A related issue on having uint16 images:

@ezyang ezyang added the ezyang's list Stuff ezyang doesn't want to lose label Jan 1, 2024
@ezyang
Contributor

ezyang commented Jan 1, 2024

Some dumb problems we will have to work out.

  • A uint8 tensor accumulates into an int64 tensor, which makes sense when you don't have a uint64 tensor but makes a lot less sense when you do. This leads to a potential inconsistency with the larger types; in particular, it is an incredibly bad idea for uint64 to accumulate into int64, and uint32 is probably not a good idea either. A short-term stopgap is to leave sum unimplemented, but chances are someone will come asking for it. By the way, this is a use case for defining arithmetic operations on our bits types: uint should have a wider accumulation type to prevent overflow, but bits would never widen and would always do modular arithmetic.
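
The current widening behavior is easy to demonstrate with uint8, and it is exactly the rule that cannot carry over to uint64 (values above 2**63 - 1 would not fit in an int64 accumulator):

```python
import torch

x = torch.full((1000,), 255, dtype=torch.uint8)
s = x.sum()
print(s, s.dtype)  # tensor(255000) torch.int64 -- uint8 accumulates into int64
```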

@vadimkantorov
Contributor

I think it's better to go ahead and have the dtype representations even if meaningful ops are not supported at first and only conversions/casts/reinterprets/restrides are implemented, mainly for interop and faithfulness of representation. As long as there is a dedicated docs page for each dtype explaining the quirks, I think it's fine.

Same reasoning might apply for the following :)

@vadimkantorov
Contributor

Regarding sum, a transitional option might be to require explicit args specifying the out_dtype and acc_dtype? (It would also be nice to elide temporary full-upcasting allocations: #55366)
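
For comparison, torch.sum already accepts an explicit dtype argument (the input is cast to it before the reduction), which is roughly the shape such an API could take; a sketch with an existing dtype:

```python
import torch

x = torch.full((1000,), 255, dtype=torch.uint8)
print(x.sum(dtype=torch.int32))  # accumulate/return as int32 instead of the default int64
```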

pytorchmergebot pushed a commit that referenced this issue Jan 7, 2024
The dtypes are very useless right now (not even fill works), but it makes torch.uint16, uint32 and uint64 available as a dtype.

Towards #58734

Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: #116594
Approved by: https://github.com/albanD
ghstack dependencies: #116698, #116693
@ionutmodo

ionutmodo commented Mar 6, 2024

Hi guys! I would like to add another use case for uint16 on GPUs: designing efficient adaptive sparse optimizers.

thewtex added a commit to thewtex/itk-wasm that referenced this issue Apr 18, 2024
When working with torch, the output is often float32. Torch does not
have good support for conversion to uint types:

  pytorch/pytorch#58734

Support float32 and the signed integer types for convenience.