Reusable constraint types to use with typing.Annotated

Overview

annotated-types

PEP-593 added typing.Annotated as a way of adding context-specific metadata to existing types, and specifies that Annotated[T, x] should be treated as T by any tool or library without special logic for x.

This package provides metadata objects which can be used to represent common constraints such as upper and lower bounds on scalar values and collection sizes, a Predicate marker for runtime checks, and non-normative descriptions of how we intend these metadata to be interpreted. In some cases, we also note alternative representations which do not require this package.

Install

pip install annotated-types

Examples

from typing import Annotated
from annotated_types import Gt, Len, Predicate

def is_prime(n: int) -> bool: ...  # user-defined predicate, assumed to be defined elsewhere

class MyClass:
    age: Annotated[int, Gt(18)]                         # Valid: 19, 20, ...
                                                        # Invalid: 17, 18, "19", 19.0, ...
    factors: list[Annotated[int, Predicate(is_prime)]]  # Valid: 2, 3, 5, 7, 11, ...
                                                        # Invalid: 4, 8, -2, 5.0, "prime", ...

    my_list: Annotated[list[int], 0:10]                 # Valid: [], [10, 20, 30, 40, 50]
                                                        # Invalid: (1, 2), ["abc"], [0] * 20
    your_set: Annotated[set[int], Len(0, 10)]           # Valid: {1, 2, 3}, ...
                                                        # Invalid: "Well, you get the idea!"

Documentation

While annotated-types avoids runtime checks for performance, users should not construct invalid combinations such as MultipleOf("non-numeric") or Annotated[int, Len(3)]. Downstream implementors may choose to raise an error, emit a warning, silently ignore a metadata item, etc., if the metadata objects described below are used with an incompatible type - or for any other reason!

Gt, Ge, Lt, Le

Express inclusive and/or exclusive bounds on orderable values - which may be numbers, dates, times, strings, sets, etc. Note that the boundary value need not be of the same type that was annotated, so long as they can be compared: Annotated[int, Gt(1.5)] is fine, for example, and implies that the value is an integer x such that x > 1.5. No interpretation is specified for special values such as nan.

We suggest that implementors may also interpret functools.partial(operator.lt, 1.5) as being equivalent to Gt(1.5), for users who wish to avoid a runtime dependency on the annotated-types package.

To be explicit, these types have the following meanings:

  • Gt(x) - value must be "Greater Than" x - equivalent to exclusive minimum
  • Ge(x) - value must be "Greater than or Equal" to x - equivalent to inclusive minimum
  • Lt(x) - value must be "Less Than" x - equivalent to exclusive maximum
  • Le(x) - value must be "Less than or Equal" to x - equivalent to inclusive maximum
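
For illustration, here is a minimal sketch (not part of annotated-types; it handles only these four bound types and assumes their gt/ge/lt/le attribute names) of how an implementor might enforce bounds on a value extracted from an Annotated type:

from typing import Annotated, get_args
from annotated_types import Ge, Gt, Le, Lt

def check_bounds(value, annotated_type) -> bool:
    # get_args(Annotated[T, x, ...]) returns (T, x, ...); skip the underlying type.
    _, *metadata = get_args(annotated_type)
    for m in metadata:
        if isinstance(m, Gt) and not value > m.gt:
            return False
        if isinstance(m, Ge) and not value >= m.ge:
            return False
        if isinstance(m, Lt) and not value < m.lt:
            return False
        if isinstance(m, Le) and not value <= m.le:
            return False
    return True

assert check_bounds(2, Annotated[int, Gt(1.5)])      # 2 > 1.5
assert not check_bounds(17, Annotated[int, Gt(18)])  # 17 is not greater than 18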

Interval

Interval(gt, ge, lt, le) allows you to specify an upper and lower bound with a single metadata object. None attributes should be ignored, and non-None attributes treated as per the single bounds above.
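
For example (a sketch of the intended equivalence, using illustrative alias names that are not part of annotated-types), the following two annotations should be treated identically:

from typing import Annotated
from annotated_types import Ge, Interval, Lt

Percentage = Annotated[float, Interval(ge=0, lt=100)]  # one grouped metadata object
AlsoPercentage = Annotated[float, Ge(0), Lt(100)]      # the equivalent single bounds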

MultipleOf

MultipleOf(multiple_of=x) might be interpreted in two ways:

  1. Python semantics, implying value % multiple_of == 0, or
  2. JSONschema semantics, where int(value / multiple_of) == value / multiple_of.

We encourage users to be aware of these two common interpretations and their distinct behaviours, especially since very large or non-integer numbers make it easy to cause silent data corruption due to floating-point imprecision.

We encourage libraries to carefully document which interpretation they implement.
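
A small illustration of how the two interpretations can disagree once floating-point values are involved (plain Python below, nothing specific to annotated-types):

value, multiple_of = 1.0, 0.1

python_semantics = value % multiple_of == 0                             # False: 1.0 % 0.1 ≈ 0.0999...
jsonschema_semantics = int(value / multiple_of) == value / multiple_of  # True: 1.0 / 0.1 == 10.0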

Len

Len() implies that min_inclusive <= len(value) < max_exclusive. We recommend that libraries interpret slice objects identically to Len(), making all the following cases equivalent:

  • Annotated[list, :10]
  • Annotated[list, 0:10]
  • Annotated[list, None:10]
  • Annotated[list, slice(0, 10)]
  • Annotated[list, Len(0, 10)]
  • Annotated[list, Len(max_exclusive=10)]

And of course you can describe lists of three or more elements (Len(min_inclusive=3)), four, five, or six elements (Len(4, 7) - note exclusive-maximum!) or exactly eight elements (Len(8, 9)).

Implementors: note that Len() should always have an integer value for min_inclusive, but slice objects can also have start=None.
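
As a sketch of that note (a hypothetical helper, not part of annotated-types, assuming the Len(min_inclusive, max_exclusive) signature documented above), an implementor might normalize slice metadata like so:

from annotated_types import Len

def normalize_len(metadata):
    # Convert slice metadata such as 0:10 into Len(0, 10); pass Len() through unchanged.
    if isinstance(metadata, Len):
        return metadata
    if isinstance(metadata, slice):
        start = 0 if metadata.start is None else metadata.start  # slices may have start=None
        return Len(start, metadata.stop)                         # stop=None means "no maximum"
    return None  # not a length constraint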

Timezone

Timezone can be used with a datetime or a time to express which timezones are allowed. Annotated[datetime, Timezone(None)] must be a naive datetime. Timezone(...) (literal ellipsis) expresses that any timezone-aware datetime is allowed. You may also pass a specific timezone string or timezone object such as Timezone(timezone.utc) or Timezone("Africa/Abidjan") to express that you only allow a specific timezone, though we note that this is often a symptom of fragile design.
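
For example (illustrative alias names only; none of these aliases are provided by annotated-types):

from datetime import datetime, timezone
from typing import Annotated
from annotated_types import Timezone

NaiveDatetime = Annotated[datetime, Timezone(None)]        # no tzinfo allowed
AwareDatetime = Annotated[datetime, Timezone(...)]         # any timezone-aware value
UtcDatetime = Annotated[datetime, Timezone(timezone.utc)]  # only this specific timezone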

Predicate

Predicate(func: Callable) expresses that func(value) is truthy for valid values. Users should prefer the statically inspectable metadata above, but if you need the full power and flexibility of arbitrary runtime predicates... here it is.
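
For example, a user-defined predicate might look like this (is_even is a hypothetical example, not something provided by annotated-types):

from typing import Annotated
from annotated_types import Predicate

def is_even(n: int) -> bool:
    return n % 2 == 0

EvenInt = Annotated[int, Predicate(is_even)]  # valid: 2, 4, 6, ...; invalid: 1, 3, 5, ...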

We provide a few predefined predicates for common string constraints: IsLower = Predicate(str.islower), IsUpper = Predicate(str.isupper), and IsDigit = Predicate(str.isdigit). Users are encouraged to use methods which can be given special handling, and avoid indirection like lambda s: s.lower().

Some libraries might have special logic to handle known or understandable predicates, for example by checking for str.isdigit and using its presence both to call custom logic that enforces digit-only strings and to customise some generated external schema.
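
A hedged sketch of that idea (a hypothetical schema helper, not part of annotated-types; it relies on the func attribute of Predicate named above and only approximates str.isdigit with a regex):

from annotated_types import Predicate

def predicate_to_json_schema(p: Predicate) -> dict:
    # Recognise a known predicate and emit a schema fragment for it;
    # anything else falls back to runtime checking only.
    if p.func is str.isdigit:
        return {"type": "string", "pattern": r"^\d+$"}  # rough approximation of str.isdigit
    raise NotImplementedError("unrecognised predicate; check it at runtime instead")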

We do not specify what behaviour should be expected for predicates that raise an exception. For example Annotated[int, Predicate(str.isdigit)] might silently skip the invalid constraint, or statically raise an error; or it might try calling it and then propagate or discard the resulting TypeError: descriptor 'isdigit' for 'str' objects doesn't apply to a 'int' object exception. We encourage libraries to document the behaviour they choose.

Design & History

This package was designed at the PyCon 2022 sprints by the maintainers of Pydantic and Hypothesis, with the goal of making it as easy as possible for end-users to provide more informative annotations for use by runtime libraries.

It is deliberately minimal, and, following PEP-593, it allows downstream tools considerable discretion in what (if anything!) they choose to support. Nonetheless, we expect that staying simple and covering only the most common use-cases will give users and maintainers the best experience we can. If you'd like more constraints for your types - follow our lead, by defining them and documenting them downstream!

Comments
  • add GroupedMetadata as a base class for Interval

    The main idea here is to generalize the pattern in Interval so that Pydantic and similar libraries can define their Field as inheriting from GroupedMetadata; anything that can parse annotated-types will then know how to unpack it (if it is not already unpacked by PEP-646), even if it is a custom subclass in a library like Pydantic.

    Without this, some generic annotated-types parser would not know how to handle Pydantic's Field type without specific knowledge of Pydantic.

    opened by adriangb 19
  • I think we should remove `Regex`

    I tried writing up Regex docs that would describe how people actually want to use them, and...

    Regex(regex_pattern=p, regex_flags=x) implies that the string should contain a match for p with flags x, at any position in the string. If you want the full string to match, or the match to be at the start or end of the string, you can use boundary markers like ^...$.

    Regex() can be used with unicode strings or byte strings; if either are allowed we suggest using one Regex() item for each type, and therefore ignoring Regex() items with the wrong string type.

    We do not specify the pattern syntax: libraries may choose to interpret it as the Python stdlib re module, regex package, ECMAscript syntax, etc., and we encourage them to clearly document their chosen interpretation. The meaning of the regex_flags argument is also implementation-defined.

    I think this is sufficiently-implementation-defined that we should just, well, make downstream implementations define their own Regex metadata with more precise semantics. Either that, or we explicitly separate PythonRegex(pattern: str|bytes, flags: int) from JSONschemaRegex(pattern: str) and make people choose one.

    opened by Zac-HD 14
  • Switch to hatchling & pip-tools

    poetry was annoying me; also, its lock file was out of date.

    Changes here:

    • use hatchling for build
    • use pip-compile (from pip-tools) to lock linting and testing requirements
    • tweak pre-commit
    • use pre-commit for linting
    • check github ref before release
    • add python 3.11 classifier
    • remove CI caching - seems to reduce CI time from ~1m20s to <20 seconds, and also simplifies CI setup
    opened by samuelcolvin 8
  • Magic syntax for constraints: `X > 5` -> `Gt(5)`, etc.

    Closes #28; needs some more tests for validation errors and edge cases if we decide to go for it, and of course documentation.

    Importantly, adding most of the magic to our existing Interval and Len classes means that we can keep the API surface minimal, without exposing 'incomplete constraint' objects that don't really mean anything. Note that I've set this up to error out immediately if you try using the comparison magics on something which already represents a constraint. You can still do e.g. Interval(gt=3) < 5; this seems weird but OK to me in that it's very clear what it does and there's no duplication of bounds.

    opened by Zac-HD 6
  • ImportError: cannot import name 'Gt' from 'annotated_types'

    Hi,

    I got an ImportError when I was trying to import Gt:

    ImportError: cannot import name 'Gt' from 'annotated_types'
        (/root/venv/lib/python3.9/site-packages/annotated_types/__init__.py)
    

    I tried installing and using the package on my personal computer (Python 3.9.12) and an EC2 instance (Python 3.9.13). The same error showed. I am wondering how to solve the problem.

    Thanks!

    opened by ychen878 6
  • use type aliases for shorthand constraints

    https://github.com/annotated-types/annotated-types/pull/5#issuecomment-1118094109

    I think these names make more sense when they're wrapping the type, but I'm happy to keep the old ones

    opened by adriangb 6
  • Rethink `max_exclusive`, convert to `max_inclusive`?

    We have MaxLen, and (in pydantic at least) it can be set via a max_length argument.

    I really think this should mean "maximum inclusive", not "maximum exclusive" as currently documented.

    This matches (IMHO) much better with people's assumptions about what MaxLen(5) or max_length=5 or Len(0, 5) means:

    If "the airbnb allows maximum 5 guests", you would assume 5 guests were allowed, not just 4.

    If, for the sake of correctness, that involves either:

    • treating slices differently
    • or, removing the recommendation on allowing slices

    That's sad, but I think it's a price worth paying.

    At the end of the day, max_length=5 meaning any length up to 4 won't fly in pydantic.

    opened by samuelcolvin 5
  • Operator based `BaseMetadata` creation

    Hi all, I really like the idea of BaseMetadata to formulate Annotated types.

    However, configuration using calls such as Gt(10) is not as easy to read as x > 10. It would be nice to provide a non-constraint class All that supports methods such as __gt__, as shown below.

    class All(BaseMetadata):
        def __gt__(self, other):
            return Gt(other)
    
    X = All()  # could be better to keep `All` private and provide `X` publicly.
    Annotated[int, X > 5]  # equivalent to Annotated[int, Gt(5)]
    

    Similar logic can be applied to construct several objects:

    • Interval from 2 < X < 5
    • MultipleOf from X % 3 == 0
    • MaxLen from X.len() <= 10
    opened by hanjinliu 4
  • Chained constraints

    Thought from #28: we could also do something like Exponent = Annotated[int, Gt(0) & MultipleOf(2)].

    Implementing & would be easy: it just creates a GroupedMetadata.

    Other operators would be trickier: Annotated[str, Foo | Bar] should actually be expressed as Annotated[str, Bar] | Annotated[str, Foo], and I have no idea what XOR would be; I think we'd have to offload that to implementers. So I think this idea has short legs.

    opened by adriangb 2
  • Use `any()` instead of for loop & Invert `any/all` to simplify comparisons

    Using Python's any() and all() built-in functions is a more concise way of doing this than using a for loop.

    any() will return True when at least one of the elements evaluates to True, all() will return True only when all the elements evaluate to True.

    opened by yezz123 2
  • List of annotations vs. nested annotations

    Hello, I was brought here from https://github.com/samuelcolvin/pydantic/discussions/4110.

    Perhaps this feature is already available or on the drawing-board, but I just wanted to say that it would be very useful if the user could provide a list of type-annotations like this:

    age: Annotated[int, [Gt(18), Lt(35)]]
    

    instead of having to recursively nest the annotations like this (although nesting should also remain possible, in case it is useful in some applications):

    age: Annotated[Annotated[int, Gt(18)], Lt(35)]
    

    Thanks!

    opened by Hvass-Labs 2
Releases (v0.4.0)
  • v0.4.0 (Oct 12, 2022)

    What's Changed

    • Switch to hatchling & pip-tools by @samuelcolvin in https://github.com/annotated-types/annotated-types/pull/22
    • convert Len to GroupedMetadata, add MinLen and MaxLen by @samuelcolvin in https://github.com/annotated-types/annotated-types/pull/21
    • switch from max_exclusive to max_length (inclusive) by @samuelcolvin in https://github.com/annotated-types/annotated-types/pull/24

    Full Changelog: https://github.com/annotated-types/annotated-types/compare/v0.3.1...v0.4.0

  • v0.3.1 (Sep 25, 2022)

    What's Changed

    • Add BaseMetadata to __all__ by @samuelcolvin in https://github.com/annotated-types/annotated-types/pull/19

    Full Changelog: https://github.com/annotated-types/annotated-types/compare/v0.3.0...v0.3.1

  • v0.3.0 (Sep 25, 2022)

    What's Changed

    • Remove regex from tests by @adriangb in https://github.com/annotated-types/annotated-types/pull/13
    • add GroupedMetadata as a base class for Interval by @adriangb in https://github.com/annotated-types/annotated-types/pull/12
    • Remove regex from tests (again) by @adriangb in https://github.com/annotated-types/annotated-types/pull/14
    • use __init_subclass__ instead of ABC by @adriangb in https://github.com/annotated-types/annotated-types/pull/16
    • add docs for GroupedMetadata and BaseMetadata by @adriangb in https://github.com/annotated-types/annotated-types/pull/15

    Full Changelog: https://github.com/annotated-types/annotated-types/compare/v0.2.0...v0.3.0

  • v0.2.0 (Jun 15, 2022)
