ClassUnicode in regex_syntax::hir - Rust (original) (raw)
Struct ClassUnicode
pub struct ClassUnicode { /* private fields */ }
Expand description
A set of characters represented by Unicode scalar values.
Create a new class from a sequence of ranges.
The given ranges do not need to be in any specific order, and ranges may overlap. Ranges will automatically be sorted into a canonical non-overlapping order.
Create a new class with no ranges.
An empty class matches nothing. That is, it is equivalent toHir::fail.
Add a new range to this set.
Return an iterator over all ranges in this class.
The iterator yields ranges in ascending order.
Return the underlying ranges as a slice.
Expand this character class such that it contains all case folded characters, according to Unicode’s “simple” mapping. For example, if this class consists of the range a-z
, then applying case folding will result in the class containing both the ranges a-z
and A-Z
.
§Panics
This routine panics when the case mapping data necessary for this routine to complete is unavailable. This occurs when the unicode-case
feature is not enabled.
Callers should prefer using try_case_fold_simple
instead, which will return an error instead of panicking.
Expand this character class such that it contains all case folded characters, according to Unicode’s “simple” mapping. For example, if this class consists of the range a-z
, then applying case folding will result in the class containing both the ranges a-z
and A-Z
.
§Error
This routine returns an error when the case mapping data necessary for this routine to complete is unavailable. This occurs when theunicode-case
feature is not enabled.
Negate this character class.
For all c
where c
is a Unicode scalar value, if c
was in this set, then it will not be in this set after negation.
Union this character class with the given character class, in place.
Intersect this character class with the given character class, in place.
Subtract the given character class from this character class, in place.
Compute the symmetric difference of the given character classes, in place.
This computes the symmetric difference of two character classes. This removes all elements in this class that are also in the given class, but all adds all elements from the given class that aren’t in this class. That is, the class will contain all elements in either class, but will not contain any elements that are in both classes.
Returns true if and only if this character class will either match nothing or only ASCII bytes. Stated differently, this returns false if and only if this class contains a non-ASCII codepoint.
Returns the length, in bytes, of the smallest string matched by this character class.
Returns None
when the class is empty.
Returns the length, in bytes, of the longest string matched by this character class.
Returns None
when the class is empty.
If this class consists of exactly one codepoint, then return it as a literal byte string.
If this class is empty or contains more than one codepoint, then None
is returned.
If this class consists of only ASCII ranges, then return its corresponding and equivalent byte class.
Tests for self
and other
values to be equal, and is used by ==
.
Tests for !=
. The default implementation is almost always sufficient, and should not be overridden without very good reason.