Unicode::Regex::Set
Unicode::Regex::Set provides subtraction and intersection of character sets in Unicode regular expressions.
Unicode::Regex::Set Ranking & Summary
- License:
- Perl Artistic License
- Price:
- FREE
- Publisher Name:
- SADAHIRO Tomoyuki
- Publisher web site:
- http://search.cpan.org/~sadahiro/ShiftJIS-String-1.04/String.pod
Unicode::Regex::Set Description
SYNOPSIS

    use Unicode::Regex::Set qw(parse);
    $regex = parse('[\p{Latin} & \p{L&} - A-Z]');

Perl 5.8.0 lacks the subtraction and intersection of character sets described in Unicode Regular Expressions (UTS #18). This module provides a syntax that mimics character classes with subtraction and intersection, taking advantage of look-ahead assertions.

The syntax provided by this module is considerably incompatible with standard Perl regex syntax.

Any whitespace character (one that matches /\s/) is allowed between tokens. Square brackets ('[' and ']') are used for grouping. Literal whitespace and literal square brackets must be escaped with a backslash ('\'). You cannot put a literal ']' at the start of a group.

A POSIX-style character class like [:alpha:] is allowed, since its '[' is not a literal.

Separators ('&' for intersection, '|' for union, and '-' for subtraction) should be surrounded by one or more whitespace characters. E.g. [A&Z] is a list of 'A', '&', 'Z'. [A-Z] is a character range from 'A' to 'Z'. [A-Z - Z] is the set obtained by removing [Z] from [A-Z].

The union operator '|' may be omitted. E.g. [A-Z | a-z] is equivalent to [A-Z a-z], and also to [A-Za-z].

The intersection operator '&' has high precedence, so [\p{A} \p{B} & \p{C} \p{D}] is equivalent to [\p{A} | [\p{B} & \p{C}] | \p{D}].

The subtraction operator '-' has low precedence, so [\p{A} \p{B} - \p{C} \p{D}] is equivalent to [[\p{A} | \p{B}] - [\p{C} | \p{D}]].

[\p{A} - \p{B} - \p{C}] is the set obtained by removing \p{B} and \p{C} from \p{A}. It is equivalent to [\p{A} - [\p{B} \p{C}]] and [\p{A} - \p{B} \p{C}].

Negation. When '^' comes just after a group-opening '[', i.e. when they are combined as '[^', all the tokens that follow are negated. E.g. [^A-Z a-z] matches anything that is in neither [A-Z] nor [a-z]. You can say this more clearly with grouping, as [^ [A-Z a-z]].

If a '^' that is not next to '[' is prefixed to a sequence of literal characters, character ranges, and/or metacharacters, that '^' negates only that sequence. E.g. [A-Z ^\p{Latin}] matches A-Z or a non-Latin character. But [A-Z [^\p{Latin}]] (or [A-Z \P{Latin}], since this is a simple case) is recommended for clarity.

If you want to remove anything other than PERL from [A-Z], use [A-Z & PERL] as well as [A-Z - [^PERL]]. Similarly, if you want to intersect [A-Z] with anything that is not JUNK, use [A-Z - JUNK] as well as [A-Z & [^JUNK]].

Requirements:
- Perl
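The description says the module builds on look-ahead assertions. The following is a minimal sketch of that general idea in core Perl only, without the module itself. The patterns are hand-written illustrations, not the output of parse(), and the helper in_set and the variable names are hypothetical.

```perl
use strict;
use warnings;

# Intersection: (?=[A-Z]) peeks at the next character without consuming it,
# then [PERL] consumes it. Both classes must accept the same character.
# This corresponds roughly to the set syntax [A-Z & PERL].
my $intersection = qr/(?=[A-Z])[PERL]/;

# Subtraction: (?![JUNK]) rejects the unwanted characters first,
# then [A-Z] consumes whatever remains. Roughly [A-Z - JUNK].
my $subtraction = qr/(?![JUNK])[A-Z]/;

# Unicode properties combine the same way: Latin characters except A-Z.
my $latin_minus_az = qr/(?![A-Z])\p{Latin}/;

# Hypothetical helper: true (1) if the single character $ch is in the set.
sub in_set {
    my ($re, $ch) = @_;
    return $ch =~ /\A$re\z/ ? 1 : 0;
}

# "\x{E9}" is LATIN SMALL LETTER E WITH ACUTE.
for my $ch ('P', 'Q', 'J', "\x{E9}") {
    printf "U+%04X: intersection=%d subtraction=%d latin_minus_az=%d\n",
        ord($ch),
        in_set($intersection,   $ch),
        in_set($subtraction,    $ch),
        in_set($latin_minus_az, $ch);
}
```

Each look-ahead is zero-width, so the combined pattern still matches exactly one character, which is what lets it stand in for a single character class.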