Charset

A Charset object represents a set of Unicode characters that can be used with Scanner objects to parse strings for specific ranges of a string that match the characters in the charset.

The Charset class is not subclassable.

Class Methods

constructor([string])

Creates a new Charset object that contains all of the characters present in the provided string (or, if no string is provided, no characters).

Class Properties

alphanumeric

The character set containing all alphanumeric characters, as defined in Unicode General Category L*, M*, and N*.

digits

The character set containing all decimal digit characters, as defined in Unicode Category Decimal Numbers.

letters

The character set containing all letter characters, as defined in Unicode General Category L* & M*.

lower

The character set containing all lowercase letter characters, as defined in Unicode General Category Ll.

newlines

The character set containing all newline characters (U+000A ~ U+000D, U+0085, U+2028, and U+2029).

symbols

The character set containing all symbol characters, as defined in Unicode General Category S*.

upper

The character set containing all uppercase letter characters, as defined in Unicode General Category Lu and Lt.

whitespace

The character set containing all whitespace characters, as defined in Unicode General Category Zs and CHARACTER TABULATION (U+0009).

whitespaceAndNewlines

The character set containing all whitespace and newline characters, as defined in Unicode General Category Z*, U+000A ~ U+000D, and U+0085.

Methods

concat(charset[, …])

Returns a new character set that concatenates all characters in the receiver as well as any other charsets provided as arguments.

intersect(charset)

Returns a new character set that forms an intersection of the receiver as well as any other charsets provided as arguments.

invert()

Returns a new character set that represents the inverse of the receiver.