

2.39.7.6 Named character sets

any

Matches any character. Equivalent to ‘ascii’ in an ASCII context.

nonl

Matches any character other than ‘#\return’ or ‘#\newline’.

ascii

Matches any ASCII character, that is, any character whose code point lies in the range [0, 127].
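
As a minimal sketch of how these three sets can be used, the following assumes an R7RS implementation that exposes SRFI 115 as the library ‘(srfi 115)’ (the library name varies between implementations). The helper names ‘ascii-string?’ and ‘single-line?’ are hypothetical and exist only for illustration.

```scheme
(import (scheme base) (srfi 115))

;; Hypothetical helper: #t when every character of STR is ASCII.
(define (ascii-string? str)
  (regexp-matches? '(* ascii) str))

;; Hypothetical helper: #t when STR contains no line terminator,
;; because nonl excludes #\return and #\newline.
(define (single-line? str)
  (regexp-matches? '(* nonl) str))
```

With these definitions, ‘(ascii-string? "hello")’ and ‘(single-line? "one line")’ should both hold, while ‘(single-line? "two\nlines")’ should not.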

lower-case
lower

Matches any character for which ‘char-lower-case?’ returns true. In a Unicode context this corresponds to the ‘Lowercase’ (‘Ll’ + ‘Other_Lowercase’) property. In an ASCII context this corresponds to ‘(/ "az")’.

upper-case
upper

Matches any character for which ‘char-upper-case?’ returns true. In a Unicode context this corresponds to the ‘Uppercase’ (‘Lu’ + ‘Other_Uppercase’) property. In an ASCII context this corresponds to ‘(/ "AZ")’.

title-case
title

Matches any character with the Unicode ‘Titlecase’ (‘Lt’) property. This property only exists for the sake of ligature characters, of which only 31 exist at time of writing. In an ASCII context this is empty.

alphabetic
alpha

Matches any character for which ‘char-alphabetic?’ returns true. In a Unicode context this corresponds to the ‘Alphabetic’ (‘L’ + ‘Nl’ + ‘Other_Alphabetic’) property. In an ASCII context this corresponds to ‘(w/nocase (/ "az"))’.

numeric
num

Matches any character for which ‘char-numeric?’ returns true. In a Unicode context this corresponds to the ‘Numeric_Digit’ (‘Nd’) property. In an ASCII context this corresponds to ‘(/ "09")’.
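
The following sketch shows ‘numeric’ combined with the SRFI 115 procedure ‘regexp-extract’ to pull digit runs out of a string. It assumes the library is importable as ‘(srfi 115)’; the helper name ‘digit-runs’ is hypothetical.

```scheme
(import (scheme base) (srfi 115))

;; Hypothetical helper: return every maximal run of digits in STR
;; as a list of strings.
(define (digit-runs str)
  (regexp-extract '(+ numeric) str))

(digit-runs "192.168.0.1") ; => ("192" "168" "0" "1")
```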

alphanumeric
alphanum
alnum

Matches any character which is either a letter or number. Equivalent to ‘(or alphabetic numeric)’.
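
As an illustration, ‘alpha’ and ‘alnum’ can be combined to recognize a simple identifier: one letter followed by any number of letters or digits. This is a sketch that assumes the ‘(srfi 115)’ library name; ‘simple-identifier?’ is a hypothetical helper, not part of the SRFI.

```scheme
(import (scheme base) (srfi 115))

;; Hypothetical helper: #t when STR is a letter followed by zero or
;; more letters or digits.
(define (simple-identifier? str)
  (regexp-matches? '(: alpha (* alnum)) str))
```

Under this definition ‘(simple-identifier? "x25")’ should hold, while ‘(simple-identifier? "25x")’ should not, since the first character must be alphabetic.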

punctuation
punct

Matches any punctuation character. In a Unicode context this corresponds to the ‘Punctuation’ (‘P’) property. In an ASCII context this corresponds to ‘"!\"#%&'()*,-./:;?@[\\]_{}"’.

symbol

Matches any symbol character. In a Unicode context this corresponds to the ‘Symbol’ property (‘Sm’, ‘Sc’, ‘Sk’, or ‘So’). In an ASCII context this corresponds to ‘"$+<=>^`|~"’.

graphic
graph

Matches any graphic character. Equivalent to ‘(or alphanumeric punctuation symbol)’.

whitespace
white
space

Matches any whitespace character. In a Unicode context this corresponds to the ‘Separator’ property (‘Zs’, ‘Zl’ or ‘Zp’). In an ASCII context this corresponds to space, tab, line feed, form feed, and carriage return.
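
A common use of this set is splitting text into words. The sketch below relies on the SRFI 115 procedure ‘regexp-split’ and assumes the ‘(srfi 115)’ library name; ‘split-words’ is a hypothetical helper.

```scheme
(import (scheme base) (srfi 115))

;; Hypothetical helper: split STR on runs of one or more whitespace
;; characters (spaces, tabs, and so on).
(define (split-words str)
  (regexp-split '(+ whitespace) str))

(split-words "fee fi  fo\tfum") ; => ("fee" "fi" "fo" "fum")
```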

printing
print

Matches any printing character. Equivalent to ‘(or graphic whitespace)’.

control
cntrl

Matches any control or other character. In a Unicode context this corresponds to the ‘Other’ property (‘Cc’, ‘Cf’, ‘Co’, ‘Cs’ or ‘Cn’). In an ASCII context this corresponds to:

`(/ ,(integer->char 0) ,(integer->char 31))

hex-digit
xdigit

Matches any valid digit in hexadecimal notation. Always ASCII-only. Equivalent to:

(w/ascii (w/nocase (or numeric "abcdef")))
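
For instance, ‘hex-digit’ can validate a six-digit color code of the form ‘#rrggbb’. This sketch assumes the ‘(srfi 115)’ library name and uses the SRFI 115 repetition form ‘(= n sre)’; ‘hex-color?’ is a hypothetical helper.

```scheme
(import (scheme base) (srfi 115))

;; Hypothetical helper: #t when STR is "#" followed by exactly six
;; hexadecimal digits, in either case.
(define (hex-color? str)
  (regexp-matches? '(: "#" (= 6 hex-digit)) str))
```

Here ‘(hex-color? "#1a2B3c")’ should hold, while ‘(hex-color? "#12345g")’ should not, because ‘g’ is not a hexadecimal digit.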
