Main Page | Class Hierarchy | Alphabetical List | Data Structures | Directories | File List | Data Fields | Globals | Related Pages

regex.h File Reference

C++ API: Regular Expressions. More...

#include "unicode/utypes.h"
#include "unicode/uobject.h"
#include "unicode/unistr.h"
#include "unicode/parseerr.h"

Go to the source code of this file.

Defines

#define RegexPatternDump(pat)
 RBBIPatternDump Debug function, displays the compiled form of a pattern.

Enumerations

enum  {
  UREGEX_CANON_EQ = 128, UREGEX_CASE_INSENSITIVE = 2, UREGEX_COMMENTS = 4, UREGEX_DOTALL = 32,
  UREGEX_MULTILINE = 8, UREGEX_UWORD = 256
}
 Constants for Regular Expression Match Modes. More...


Detailed Description

C++ API: Regular Expressions.

Regular Expression API

The ICU API for processing regular expressions consists of two classes, RegexPattern and RegexMatcher. RegexPattern objects represent a pre-processed, or compiled regular expression. They are created from a regular expression pattern string, and can be used to create <regexmatcher> objects for the pattern.

Class RegexMatcher bundles together a regular expression pattern and a target string to which the search pattern will be applied. RegexMatcher includes API for doing plain find or search operations, for search and replace operations, and for obtaining detailed information about bounds of a match.


Define Documentation

#define RegexPatternDump pat   ) 
 

RBBIPatternDump Debug function, displays the compiled form of a pattern.

For internal use only.


Enumeration Type Documentation

anonymous enum
 

Constants for Regular Expression Match Modes.

ICU 2.4

Enumeration values:
UREGEX_CANON_EQ  Forces normalization of pattern and strings.

ICU 2.4

UREGEX_CASE_INSENSITIVE  Enable case insensitive matching.

ICU 2.4

UREGEX_COMMENTS  Allow white space and comments within patterns ICU 2.4.
UREGEX_DOTALL  If set, '.

' matches line terminators, otherwise '.' matching stops at line end. ICU 2.4

UREGEX_MULTILINE  Control behavior of "$" and "^" If set, recognize line terminators within string, otherwise, match only at start and end of input string.

ICU 2.4

UREGEX_UWORD  Unicode word boundaries.

If set, uses the Unicode TR 29 definition of word boundaries. Warning: Unicode word boundaries are quite different from traditional regular expression word boundaries. See http://unicode.org/reports/tr29/#Word_Boundaries ICU 2.8


Generated on Wed May 18 17:29:15 2005 for ICU 2.8 by  doxygen 1.4.2