For details on editing the templates to better suit your needs, see Working with Text Extractors.
An advanced text extractor contains the following components:
Component | Description |
Extractors | Details the information to be extracted from resource files for analysis by the rules engine. Extractors are refined by grammars. |
Grammars | A collection of entities that refine the matches made by extractors, either by excluding unnecessary patterns or by building new compound patterns. For example, the 'First Name' and 'Last Name' entities provided in the .ecr libraries can be combined within a custom grammar to form the 'Full Names' entity. Note: The grammar name cannot begin with a number. Grammars can also be defined by headwords, a component that allows an analyst to define their own word libraries for use in extractors. Rather than using a regular expression within a <pattern> tag with zero or more additional parameters, you can create an entity with headwords and then reference it. This makes custom lists easier to update: you can add or remove headwords from the entity, and any extractor referencing that entity is updated automatically. |
Patterns | A grammar defines patterns for matching text in a document. A pattern is a combination of characters and operators. An operator is a sequence of special characters that match text by following the rules associated with the operator. |
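The headwords idea described above can be illustrated outside the product with a short Python sketch. This is not the product's grammar syntax or API: the `entities` dictionary, the entity name `iban_names`, and both helper functions are hypothetical, and the whole-word, case-insensitive matching is an assumption about how a headword list might behave.

```python
import re

# Hypothetical store of headword entities: entity name -> editable word list.
entities = {"iban_names": ["iban", "international bank account"]}


def compile_headwords(headwords):
    """Build one case-insensitive pattern matching any headword as a whole word or phrase."""
    # Try longer phrases first so they win over shorter overlapping entries.
    ordered = sorted(headwords, key=len, reverse=True)
    alternation = "|".join(re.escape(word) for word in ordered)
    return re.compile(r"(?<!\w)(?:" + alternation + r")(?!\w)", re.IGNORECASE)


def find_headwords(entity_name, text):
    """Look the entity up at call time, so edits to the list reach every caller."""
    pattern = compile_headwords(entities[entity_name])
    return [match.group(0) for match in pattern.finditer(text)]


text = "Send the IBAN (International Bank Account Number) today."
print(find_headwords("iban_names", text))
# → ['IBAN', 'International Bank Account']

# Adding a headword to the entity changes the result for anything that references it.
entities["iban_names"].append("bank account")
print(find_headwords("iban_names", "Which bank account?"))
# → ['bank account']
```

Because callers refer to the entity by name rather than embedding the word list, the list can be maintained in one place.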
Name | ID | Use to find... |
Credit Card Number - Delimited | Extrators.Credit.Card.Delim | Delimited credit card numbers of 13 to 16 digits |
Credit Card Number - Non-Delimited | Extrators.Credit.Card.NoDelim | Non-delimited credit card numbers of 13 to 16 digits |
Credit Card Provider | Extrators.Credit.Card.Providers | Names of major credit card providers |
Bank Account Number - IBAN | Extractors.Bank.IBAN | International bank account numbers |
Bank Account Type - IBAN | Extracors.IBAN.Name | “IBAN” or “International Bank Account Number” |
Name | Use to match patterns for... |
number_cc.ecr, number_iban.ecr | Credit card numbers, International Bank Account Numbers |
Headwords | Major credit card providers, “IBAN” |
Pattern | Description |
(?A^number_cc/cc/delim) | Any dash-delimited credit card number |
(?A^number/ccds) | Any space-delimited credit card number |
Pattern | Description |
(?A^number_cc/cc/nodelim) | Any non-delimited credit card number |
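The distinction between dash-delimited, space-delimited, and non-delimited card numbers can be sketched in Python as follows. This is only an illustration of the three categories and the 13 to 16 digit range listed above; the function name `classify_card_candidate` is hypothetical, and the logic does not reproduce the patterns shipped in the .ecr libraries.

```python
import re


def classify_card_candidate(candidate):
    """Return 'dash', 'space', or 'none' for a 13-16 digit number, else None."""
    # Non-delimited: one unbroken run of 13 to 16 digits.
    if re.fullmatch(r"\d{13,16}", candidate):
        return "none"
    # Delimited: digit groups joined by a single, consistent delimiter.
    for name, sep in (("dash", "-"), ("space", " ")):
        if re.fullmatch(r"\d+(?:%s\d+)+" % re.escape(sep), candidate):
            digits = candidate.replace(sep, "")
            if 13 <= len(digits) <= 16:
                return name
    return None


print(classify_card_candidate("4111-1111-1111-1111"))  # → dash
print(classify_card_candidate("4111 1111 1111 1111"))  # → space
print(classify_card_candidate("4111111111111111"))     # → none
print(classify_card_candidate("4111-1111 1111-1111"))  # → None
```

Mixed delimiters are rejected here by design, which mirrors the way the delimited and non-delimited cases are handled by separate extractors.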
Pattern | Description |
(?A^number_bank/banking/ibans) | Space-delimited IBAN number |
(?A^number_bank/banking/iban) | Undelimited IBAN number |
Pattern | Description |
Defined by the following headwords: "iban" and "international bank account" | Case-insensitive use of the acronym and full name for "International Bank Account" |
Sample Headwords | Description |
Defined by the following headwords: "american express", "visa", "amex", "austrailian bankcard", "diners", "discover", "voyager", "jcb", "enroute", "mastercard", "credit card", "eurocard", "union pay" | Case-insensitive use of the names of major credit card providers |
Name | ID | Use to Find... |
National Identifiers Delimited US CA GB | Extractors.National.Identity.cards.Delim | Delimited National Identifiers for Great Britain, Canada, and USA |
National Identifiers Non-Delimited US CA GB | Extractors.National.Identity.cards.NoDelim | Non-delimited National Identifiers for Great Britain, Canada, and USA |
National Identifiers France | Extractors.National.Identity.cards.France | French INSEE identification numbers |
National Identifiers Spain | Extractors.National.Identity.cards.Spain | Spanish national identifier numbers |
National Identity Card Names GB US Canada | Extractors.National.Identity.Cards.Names.Eng | Names of National Identity Cards for Great Britain, the United States, and Canada (such as Social Insurance Number, SIN, Social Security Number, SSN, National Insurance, NI, numéro d'assurance sociale, nas, medicare) |
National Identity Card Names France | Extractors.National.Identity.Cards.Names.France | 'INSEE' or the fully spelled-out name |
Spanish National Identity Card Names | Extractors.National.Identity.Cards.Names.Spain | Names of the Spanish national identity cards |
Name | Use to match patterns for... |
number_ni_gb.ecr, number_sin_ca.ecr, number_ss_us.ecr, number_insee_fr.ecr | National Identifiers for Great Britain, Canada, USA, and France |
RegEx and Headwords | National Identifiers for Spain; names of National Identity Cards for Great Britain, the United States, Canada, France, and Spain |
Pattern | Description |
(?A^number/nids/gb) | Space-delimited UK NI number |
(?A^number/nidh/gb) | Hyphen-delimited UK NI number |
(?A^number/sinds/ca) | Canadian space-delimited social insurance number |
(?A^number/sindh/ca) | Canadian hyphen-delimited social insurance number |
(?A^number/ssdh/us) | Dash-delimited social security number |
(?A^number/ssds/us) | Space-delimited social security number |
(?A^number/medicareid/us) | Medicare ID |
Pattern | Description |
(?A^number/nidn/gb) | Non-delimited UK NI number |
(?A^number/sindn/ca) | Canadian non-delimited social insurance number |
(?A^number/ssdn/us) | Non-delimited American social security number |
Pattern | Description |
(?A^number/insee/fr) | French INSEE number |
RegEx | Description |
(?i:(?<!-)\b[0-9]{8}\-\p{L}\b(?!-)|(?<!-)\b\p{L}\-[0-9]{7}\-\p{L}\b(?!-)) | This matches the Spanish National Identifier formats A-9999999-A and 99999999-A |
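To see how this expression behaves, it can be tried in Python. The sketch below makes one substitution that is an assumption of this example: Python's standard `re` module does not support `\p{L}`, so `[^\W\d_]` is used as a close approximation of "any letter". The sample values are made up for illustration.

```python
import re

# Stand-in for \p{L}; the stdlib `re` module has no Unicode property classes.
LETTER = r"[^\W\d_]"

# Format 99999999-A: eight digits, a hyphen, one letter.
FORM_DIGITS_FIRST = r"(?<!-)\b[0-9]{8}-" + LETTER + r"\b(?!-)"
# Format A-9999999-A: one letter, a hyphen, seven digits, a hyphen, one letter.
FORM_LETTER_FIRST = r"(?<!-)\b" + LETTER + r"-[0-9]{7}-" + LETTER + r"\b(?!-)"

SPANISH_ID = re.compile("(?i:" + FORM_DIGITS_FIRST + "|" + FORM_LETTER_FIRST + ")")

sample = "DNI 12345678-Z, NIE X-1234567-L"
print(SPANISH_ID.findall(sample))
# → ['12345678-Z', 'X-1234567-L']

# The lookarounds reject values embedded in longer hyphenated strings.
print(SPANISH_ID.findall("12345678-Z-9"))
# → []
```

The `(?<!-)` and `(?!-)` guards, together with the `\b` word boundaries, keep the expression from matching a fragment of a longer identifier.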
Sample Headwords | Description |
Defined by the following headwords: "social insurance", "sin", "social security", "ssn", "medicare", "NINO", "national insurance", "numéro d'assurance sociale" | Case-insensitive search for the names of National Identifiers for Great Britain, the United States, and Canada |
Sample Headwords | Description |
Defined by the following headwords: "insee", "institut national de la statistique et des études économiques" | Case-insensitive search for the names of National Identifiers for France |
Sample Headwords | Description |
Defined by the following headwords: "nacional de identidad", "ndi", "identificación de extranjeros", "nie" | Case-insensitive search for the names of National Identifiers for Spain |
Name | ID | Use to Find... |
Telephone numbers US | Extractors.Phone.US.Delim | Delimited American telephone numbers |
Telephone numbers Canada | Extractors.Phone.Canada.Delim | Delimited Canadian telephone numbers |
Telephone Description | Extractors.Phone.Name | “tel”, “telephone”, “mobile” and “cell” |
Telephone Numbers France | Extractors.Phone.France | French phone numbers |
Telephone Numbers Germany | Extractors.Phone.Germany | German phone numbers |
Telephone numbers Great Britain | Extractors.Phone.GB | Great Britain telephone numbers |
Name | Use to find... |
number_phone_us.ecr, number_phone_ca.ecr, number_phone_gb.ecr, number_phone_de.ecr, number_phone_es.ecr, number_phone_fr.ecr | Numeric phone numbers for Canada, US, Great Britain, Germany, France, and Spain |
RegEx and Headwords | Descriptive words for 'telephone' in French, German, Spanish and English |
Pattern | Description |
(?A^number_tel/phone/USdelim) | Hyphen-delimited numeric American telephone numbers |
Pattern | Description |
(?A^phone/numdh/ca) | Hyphen-delimited numeric Canadian telephone numbers |
Pattern | Description |
(?A^phone/all/fr) | French telephone numbers |
Pattern | Description |
(?A^phone/all/de) | German telephone numbers |
Pattern | Description |
(?A^phone/all/es) | Spanish telephone numbers |
Sample Headwords | Description |
Defined by the following headwords: "tel", "telephone", "telefax", "telefon", "mobile", "fax", "cell", "téléphone", "tél" | Case-insensitive search for telephone descriptions |
Pattern | Description |
(?A^phone/all/gb) | Any complete phone number in the UK |
© 2025 One Identity LLC. ALL RIGHTS RESERVED.