Chat now with support
Chat with Support

Classification Module 6.1.3 - User Guide

Introduction Deploying Classification in Identity Manager Configuring Classification: Taxonomies, Categories, and Rules
An Overview of Classification Configuration Steps Required to Implement Classification Creating Taxonomies Implementing Rules for Automated Categorization Classifying Resources When Do Categorization and Classification Occur? Managing the Life Cycle of Taxonomies and Categories
Working with Categorized Resources Appendix A: PowerShell Commands Appendix B: Oracle Configuration Appendix C: Classifying Data with Data Governance Templates Appendix D: Creating a Taxonomy to Classify Data

Sample Advanced Text Extractors Details

For details on editing the templates to better suit your needs, see Working with Text Extractors.

An advanced text extractor contains the following components:

Advanced Text Extractor Components
Component Description
Extractors Details the information to be extracted from resource files for analysis by the rules engine. Extractors are refined by grammars.
Grammars A collection of entities that refine the matches made by extractors by either excluding unnecessary patterns, or building new compound patterns. For example, the 'First Name' and 'Last Name' entities provided in the .ecr libraries can be combined within a custom grammar to form the 'Full Names' entity.

Note: The grammar name cannot begin with a number.

Grammars can be defined by headwords. This is a component that allows an analyst to define their own word libraries to be used in extractors.

Rather than using regular expression within a <pattern> tag with zero or more additional parameters, you can create an entity with headwords that you can then reference.

This facilitates updating custom lists. You can add or remove headwords from the entity created for them, and any extractor referencing that entity will be automatically updated.
Patterns A grammar defines patterns for matching text in a document. A pattern is a combination of characters and operators. An operator is a sequence of special characters that match text by following the rules associated with the operator.

Credit Card and Banking

Extractor Details
Name ID Use to find...
Credit Card Number - Delimited Extrators.Credit.Card.Delim Credit card delimited numbers of 13 - 16 digits
Credit Card Number - Non Extrators.Credit.Card.NoDelim Credit card non- delimited numbers of 13 - 16 digits
Credit Card Provider Extrators.Credit.Card.Providers Names of major credit card providers
Bank Account Number - IBAN Extractors.Bank.IBAN International bank account numbers
Bank Account Type - IBAN Extracors.IBAN.Name “IBAN” or “International Bank Account Number”

Grammar File contents
Name Use to MAtch Patterns for...
number_cc.ecr
number_iban.ecr
Credit card numbers
International Bank Account Number
Headwords Major credit card providers, “IBAN”

Grammar Pattern Match Details

  • Credit card numbers
    number_cc/cc/delim
    Pattern Description
    (?A^number_cc/cc/delim) Any dash-delimited credit card number
    (?A^number/ccds) Any space-delimited credit card number
    number_cc/cc/nodelim
    Pattern Description
    (?A^number_cc/cc/nodelim) Any non-delimited credit card values
  • number_bank/banking/iban
    Pattern Description
    (?A^number_bank/banking/ibans) Space-delimited IBAN number
    (?A^number_bank/banking/iban) Undelimited IBAN number
    number/bank/IBANName
    Pattern Description
    Defined by the following headwords:"iban" and ‘international bank account" Case insensitive use of the acronym and full name for “International bank account”
    Bank accounts
  • Credit Card Providers
    number_cc/cc/providers
    Sample Headwords Description
    Defined by the following: “american express", "visa", "amex", "austrailian bankcard", "diners", "discover", "voyager", "jcb", "enroute", "mastercard", "credit card", "eurocard", "union pay" Case insensitive use of major credit card providers

National Identifiers

Extractor Details
Name ID Use to Find...
National Identifiers Delimited US CA GB Extractors.National.Identity.cards.Delim Delimited National Identifiers for Great Britain, Canada, and USA
National Identifiers Non-Delimited US CA GB Extractors.National.Identity.cards.NoDelim Non-delimited National Identifiers for Great Britain, Canada, and USA
National Identifiers France Extractors.National.Identity.cards.France French INSEE identification numbers
National Identifiers Spain Extractors.National.Identity.cards.Spain Spanish national identifier numbers
National Identity Card Names GB US Canada Extractors.National.Identity.Cards.Names.Eng Names of National Identity Cards for Great Britain, Unites States, and Canada (Such as Social Insurance Number, SIN, Social Security Number, SSN, National Insurance, NI, numéro d'assurance sociale, nas, medicare.)
National Identity Card Names France Extractors.National.Identity.Cards.Names.France 'INSEE' or the fully spelled out version
Spanish National Identity Card Names Extractors.National.Identity.Cards.Names.Spain Names of the Spain national identity cards

Grammar File Details
Name Use to MAtch PAtterns for...
number_ni_gb.ecr, number_sin_ca.ec, number_ss_us.ecr, number_insee_fr.ecr National Identifiers for Great Britain, Canada, USA, France, and Spain
RegEx and Headwords National Identifiers for Spain, Names of National Identity Cards for Great Britain, Unites States, Canada, France, Spain

Grammar Pattern Match Details

  • National Identifiers Delimited US CA GB
    number_NID/NationalID/delim
    Pattern Description
    (?A^number/nids/gb) Space-delimited UK NI number
    (?A^number/nidh/gb) Hyphen-delimited UK NI numbers
    (?A^number/sinds/ca) Canadian space-delimited social insurance number
    (?A^number/sindh/ca) Canadian undelimited social insurance number
    (?A^number/ssdh/us) Dash-delimited social security number
    (?A^number/ssds/us) Space-delimited Social security number
    (?A^number/medicareid/us) Medicare ID
    number_NID/NationalID/nodelim
    Pattern Description
    (?A^number/nidn/gb) Non-delimited UK NI number
    (?A^number/sindn/ca) Canadian non-delimited social insurance number
    (?A^number/ssdn/us) Non-delimited American social security number
  • number_NID/NationalID/France
    Pattern Description
    (?A^number/insee/fr) French INSEE number
    National Identifiers France
  • RegEx Description
    (?i:(?<!-)\b[0-9]{8}\-\p{L}\b(?!-)|(?<!-)\b\p{L}\-[0-9]{7}\-\p{L}\b(?!-)) This matches the Spanish National Identifier formats A-9999999-A and 99999999-A
    Spanish National Identity Card
  • National Identity Card Names GB US Canada
    number_NID/NationalID/NamesEng
    Sample HEadwords Description
    Defined by the following headwords: "social insurance", "sin", "social security", "ssn", "medicare", "NINO", "national insurance", "numéro d'assurance sociale" Search for case insensitive use of National Identifiers for Great Britain, Unites States, and Canada
  • number_NID/NationalID/NamesFrance
    Sample Headwords Description
    Defined by the following headwords: "insee", "institut national de la statistique et des études économiques" Search for case insensitive use of National Identifiers for France
    National Identity Card Names France
  • number_NID/NationalID/NamesSpain
    Sample HEadwords Description
    Defined by the following headwords: "nacional de identidad", "ndi", “identificación de extranjeros", "nie" Search for case insensitive use of National Identifiers for Spain
    Spanish National Identity Card Names

Telephone Numbers

Extractor Details
Name ID Use to Find...
Telephone numbers US Extractors.Phone.US.Delim Delimited American telephone numbers
Telephone numbers Canada Extractors.Phone.Canada.Delim Delimited Canadian telephone numbers
Telephone Description Extractors.Phone.Name “tel”, “telephone”, “mobile” and “cell”
Telephone Numbers France Extractors.Phone.France French phone numbers
Telephone Numbers Germany Extractors.Phone.Germany German phone numbers
Telephone numbers Great Britain Extractors.Phone.GB Great Britain telephone numbers

Grammar File Details
Name Use to find...
number_phone_us.ecr, number_phone_ca.ecr, number_phone_gb.ecr, number_phone_de.ecr, number_phone_es.ecr, number_phone_fr.ecr' Numeric phone numbers for Canada, US, Great Britain, Germany, France and Spain
RegEx and Headwords Descriptive words for 'telephone' in French, German, Spanish and English

Grammar Pattern Match Details

  • Telephone Numbers
    number_tel/phone/USDelim Note: Set TangibleCharacters to include '+' and '(' for correct performance by this entity.
    Pattern Description
    (?A^number_tel/phone/USdelim) Hyphen delimited numeric American telephone numbers
    number_tel/phone/CADelim Note: Set TangibleCharacters to include '+' and '(' for correct performance by this entity.
    Pattern Description
    (?A^phone/numdh/ca) Hyphen delimited numeric Canadian telephone numbers
    number_tel/phone/fr
    Pattern Description
    (?A^phone/all/fr) French telephone numbers
    number_tel/phone/de
    Pattern Description
    (?A^phone/all/de) German telephone numbers
    number_tel/phone/es
    Pattern Description
    (?A^phone/all/es) Spanish telephone numbers
    number_tel/phone/desc
    Sample HEadwords Description
    Defined by the following headwords: "tel", “telephone", "telefax", "telefon”, "mobile", "fax", "cell", "téléphone", "tél" Search for case insensitive use of telephone descriptions
    number_tel/phone/GB Telephone numbers Great Britain
    Pattern Description
    (?A^phone/all/gb) Any complete phone number in the UK
Related Documents

The document was helpful.

Select Rating

I easily found the information I needed.

Select Rating