立即与支持人员聊天
与支持团队交流

Classification Module 6.1.1 - User Guide

Introduction Deploying Classification in Identity Manager Configuring Classification: Taxonomies, Categories, and Rules
An Overview of Classification Configuration Steps Required to Implement Classification Creating Taxonomies Setting Up Manual Categorization Implementing Rules for Automated Categorization Classifying Resources When Do Categorization and Classification Occur? Importing and Exporting Taxonomies Working with a Taxonomy XML File Managing the Life Cycle of Taxonomies and Categories Advanced Rule Applications
Working with Categorized Resources Appendix A: PowerShell cmdlets Appendix B: Oracle Configuration Appendix C: Classifying Data with Data Governance Templates Glossary

Determining Why Categorizations Were Applied to a Resource

A number of factors determine a resource’s categorization. The text in the resource is evaluated against all the rules in the system. When a rule matches, its match strength is determined. This, in combination with the weight of the rule associated with a category, and the category threshold, determine whether a category is applied. When evaluating why a categorization is applied, it is important to understand these concepts and the relationships between them. For more information, see How Rules Affect Categorization.

You may want to examine a particular categorization for many reasons, including:

  • You want to see why a category was applied or not applied to a particular resource.
  • You may be working with category thresholds and rule strengths to ensure your system is working as desired.
  • You may be testing a new category.
  • You may be introducing new rules into the system and need to ensure that your system is still working as desired.

The Get-QClassificationAnalysis cmdlet returns results in a structured format for you to analyze. The following information is included:

Information available for examining a resource’s categorization
tag Description
<Taxonomy> The name of the taxonomy that contains the categories.
<Category Name> Identifies the category that is being analyzed.
<Category ID> Provides the ID so you can easily run more cmdlets on the category if needed, for example to modify the threshold or change a rule weight.
<Applied> Indicates whether the category was applied after all rules were taken into account. Only categories with at least one rule match are listed in the results.
<Threshold> The threshold value currently set on the category. The sum of the (rule weights x match strength) for all matching rules must equal or exceed this number for a category to be applied. For further explanation, see How Rules Affect Categorization.
<Rule> Identifies the rule that is being analyzed. All rules associated with category are listed, regardless of whether they match.
<Name> The name of the rule.
<Match> Indicates whether the rule matched.
<Strength> The strength of the rule match. This is used to calculate the value this rule contributes towards meeting the threshold, along with the rule weight. Match strength is set in the rule xml. For more information, see Writing XML Rules.
<Weight> The weight is part of the association between a rule and a category, and is used to calculate the value this rule contributes towards meeting the threshold, along with the match strength. You can change the rule weight using the Add-QRuleToCategory cmdlet. For more information, see Associating Rules to Categories.
<Strength_x_Weight> The value of this rule. The strength x weight is applied towards the threshold. If the sum of all rule values (for the rules associated with the category) exceeds the threshold, the category is applied.

To examine the categorization of a resource:

  1. Run the Get-QClassificationAnalysis cmdlet with the following mandatory parameters:
    1. ServerAddress
      Provide the name of the computer hosting the Data Governance server, and the port. Enter in the form computername:port number. The default port is 8723.
    2. ResourcePath
      The full file path to the resource, including the computer name.
  2. Examine the output.
    If your results are lengthy, you may find it helpful to store the output in a file, using the PowerShell command > C:\path\filename.txt.

Viewing Categorized Resources

Using the taxonomies available in your organization, resources can be automatically or manually categorized. You may need to examine the results of the automated system, or the way resources are being manually categorized by your users in order to evaluate the success of the system, and to help identify areas of improvement. For example, you may notice that a particular category is applied less often than you expect by the automated system. This may indicate that the rules attached to the categories in that taxonomy need some tweaking. You must be a Classification Analyst or Compliance Officer to use this view.

You may wish to view the categorization details of a resource to better understand the content. You may also wish to edit the categorization; for example, allowing the automated system to take over categorization if it has been overridden. Remember that business owners can manage the categorizations of their owned resources, and you should communicate with them as necessary to prevent your changes from being undone, or to gain further insight into a particular resource if required. If a manually categorized resource is owned, you can use the categorization details to understand how the automated system would categorize the resource. You may find it helpful to see how the system would work without disturbing the business owner’s categorization decisions.

NOTE! The Categorized Resources view shows all scanned resources that have at least one category. You can only view and edit categories for governed resources. Resources can be placed under governance using the Manager by your Data Governance administrator. For more information, see Placing a Resource Under Governance of your Data Governance User Guide.
NOTE! Resources that are encrypted or password protected cannot be categorized. You can identify these resources in the View Provider log. A message will indicate that the file could not be filtered. For more information on retrieving logs, see Using the Classification Logs.
NOTE! Only SharePoint documents in libraries will be automatically categorized. Also, the contents of a list are shown as “list item”, regardless of the type of item in SharePoint.

The Categorized Resources view allows you to start by choosing a category, and then filter down your results to get the information you need. You can use the following filters:

Available filters
Filter Description
Include resources categorized with subcategories In addition to the selected category, all subcategories and their nested categories will be included. For example, to display all resources categorized with a particular taxonomy, select the top level node, and then select this filter.
Host This narrows your results down to just those located on a particular server or SharePoint farm. You must enter the full, matching name. This is a case sensitive filter. You may find it helpful to first locate a resource on this host in the returned results, so you can see the exact name.
Categorized by This narrows your results down to resources categorized by a particular employee. You must enter this exactly as the employee is listed in the Categorized By column. For example, if the employee was shown as Smith, John you should enter exactly that, including matching the case. For this filter to work, make sure that Manual is selected in the categorization type, or that ”Any” is selected.
Categorization type This narrows your results down to the way the resource was categorized. If you want to see both types, select “Any”. To view inherited categorization, select “Manual”.

To view and filter categorized resources in your environment

  1. Click Governed Data | View Resources | Categorized Resources.
  2. If necessary, expand the Select Your Filters section.
  3. Click Select category and in the Select Category dialog box, choose a taxonomy and category, then click Select category.
  4. Enable Include resources categorized with subcategories of this category if desired.
  5. Select any other filters.
  6. Click Search.
    You can work with the filters interactively. Add and remove filters, and click Search to apply your changes.

To view or edit a resource’s categorization from the Categorized Resources view

  1. Locate a resource of interest in the view.
  2. Click the link on the resource name.
    If the resource is not governed, you cannot edit its categorization.
  3. Click the Categorization tab.
  4. View the categorization of the resource across all taxonomies, including whether any taxonomies are currently ignored.
  5. If you need to modify the categorization, follow the instructions in Working With Manually Categorized Resources or Working With Automatically Categorized Resources.

Making a Category Available to the Automated System

Before you put a category into production, you should ensure that it is properly configured and has the expected results. See Testing and Reviewing Automated Classification and Taxonomy Deployment Considerations for more details. Once you make it available for automated classification, any scans that start after this time will include the category. For any content that is being watched with changes, categorizations are possible after the next change.

NOTE! Categories must be both published and automated in order for automated categorization to occur.

To make a category available for automation

  1. Select Governed Data | Taxonomy Manager | Categorizations.
  2. Edit the category and ensure it is published.
  3. Select Allow this category to be used by the automated system.

To make a category available for automation using PowerShell

  1. Run the Set-QCategory cmdlet with the following parameters:
    1. ServerAddress
      Provide the name of the computer hosting the Data Governance server, and the port. Enter in the form computername:port number. The default port is 8723.
    2. CategoryID
      For information on locating a category ID, see Finding a Taxonomy or Category ID using PowerShell.
    3. Set IsPublished to $true.
    4. Set IsAutomaticClassificationEnabled to $true.

Classifying Resources

Once documents are categorized, the system can classify them. Categorization is based on the content of a resource, while classification is based on the risk the resource poses. By classifying your unstructured data, you can begin to understand the security risks posed, and put policies in place to control these resources.

相关文档

The document was helpful.

选择评级

I easily found the information I needed.

选择评级