Built-in Data Identifiers
Secure Access provides a wide selection of built-in data identifiers you can choose to incorporate into DLP policy to classify a wide range of sensitive data, both structured and unstructured.
The built-in identifier can classify specific data values based on pattern matching, bloom filters, and dictionaries incorporating proximity terms.
The ML (machine learning) built-in data identifiers are based on a LLM (Large Language Model) that was trained to classify unstructured sensitive data based on its true context. The ML data identifiers work for DLP-supported file types (see Supported File and Form Types); they do not work for form data. The ML built-in data identifiers are:
- Bank Statement
- Consulting Agreement
- CV/Resume
- Employment Agreement
- IRS Forms
- Medical Power of Attorney
- Mergers And Acquisitions
- NDA
- Partnership Agreement
- Source Code (ML)
- Stock
- US Patents
Built-in identifiers are not directly incorporated into DLP rules; you must first select and incorporate them into Data Classifications which you then apply to DLP rules. (See Manage Data Classifications ).
The built-in data identifiers are available as an Excel table here. The table is updated frequently, so be sure to download the most recent version.