# Sample Datasets

The following datasets can be used to test Nightfall's advanced AI-based detection capabilities. The data has been fully de-identified and can be used to test any data loss prevention (DLP) platform.

### PII Samples

This dataset showcases Nightfall’s ability to detect Personally Identifiable Information (PII) with exceptional precision and minimal noise across text, spreadsheets, and screenshots. Samples include names, U.S. social security numbers, driver's license numbers, and more. See [Image ID Samples](#image-id-samples) for image samples of driver licenses and other ID types.&#x20;

{% code title="pii clipboard sample" overflow="wrap" %}

```
Hi Support - My name is Julie Walsh. I tried to purchase a life insurance policy online yesterday however the site said, "an unexpected error occurred." I tried to pay with a credit card. My DOB is 02-10-97 and SSN is 523-23-6145. Could you take a look on your end?
```

{% endcode %}

{% file src="/files/plXZkX3Te3MOTvKbF9kW" %}

{% file src="/files/HB8qk5zgfAqzMdq1Dqqr" %}

{% file src="/files/P1e5uXn9vwh8vtIJvu3Y" %}

{% file src="/files/Xorbx6btWhah4FZBTz6h" %}
This .ZIP contains the ***positive samples*** shown above, along with additional examples and ***negative lookalike samples*** for testing.
{% endfile %}

### PCI / Banking Samples

This sample dataset demonstrates Nightfall's ability to detect sensitive banking and payment information with high precision and low noise in text, spreadsheets, and screen grabs. Samples include positive and negative examples of credit card numbers, routing numbers, IBAN codes, and SWIFT codes.

{% code title="pci clipboard sample" overflow="wrap" %}

```
Hi Support - This is Julie Walsh. I tried to purchase an electric bike using my credit card 6771-8979-6102-7961. The app is telling me the card was declined. Could you take a look on your end?
```

{% endcode %}

{% file src="/files/QutBo2q1DKeZNcpJzq0S" %}

{% file src="/files/zSUNc13BXCQCJCM3UINP" %}

{% file src="/files/KeL4FvoSzRyUfVGA5SIJ" %}

{% file src="/files/htnQE7p1vNxaxk2GFiOJ" %}
This .ZIP contains the ***positive samples*** shown above, along with additional examples and ***negative lookalike samples*** for testing.
{% endfile %}

### API Keys

Nightfall AI's fine-tuned API key detection LLM detects secrets with high precision and dramatically reduces false positives.

{% code title="api key clipboard sample" overflow="wrap" %}

```
import stripe stripe.api_key = "sk_live_4eC39HqLyjWDarjtT1zdp7dcTYooMQauvdEDq54NiTphI7jx"
stripe.Charge.create( amount=2000, currency="usd", source="tok_amex", # obtained with Stripe.js description="Charge for jenny.rosen@example.com" )
```

{% endcode %}

{% file src="/files/NWV6yC2ggXrnhOQEN239" %}

{% file src="/files/P3lsxvvnIDOM5DGmepYN" %}

{% file src="/files/kwtBJL364bwswXWAYwMR" %}

{% file src="/files/8jG9erM1XHRWuB5S5csc" %}
This .ZIP contains the ***positive samples*** shown above, along with additional examples and ***negative lookalike samples*** for testing.
{% endfile %}

*Testing note:* If a key status is marked as ‘Active’, please rotate the key immediately. Not all vendors provide an "Inactive" response code. In these cases or if the vendor service is offline, the finding status will be marked ‘Unverified’.

### Password Samples

Nightfall AI detects passwords shared in conversational text and code.&#x20;

{% code title="password clipboard sample" overflow="wrap" %}

```
Alex, Here are the credentials to get onto the new training platform. 
loginid=fitnessFreak99 passphrase=Activ3Life22!
```

{% endcode %}

{% file src="/files/bduYLavHCj0N9M6W5KlZ" %}

{% file src="/files/eWEDBsrIC7JMipJsKVaD" %}

{% file src="/files/TzwsftcBrhOhBte8qZ6J" %}
This .ZIP contains the ***positive samples*** shown above, along with additional examples and ***negative lookalike samples*** for testing.
{% endfile %}

### PHI Samples

Nightfall’s PHI model surpasses traditional entity-based detectors by combining multiple signals — including PII and medical indicators — and analyzing their relationships and context to ensure only patient health–related content is flagged.

{% code title="phi clipboard sample" overflow="wrap" %}

```
The patient, Anthony Smith (DOB 05/10/1993), presents with a sustained elevated heart rate. 

The patient has a past medical history of atrial fibrillation. 
Attending Physician: Harwood, Andrew MD 
```

{% endcode %}

{% file src="/files/OrzL6X5F7vHOUwXbePzx" %}

{% file src="/files/GkHwXr1Dm8UpZSzhhBtq" %}

{% file src="/files/RAX48TNoPP9t81tpzDNA" %}

{% file src="/files/z6hJPfIY55oy24mjtvNJ" %}

{% file src="/files/tv763N9SI40bgxgSwB4n" %}
This .ZIP contains the ***positive PHI samples*** shown above, along with additional examples and ***negative lookalike samples*** for testing.
{% endfile %}

### Crypto Key Samples

This sample dataset demonstrates Nightfall's ability to detect cryptographic keys.&#x20;

{% code title="cryptographic key clipboard sample" overflow="wrap" %}

```
-----BEGIN EC PRIVATE KEY-----
MHcCAQEEIGNDB1AYI5yJ4ysmzfnMzAe/gFJup+pY0qt7U7SaQiK/oAoGCCqGSM49
AwEHoUQDQgAEN+yEGcEGA6x31zryD4HUcbHhNVS8nkzhlNR4NWJN2HsCzjBvpq0j
e8CV5iMmLaaQA5BFng0ZbGUPOgLNHhVq1g==
-----END EC PRIVATE KEY-----
```

{% endcode %}

{% file src="/files/863fOwq1HfFrauVb6U6d" %}

{% file src="/files/Dp9DWmvIXLFYMSRpb0zt" %}

{% file src="/files/JMRWnfB8ihDU5guE1WDr" %}
This .ZIP contains the ***positive samples*** shown above, along with additional examples and ***negative lookalike samples*** for testing.
{% endfile %}

### Image ID Samples

Nightfall’s computer vision (CV) transformer model outperforms legacy Optical Character Recognition (OCR) text scanning to identify driver’s licenses, passports, credit cards, and US social security cards even though images may be degraded (rotated, glossy, low contrast, blurry, skewed, or cropped).<br>

<figure><img src="/files/MSHoTfmHnXOBKYXSoMJt" alt="" width="375"><figcaption></figcaption></figure>

{% file src="/files/vVMpvrfBM2jNEl3fQjmY" %}

{% file src="/files/9ZVBoHkePHc6ML9Uv8sa" %}

{% file src="/files/HOlVIhSbkMP56dtAyYjq" %}

{% file src="/files/bqIBDfdoBiVYzEikrZtE" %}

{% file src="/files/9xRz3HJ2q6szoS2LoUPu" %}
This .ZIP contains the ***positive samples*** shown above, along with additional examples.
{% endfile %}

### File Classifier Examples

The File Classifier goes beyond entity detection by analyzing a document’s purpose, structure, format, and contextual signals to accurately identify intellectual property, proprietary source code, and other sensitive internal records. These examples demonstrate how the classifier protects confidential materials—including internal source code, legal and regulatory drafts, HR documents, and strategic planning files—across a wide range of real-world scenarios.

{% file src="/files/RvFXLIf3gNpyb1aabWpr" %}

{% file src="/files/4huX6Jt7w2eocPdpcrIP" %}

{% file src="/files/JsCtpNyJnHufMGTr7eF6" %}

{% file src="/files/xQraQF596mHn8THMyr5S" %}

{% file src="/files/kUxxvpx8VE7TMravDumm" %}

{% file src="/files/n1Quj4Oufrt4ub4HIy2m" %}

{% file src="/files/WbBhxkBzBfwOIgVc4WOw" %}

{% file src="/files/z3HJsGQzfZgfdfnJUacW" %}

{% file src="/files/MuMxXLZc2MB2DBG4d2L0" %}

{% file src="/files/nzt4JTTFtlLWUHIpqlNa" %}

{% file src="/files/8oUDywOlzAg3xodOR0Ua" %}

{% file src="/files/WgPLROA9Duct2EDUADe5" %}

{% file src="/files/0vCGkqAsH5unMGsvx6d9" %}

{% file src="/files/EERkFQNsV2FSrsd6MtXc" %}

{% file src="/files/uXgklTxeKeM0R9Cm7zPr" %}

{% file src="/files/5q0zkVectJhYO1gcVGvo" %}

{% file src="/files/JJUvo3AwcBgdEsg4Ksd6" %}

{% file src="/files/6oUccoaypOB8bp1SshO9" %}

{% file src="/files/pnuBTqVI79vdDqhDkGhb" %}

### All Sample Datasets

This ZIP file includes all **positive** and ***negative lookalike samples*** across PII, PCI, Banking, PHI, credentials, and image-based datasets. It’s designed to help you evaluate Nightfall’s detection precision and compare performance in your DLP proof of value (POV) testing.

{% file src="/files/ALgApN4vWB2QwV7RlFQo" %}


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://help.nightfall.ai/nightfall_policy_templates/sample_data.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
