Nightfall Sample Data Sets

The following datasets can be used to test Nightfall's advanced AI-based detection capabilities. The data has been fully de-identified and can be used to test any data loss prevention (DLP) platform.

Protected Health Information (PHI)

Nightfall's genAI-based model outperforms traditional entity-based detectors by detecting PHI entities in patient healthcare-related documents.

The patient, (DOB ), presents with a sustained elevated heart rate. The patient has a past medical history of . Attending Physician: Harwood, Andrew MD

API Keys

Nightfall AI's fine-tuned API key detection LLM detects secrets with high precision and dramatically reduces false positives.

import stripe stripe.api_key = ""

stripe.Charge.create( amount=2000, currency="usd", source="tok_amex", # obtained with Stripe.js description="Charge for jenny.rosen@example.com" )

Testing note: If a key status is marked as ‘Active’, please rotate the key immediately. Not all vendors provide an "Inactive" response code. In these cases or if the vendor service is offline, the finding status will be marked ‘Unverified’.

Passwords

Nightfall AI detects passwords shared in conversational text and code.

Alex, Here credentials to get onto the new training platform. loginid=fitnessFreak99 passphrase=

Cryptographic Keys

This sample dataset demonstrates Nightfall's ability to detect cryptographic keys.

PII (Personally Identifiable Information)

This sample dataset demonstrates Nightfall's ability to detect PII with high precision and low noise in text, spreadsheets, and screen grabs. Samples include names, U.S. social security numbers, and driver's licenses.

Hi Support - My name is . I tried to purchase a life insurance policy online yesterday however the site said, "an unexpected error occurred." I tried to pay with a credit card. My DOB is and SSN is . Could you take a look on your end?

Financial - Credit Card Numbers and Banking Info

This sample dataset demonstrates Nightfall's ability to detect sensitive banking and payment information with high precision and low noise in text, spreadsheets, and screen grabs. Samples include positive and negative examples of credit card numbers, routing numbers, IBAN codes, and SWIFT codes.

Hi Support - This is . I tried to purchase an electric bike using my credit card . The app is telling me the card was declined. Could you take a look on your end?

Image ID Documents (ID Cards, Passports, Credit Cards, etc as images)

Nightfall’s computer vision (CV) transformer model outperforms legacy Optical Character Recognition (OCR) text scanning to identify driver’s licenses, passports, credit cards, and US social security cards even though images may be degraded (rotated, glossy, low contrast, blurry, skewed, or cropped).

Last updated