Skip to main content
Limina supports scanning TXT files for PII and creating de-identified or redacted copies. Limina’s supported entity types function across each file type, with localized variants of different PII (Personally Identifiable Information) entities, PHI (Protected Health Information) entities, and PCI (Payment Card Industry) entities being detected. Our Supported Languages and Supported Entity Types page provides a more detailed look.
If you’d like to try it yourself, please visit our free interactive web demo. No code or account is necessary.

How TXT Files Are Processed

TXT files are processed by simply reading in the contents of the TXT files verbatim and passing it through Limina’s text module. The resulting file will contain the labelled and redacted version of contents of the original.

Constraints

  • Limina currently only supports utf-8 encoding for text files.

Support Matrix

CPU ContainerGPU ContainerCommunity APIProfessional API
SupportedYesYesUp to 250 KiBNo

Sample Request

{
  "file": {
    "data": "<file_content_base64>",
    "content_type": "text/plain"
  },
  "entity_detection": {
    "return_entity": true
  }
}

Sample Response

Response
{
  "processed_file": "Base64 Encoded File Content of the Redacted File",
  "processed_text": "string",
  "entities": "List[Entity]",
  "entities_present": true,
  "languages_detected": {"lang_1": 0.67, "lang_2": 0.74}
}