Skip to main content
Limina supports scanning Microsoft PowerPoint PPT and PPTX files for PII and creating de-identified or redacted copies. Limina’s supported entity types function across each file type, with localized variants of different PII (Personally Identifiable Information) entities, PHI (Protected Health Information) entities, and PCI (Payment Card Industry) entities being detected. Our Supported Languages and Supported Entity Types page provides a more detailed look.
If you’d like to try it yourself, please visit our free interactive web demo. No code or account is necessary.

How PPTX Files Are Processed

PPTX files are processed by extracting each element and processing according to the table below. The de-identified or redacted file is created by according to the behaviour specified in the table.
Property TypeDetailsBehaviour
Core propertiesAuthor, Category, Comments, Content Status, Identifier, Keywords, Language, Last Modified By, Subject, Title, VersionRedact
Speaker notesAny content in the speakers notesRedact
TablesTable objects with text and imagesRedact
ImagesThe Images page provides a more detailed look at Image processingRedact, unsupported image types are removed
Text boxesMain slide contentRedact
Embedded linksHyperlinks to internet pages or documentsRemove
External elementsTables and charts embedded from another document or file, such as an Excel chartRemove external file, redact cached values
Embedded audio & videoVideos and audio clipsRemove
Review commentsComments from document reviewsRedact
Shape objectsShapes containing textRedact
You can configure the OCR System by setting it as an Environment Variable or sending it in the request object. Check out our OCR Guide to further understand the OCR modes and their usage.

How PPT Files Are Processed

PPT files are processed by converting into PPTX files, followed the process described above and then converting back to PPT files.

Constraints

  • If a piece of PII text has more than one style (different fonts, font sizes, underline etc.), the redaction marker will use the first style.
  • Charts in PPTX files will have all of their numerical data set to 0.
  • We recommend using Microsoft PowerPoint to open the processed PPT/PPTX files. Other editors may not give ideal results.

Support Matrix

CPU ContainerGPU ContainerCommunity APIProfessional API
SupportedYesYesUp to 10 MiBNo

Sample Request

{
  "file": {
    "data": "<file_content_base64>",
    "content_type": "application/vnd.openxmlformats-officedocument.presentationml.presentation"
  },
  "entity_detection": {
    "return_entity": true
  }
}

Sample Response

Response
{
  "processed_file": "Base64 Encoded File Content of the Redacted File",
  "processed_text": "string",
  "entities": "List[Entity]",
  "entities_present": true,
  "languages_detected": {"lang_1": 0.67, "lang_2": 0.74}
}