> ## Documentation Index
> Fetch the complete documentation index at: https://docs.okrapdf.com/llms.txt
> Use this file to discover all available pages before exploring further.

# How It Works

> A workbench that helps you understand, verify, and manage PDF data

## The Simple Version

**Upload a PDF → Understand, verify, and manage the data inside.**

OkraPDF is a workbench that helps you extract structured data from documents, verify its accuracy, and put it to work.

```mermaid theme={null}
flowchart LR
    A[Your PDF] --> B[OkraPDF]
    B --> C[Understand]
    B --> D[Verify]
    B --> E[Manage]

    subgraph C[Understand]
        C1[Search]
        C2[Chat]
    end

    subgraph D[Verify]
        D1[Review]
        D2[Correct]
    end

    subgraph E[Manage]
        E1[Tables]
        E2[Charts]
        E3[Images]
        E4[Export]
    end

    style A fill:#1e293b,color:#f8fafc,stroke:#475569
    style B fill:#FFE01B,color:#1e293b,stroke:#E6C800
    style C fill:#16a34a,color:#fff,stroke:#15803d
    style D fill:#7c3aed,color:#fff,stroke:#6d28d9
    style E fill:#0369a1,color:#fff,stroke:#0284c7
```

## What You Get

<CardGroup cols={3}>
  <Card title="Understand" icon="magnifying-glass">
    Search across all pages. Chat with AI that's grounded in your document.
  </Card>

  <Card title="Verify" icon="check-double">
    Review extracted data side-by-side with the original. Correct errors with AI assistance.
  </Card>

  <Card title="Manage" icon="table">
    Tables, charts, images, footnotes — detected, structured, and exportable.
  </Card>
</CardGroup>

## Behind the Scenes

When you upload a PDF, four things happen automatically:

<Steps>
  <Step title="Secure Upload">
    Your file goes directly to encrypted cloud storage. We never store files on local servers.
  </Step>

  <Step title="Text Extraction">
    We use [Google's Document AI](https://cloud.google.com/document-ai/docs/processors-list#processor_doc-ocr) to extract text with high accuracy — even from scanned documents and complex layouts.
  </Step>

  <Step title="Entity Detection">
    Our AI scans each page for tables, charts, images, footnotes, and signatures.
  </Step>

  <Step title="Workbench Ready">
    Your document is ready for search, chat, review, and export.
  </Step>
</Steps>

**Processing time:** 10-30 seconds for most documents.

## Review & Verify

In the review page, you can verify extracted data side-by-side with the original document. AI assists with corrections without hallucinating content.

```mermaid theme={null}
flowchart LR
    A[Extracted Data] --> B[AI Review]
    C[Original Page] --> B
    B --> D[Verified Data]

    style A fill:#1e293b,color:#f8fafc,stroke:#475569
    style B fill:#FFE01B,color:#1e293b,stroke:#E6C800
    style C fill:#1e293b,color:#f8fafc,stroke:#475569
    style D fill:#16a34a,color:#fff,stroke:#15803d
```

**How it works:**

* View extracted data alongside the original page image
* AI suggests corrections for punctuation and character errors
* You approve changes — the AI is constrained to correct, not rewrite

## AI Chat

When you chat with your document, we create a private workspace with all your extracted data:

```mermaid theme={null}
flowchart LR
    Q[Ask: What's the revenue in Q3?] --> AI

    subgraph Workspace["Your Private Workspace"]
        AI[AI Agent]
        PDF[Original PDF]
        T[Tables]
        C[Charts]
        TXT[Full Text]
    end

    AI --> PDF
    AI --> T
    AI --> C
    AI --> TXT
    AI --> R[Answer: $4.2M per Table 3 on page 12]

    style Q fill:#1e293b,color:#f8fafc,stroke:#475569
    style R fill:#16a34a,color:#fff,stroke:#15803d
    style AI fill:#FFE01B,color:#1e293b,stroke:#E6C800
    style Workspace fill:#1e3a5f,color:#f8fafc,stroke:#0ea5e9
    style PDF fill:#0ea5e9,color:#fff,stroke:#0284c7
    style T fill:#0ea5e9,color:#fff,stroke:#0284c7
    style C fill:#0ea5e9,color:#fff,stroke:#0284c7
    style TXT fill:#0ea5e9,color:#fff,stroke:#0284c7
```

**What's in your workspace:**

* Your original PDF
* All extracted tables, charts, and images
* Full document text
* Isolated from other users

## Security

| What We Do               | Why It Matters                                              |
| ------------------------ | ----------------------------------------------------------- |
| Encrypt files at rest    | Your documents are protected even if storage is compromised |
| Encrypt all transfers    | No one can intercept your uploads or downloads              |
| Isolate AI sessions      | Your data never mixes with other users' data                |
| No training on your data | Your documents are never used to train AI models            |
