|
1 | | -# PDR AI - Professional Document Reader AI |
| 1 | +# Launchstack - Professional Document Reader AI |
2 | 2 |
|
3 | | -PDR AI is a Next.js platform for role-based document management, AI-assisted Q&A, and predictive document analysis. It combines document upload, optional OCR, embeddings, and retrieval to help teams find gaps and act faster. |
| 3 | +Launchstack is a Next.js platform for role-based document management, AI-assisted Q&A, and predictive document analysis. It combines document upload, optional OCR, embeddings, and retrieval to help teams find gaps and act faster. |
4 | 4 |
|
5 | 5 | ## Core Features |
6 | 6 |
|
7 | 7 | - Clerk-based Employer/Employee authentication with role-aware middleware. |
8 | 8 | - Document upload pipeline with optional OCR for scanned PDFs. |
9 | 9 | - PostgreSQL + pgvector semantic retrieval for RAG workflows. |
10 | 10 | - AI chat and predictive document analysis over uploaded content. |
| 11 | +- Agent guardrails with PII filtering, grounding checks, and confidence gating. |
| 12 | +- Supervisor agent that validates outputs against domain-specific rubrics. |
| 13 | +- Marketing pipeline with content generation for Reddit, X, LinkedIn, and Bluesky. |
11 | 14 | - Optional web-enriched analysis with Tavily. |
12 | 15 | - Optional reliability/observability via Inngest and LangSmith. |
13 | 16 |
|
| 17 | +## Predictive Analysis — Supported Document Types |
| 18 | + |
| 19 | +Launchstack runs domain-specific analysis tailored to your document type: |
| 20 | + |
| 21 | +| Type | What It Detects | |
| 22 | +|------|----------------| |
| 23 | +| **Contract** | Missing exhibits, schedules, addendums, and supporting agreements | |
| 24 | +| **Financial** | Missing balance sheets, audit reports, income statements | |
| 25 | +| **Technical** | Missing specifications, manuals, diagrams, deliverables | |
| 26 | +| **Compliance** | Missing regulatory filings, certifications, policy documents | |
| 27 | +| **Educational** | Missing syllabi, handouts, readings, linked resources | |
| 28 | +| **HR** | Missing policies, forms, benefits materials, handbooks | |
| 29 | +| **Research** | Missing cited papers, datasets, supplementary materials | |
| 30 | +| **General** | Missing cross-referenced documents and attachments in any other document | |
| 31 | + |
| 32 | +Each analysis type also extracts insights (deadlines, action items, resources, caveats) and runs chain-of-verification on high-priority predictions. |
| 33 | + |
| 34 | +## Importing External Knowledge |
| 35 | + |
| 36 | +Launchstack can ingest content exported from third-party tools. No API keys or OAuth setup are required: export your data, upload the files, and the ingestion pipeline handles the rest. |
| 37 | + |
| 38 | +### Supported Export Formats |
| 39 | + |
| 40 | +| Source | Export Method | Resulting Format | Launchstack Adapter | |
| 41 | +|--------|-------------|-----------------|----------------| |
| 42 | +| **Notion** | Settings > Export > Markdown & CSV | `.md`, `.csv` (ZIP) | TextAdapter, SpreadsheetAdapter | |
| 43 | +| **Notion** | Page > Export > HTML | `.html` | HtmlAdapter | |
| 44 | +| **Google Docs** | File > Download > Microsoft Word | `.docx` | DocxAdapter | |
| 45 | +| **Google Sheets** | File > Download > CSV or Excel | `.csv`, `.xlsx` | SpreadsheetAdapter | |
| 46 | +| **Google Drive** | Google Takeout (takeout.google.com) | `.docx` (ZIP) | DocxAdapter | |
| 47 | +| **Slack** | Workspace Settings > Import/Export Data > Export | `.json` (ZIP) | JsonExportAdapter | |
| 48 | +| **GitHub** | Code > Download ZIP | `.md`, `.txt` (ZIP) | TextAdapter | |
| 49 | +| **GitHub** | `gh issue list --json ...` | `.json` | JsonExportAdapter | |
| 50 | +| **GitHub** | `gh pr list --json ...` | `.json` | JsonExportAdapter | |
| 51 | + |
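The table above can be read as a lookup from file extension to adapter. The following is a minimal sketch of that lookup, not the project's actual routing code: the adapter names come from the table, while `ADAPTER_BY_EXTENSION` and `pickAdapter` are hypothetical names introduced for illustration. ZIP unpacking is left out; the sketch assumes each file inside an archive is routed individually.

```typescript
// Hypothetical lookup mirroring the "Supported Export Formats" table.
type AdapterName =
  | "TextAdapter"
  | "SpreadsheetAdapter"
  | "HtmlAdapter"
  | "DocxAdapter"
  | "JsonExportAdapter";

const ADAPTER_BY_EXTENSION: Record<string, AdapterName> = {
  ".md": "TextAdapter",
  ".txt": "TextAdapter",
  ".csv": "SpreadsheetAdapter",
  ".xlsx": "SpreadsheetAdapter",
  ".html": "HtmlAdapter",
  ".docx": "DocxAdapter",
  ".json": "JsonExportAdapter",
};

// Returns the adapter for a file name, or undefined for unsupported extensions.
function pickAdapter(fileName: string): AdapterName | undefined {
  const dot = fileName.lastIndexOf(".");
  if (dot <= 0) return undefined; // no extension, or a dotfile such as ".gitignore"
  return ADAPTER_BY_EXTENSION[fileName.slice(dot).toLowerCase()];
}
```

For example, `pickAdapter("notes.md")` resolves to `TextAdapter`, and an unsupported file such as `photo.png` resolves to `undefined`.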
| 52 | +### How to Export |
| 53 | + |
| 54 | +**Notion** |
| 55 | +1. Open your Notion workspace. |
| 56 | +2. Click the **...** menu on a page, or go to **Settings & members > Export** for a full workspace export. |
| 57 | +3. Select **Markdown & CSV** as the format and check **Include subpages** if needed. |
| 58 | +4. Download the ZIP and upload it directly to Launchstack. |
| 59 | + |
| 60 | +**Google Docs / Sheets** |
| 61 | +1. Open the document in Google Docs or Sheets. |
| 62 | +2. Go to **File > Download** and choose **Microsoft Word (.docx)** for a document, or **Comma-separated values (.csv)** or **Microsoft Excel (.xlsx)** for a spreadsheet. |
| 63 | +3. Upload the downloaded file. For bulk exports, use [Google Takeout](https://takeout.google.com) to export your Drive as a ZIP. |
| 64 | + |
| 65 | +**Slack** |
| 66 | +1. Go to **Workspace Settings > Import/Export Data > Export**. |
| 67 | +2. Choose a date range and start the export. |
| 68 | +3. Download the ZIP and upload it to Launchstack. Each channel's messages will be ingested as a separate document. |
| 69 | + |
| 70 | +**GitHub** |
| 71 | +1. **Repo docs**: Click **Code > Download ZIP** on any GitHub repository. Upload the ZIP — all Markdown and text files will be ingested. |
| 72 | +2. **Issues**: Install the [GitHub CLI](https://cli.github.com/) and run: |
| 73 | + ```bash |
| 74 | + gh issue list --state all --limit 1000 --json number,title,body,state,labels,author,createdAt,closedAt,comments > issues.json |
| 75 | + ``` |
| 76 | + Upload the resulting `issues.json` file. |
| 77 | +3. **Pull requests**: Run: |
| 78 | + ```bash |
| 79 | + gh pr list --state all --limit 1000 --json number,title,body,state,labels,author,createdAt,mergedAt,comments > prs.json |
| 80 | + ``` |
| 81 | + Upload the resulting `prs.json` file. |
| 82 | + |
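To illustrate what an entry in `issues.json` might look like once flattened for ingestion, here is a rough sketch. It assumes each entry carries the fields requested by the `--json` flag above, with `labels` as objects that have a `name` and `author` as an object that has a `login`. The `ExportedIssue` type and `issueToText` helper are hypothetical and are not part of `JsonExportAdapter`.

```typescript
// Assumed shape of one entry in issues.json; only the fields used below are typed.
interface ExportedIssue {
  number: number;
  title: string;
  body: string;
  state: string;
  labels: { name: string }[];
  author: { login: string };
  createdAt: string;
  comments: { author: { login: string }; body: string }[];
}

// Hypothetical helper: flattens one issue and its comments into plain text.
function issueToText(issue: ExportedIssue): string {
  const labels = issue.labels.map((l) => l.name).join(", ") || "none";
  const header = [
    `Issue #${issue.number}: ${issue.title}`,
    `State: ${issue.state} | Author: ${issue.author.login} | Created: ${issue.createdAt}`,
    `Labels: ${labels}`,
  ].join("\n");
  const comments = issue.comments.map(
    (c) => `Comment by ${c.author.login}:\n${c.body}`,
  );
  // Header, body, and each comment become blank-line-separated paragraphs.
  return [header, issue.body].concat(comments).join("\n\n");
}
```

Keeping the issue number, state, and labels in the text is one way to make them searchable after chunking; a real adapter may store them as metadata instead.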
| 83 | +All uploaded content flows through the standard ingestion pipeline (chunking, embedding, RAG indexing) and becomes searchable alongside your other documents. |
| 84 | + |
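The chunking stage named above can be pictured with a simple fixed-size splitter. This is only a sketch under stated assumptions: the source does not describe the chunking strategy, so the character-based windows, the overlap, the default sizes, and the `chunkText` name are all hypothetical. Embedding and indexing are not shown.

```typescript
// Hypothetical chunker: fixed-size character windows that overlap,
// so text cut at a boundary still appears whole in a neighboring chunk.
function chunkText(text: string, chunkSize = 1000, overlap = 200): string[] {
  if (chunkSize <= 0 || overlap < 0 || overlap >= chunkSize) {
    throw new Error("require chunkSize > 0 and 0 <= overlap < chunkSize");
  }
  const chunks: string[] = [];
  const step = chunkSize - overlap; // how far each window advances
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // this window reached the end
  }
  return chunks;
}
```

With `chunkSize = 4` and `overlap = 1`, the string `"abcdefghij"` splits into `"abcd"`, `"defg"`, and `"ghij"`; each chunk repeats the last character of the previous one.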
14 | 85 | ## Architecture |
15 | 86 |
|
16 | | -PDR AI follows a three-layer modular architecture: |
| 87 | +Launchstack follows a three-layer modular architecture: |
17 | 88 |
|
18 | 89 | ```mermaid |
19 | 90 | block-beta |
@@ -137,6 +208,10 @@ Optional integrations: |
137 | 208 | - `LANDING_AI_API_KEY`, `DATALAB_API_KEY` |
138 | 209 | - `LANGCHAIN_TRACING_V2`, `LANGCHAIN_API_KEY`, `LANGCHAIN_PROJECT` |
139 | 210 | - `DEBUG_PERF` (`1` or `true`) to enable dev perf logs for middleware and key auth/dashboard APIs |
| 211 | +- `SIDECAR_URL` |
| 212 | +- `NEO4J_URI` |
| 213 | +- `NEO4J_USERNAME` |
| 214 | +- `NEO4J_PASSWORD` |
140 | 215 |
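Because these integrations are optional, a startup check can help catch a partially filled configuration. The sketch below assumes that the three `NEO4J_*` variables are only useful when set together and that `SIDECAR_URL` is independent of them; neither assumption is stated in the source. `readOptionalIntegrations` and its types are hypothetical.

```typescript
type Env = Record<string, string | undefined>;

interface OptionalIntegrations {
  sidecarUrl?: string;
  neo4j?: { uri: string; username: string; password: string };
}

// Hypothetical helper: collects optional settings and rejects a half-configured Neo4j block.
function readOptionalIntegrations(env: Env): OptionalIntegrations {
  const result: OptionalIntegrations = {};
  const sidecarUrl = env["SIDECAR_URL"];
  if (sidecarUrl) result.sidecarUrl = sidecarUrl;

  const uri = env["NEO4J_URI"];
  const username = env["NEO4J_USERNAME"];
  const password = env["NEO4J_PASSWORD"];
  if (uri && username && password) {
    result.neo4j = { uri, username, password };
  } else if (uri || username || password) {
    // Assumption: a partial set is a mistake rather than a valid configuration.
    throw new Error("Set NEO4J_URI, NEO4J_USERNAME, and NEO4J_PASSWORD together, or none of them");
  }
  return result;
}
```

In a Node.js process this could be called as `readOptionalIntegrations(process.env)`; passing the environment as a parameter keeps the function easy to test.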
|
141 | 216 | ### 2.1) Configure Vercel Blob Storage |
142 | 217 |
|
@@ -242,6 +317,12 @@ pnpm build |
242 | 317 | pnpm start |
243 | 318 | ``` |
244 | 319 |
|
| 320 | +## Roadmap — Future Integrations |
| 321 | + |
| 322 | +- **Notion API-key connector**: Paste your Notion Internal Integration token in settings, then select the pages to sync. No OAuth required. Contributions welcome. |
| 323 | +- **GitHub webhook sync**: Automatically ingest new issues and PRs via repository webhooks. |
| 324 | +- **Google Drive watch**: Automatic re-sync when Google Docs are updated, using Drive push notifications. |
| 325 | + |
245 | 326 | ## Troubleshooting |
246 | 327 |
|
247 | 328 | - Confirm Docker is running before DB startup. |