openKMS Development Plan

Current State (as of latest commit)

Document channels: CRUD, tree, description
Document upload + parsing via PaddleOCR-VL; DOCX/PPTX converted with LibreOffice then parsed like PDF; XLSX preview (openpyxl) at upload + run_spreadsheet_preview job on re-process; results stored in S3/MinIO under {file_hash}/
Document detail view with Markdown, layout images, block images; loads files via backend proxy
Document list by channel: GET /api/documents?channel_id=
Delete document: DELETE /api/documents/{id}
Document info & metadata: Edit name (PUT /api/documents/{id}), edit metadata (PUT /metadata), Extract via pydantic-ai Agent + StructuredDict
Document markdown: Edit and save (PUT /markdown; rebuilds page index in S3), restore from S3 (POST /restore-markdown; rebuilds page index), optional POST /rebuild-page-index (also triggered from Page Index tab refresh); detail page shows Save/Cancel in panel header when editing (not View toggle); Page Index tab has refresh control (tooltip: parse markdown to tree)
Document versions: document_versions table; explicit snapshots of markdown + metadata (POST /versions, GET /versions, GET /versions/{id}, POST /versions/{id}/restore); version checkpoint uses JSON field tag (DB column tag); UI: version column in Document Information (3-column stats), Save version when working copy newer than last snapshot, optional tag in Save as version modal, Versions modal as a table (Version / Tag / Saved / Actions); list/preview/restore with optional save-current-first; not created on routine markdown/metadata save
Document lifecycle & lineage: series_id, effective_from / effective_to, lifecycle_status on documents; document_relationships (supersedes, amends, implements, see_also); PATCH /lifecycle, GET/POST/DELETE /relationships; document API is_current_for_rag (computed: currently applicable for normal KB answers/indexing); default KB semantic search and kb-index (lifecycle_index_mode default current_only) respect that unless opted out; document detail Lineage & lifecycle under the METADATA block, collapsed by default (expand loads relationships)
Documents overview, channel management, channel settings (tabbed: General, Processing, Metadata extraction, Manual Labels)
Document metadata (unified): extracted metadata and manual labels stored in single metadata JSONB; channel extraction_schema supports object_type and list[object_type]; label_config (Manual Labels tab) maps keys to Master Data object types with type (object_type | list[object_type]); single METADATA section on document detail
Authentication: OPENKMS_AUTH_MODE=oidc (default, OIDC via issuer discovery + JWKS) or local (PostgreSQL users, /api/auth/*, CLI HTTP Basic); backend verifies JWT Bearer or session; GET /api/auth/public-config (no auth) exposes auth_mode and allow_signup only; GET /internal-api/models/document-parse-defaults (auth; optional model_name) supplies VLM base_url, model_name, and provider api_key for openkms-cli; SPA OIDC uses oidc-client-ts (VITE_OIDC_ISSUER); frontend resolves local vs OIDC from the API with VITE_AUTH_MODE as fallback; Vite proxy for /api and /internal-api in dev
User profile: /profile shows current user from GET /api/auth/me (is_admin, roles, resolved permissions, header menu). User settings /settings: personal API keys (POST/GET/DELETE /api/auth/api-keys). Console → Users & Roles: /api/admin/users with console:users; Permission management (/console/permission-management): All under Roles edits the security_permissions catalog; a named role uses checkboxes + Save role permissions (draft, no per-click API); GET /api/admin/permission-reference includes operation_key_hints; overview nudge when catalog is only all; Data security (/console/data-security/*) remains local-user–centric; group data scopes behind OPENKMS_ENFORCE_GROUP_DATA_SCOPES
Route protection: Home (/) is always reachable without sign-in (static marketing content via HomeStaticLanding); all other MainLayout routes require authentication. 401 responses whose body indicates invalid/expired JWT clear SPA session via authAwareFetch / AuthContext so the same gate appears instead of raw API error JSON
Knowledge Map & home hub: SQLAlchemy app.models.knowledge_map (KnowledgeMapNode, KnowledgeMapResourceLink → taxonomy_nodes / taxonomy_resource_links); API app.api.knowledge_map at GET /api/taxonomy/nodes/tree, node PATCH (move/reorder/edit) + link CRUD; GET /api/home/hub (taxonomy summary field in JSON + scoped document relationship work items + placeholder share requests); SPA KnowledgeMap.tsx at /knowledge-map (legacy /taxonomy redirects; sidebar above Glossaries; Tree + Node details panels with scoped refer-tos; New node modal); signed-in Home.tsx with taxonomy:read centers KnowledgeMapForceGraph (react-force-graph-2d, wiki-style pan/zoom; tree + links APIs; term → /knowledge-map?node=, resource → channel/wiki/articles); MainLayout applies app-content--home on / for hub padding; permissions taxonomy:read / taxonomy:write; feature toggle key taxonomy (Console label: Knowledge Map)
Articles: Backend article_channels, articles, article_versions, article_attachments, access_group_article_channels; APIs /api/article-channels, /api/articles (list, CRUD, lifecycle, markdown, files redirect, attachments, versions); MinIO prefix articles/{article_id}/; Knowledge Map validates article_channel links; permissions articles:read / articles:write; SPA ArticleChannelsContext, /articles, /articles/channels, /articles/channels/:id, /articles/channels/:id/settings, detail + markdown asset URLs
Knowledge Bases: Full CRUD, documents, FAQs (manual + LLM-generated), chunks (pgvector), semantic search with hybrid filters (metadata_filters) and optional include_historical_documents, Q&A proxy, settings (chunk_config incl. lifecycle_index_mode, faq_prompt, metadata_keys); doc_metadata propagated from documents to FAQs/chunks per metadata_keys; openkms-cli pipeline run --pipeline-name kb-index; QA Agent service (FastAPI + LangGraph)
Wiki spaces: wiki_spaces, wiki_pages, wiki_files, wiki_space_documents (+ access_group_wiki_spaces); API /api/wiki-spaces (scoped like KBs when OPENKMS_ENFORCE_GROUP_DATA_SCOPES); PageIndex; GET /api/wiki-spaces/{id}/graph; vault mirror + POST .../import/vault; paginated page list (15); GET/POST/DELETE /api/wiki-spaces/{id}/documents for channel document links (GET list: linked_at + linked document updated_at for SPA “last updated”); embedded agent POST/GET/DELETE/PATCH /api/agent/conversations, .../messages (list by wiki space, conversation delete/title optional, GFM + auto-scroll in SPA; LangGraph read-only tools; OPENKMS_AGENT_MODEL_ID or default LLM on Models /models) — wiki_agent_prototype.md; openkms-cli wiki put / sync / upload-file
openkms-cli tests: openkms-cli/tests/ — pip install -e ".[dev]" && pytest tests/ (VLM defaults merge + mocked fetch; parser _restructure_pages_after_predict and layout/bbox helpers; no Paddle in test env)
Console: System settings (/console/settings) — system_settings table (system_name, default_timezone, api_base_url_note); GET /api/public/system (unauthenticated) returns trimmed system_name only; GET/PUT /api/system/settings with console:settings; sidebar title is blank until that public response, then shows openKMS when the name is empty or whitespace; users, feature toggles, object types, link types, data sources, datasets, permission management, data security (groups + resource scopes); entry gated by console:* permissions or JWT admin; per-page permissions (e.g. console:feature_toggles)
Evaluation (experimental, feature toggle): query + expected answer pairs per KB; topic column; CSV import (topic, query, answer); items list paginated (GET .../items offset/limit, default limit 10); run types search_retrieval (hybrid search + judge) and qa_answer (KB agent + judge); persisted evaluation_runs / evaluation_run_items; list/get/delete/compare runs in API and dataset detail UI; sidebar link when evaluationDatasets enabled
Glossaries: CRUD glossaries, terms with bilingual (EN/CN) support, definition, synonyms, AI suggestion (translation + definition + synonyms), search (EN, CN, definition, synonyms), export/import; dev.sh ensures pgvector on start; backend README + dev setup doc: pgvector install, Docker/PGDG, $libdir/vector troubleshooting
Objects & Links: ontology layer (object types, link types, instances); schema in Console; user-facing browse at /ontology (overview), /objects, /links; feature toggle objectsAndLinks
Data Sources: Console → Data Sources (PostgreSQL/Neo4j connections, encrypted creds). Datasets & object/link schema admin: Ontology sidebar (/ontology/datasets, /ontology/object-types, /ontology/link-types); ontology:read/ontology:write can use the same APIs as console:datasets / console:object_types / console:link_types where wired with require_any_permission.
Docs site: mkdocs.yml (Material theme) + .github/workflows/docs.yml publish docs/ to GitHub Pages at https://yingrui.github.io/openKMS/ on every push to main that touches docs/**, mkdocs.yml, or the workflow; reader-friendly entry pages (index.md, overview.md, quickstart.md, operations/docker.md, developer/setup.md) sit on top of the existing canonical references (architecture.md, functionalities.md, development_plan.md, security.md, tech_debt.md); docs/agents.md documents where each kind of doc edit goes, mirroring .cursor/rules/*.mdc. Folder rename docs/for developer/ → docs/developer/ to keep URLs space-free.
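
The lifecycle item above describes is_current_for_rag as a computed flag derived from lifecycle_status and the effective_from / effective_to window. A minimal sketch of how such a flag could be computed follows. The status names, the open-ended handling of missing bounds, and the inclusive end date are all assumptions made for illustration; the plan does not specify them.

```python
from datetime import date
from typing import Optional

# Hypothetical status names; the plan does not list the real lifecycle_status values.
NON_CURRENT_STATUSES = frozenset({"superseded", "withdrawn"})


def is_current_for_rag(
    lifecycle_status: Optional[str],
    effective_from: Optional[date],
    effective_to: Optional[date],
    today: date,
) -> bool:
    """Return True when a document should count as currently applicable."""
    if lifecycle_status in NON_CURRENT_STATUSES:
        return False
    # A missing bound is treated as open-ended (assumption).
    if effective_from is not None and today < effective_from:
        return False
    # effective_to is treated as inclusive (assumption).
    if effective_to is not None and today > effective_to:
        return False
    return True
```

Under these assumptions, a search or kb-index run in current_only mode would skip any document for which the function returns False, unless the caller opts in to historical documents.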

Short-Term (Next Steps)

Wiki Copilot and linked documents (build on wiki_agent_prototype.md)

Pages | Documents; linked-docs picker; Wiki Copilot wired to /api/agent (persisted conversations; read tools; list/delete conversations, markdown + auto-scroll in panel); wiki-skills vendored via git subtree at third-party/wiki-skills, SKILL.md content in LangGraph system prompt
wiki_space_documents + agent_* tables; link/unlink/list; SPA uses API (not sessionStorage) for links
Backend embedded agent (v1): LangGraph create_react_agent + agent_conversations / agent_messages
Tool visibility while streaming: astream_events (v2) → NDJSON tool_start / tool_end / tool_error (paired by run_id); wiki panel shows compact terminal-style rows interleaved with streamed text (not all tools then all text) and expandable I/O
Optional: Langfuse tracing; write tools for pages
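
The tool-visibility item above pairs tool_start / tool_end / tool_error events by run_id. The sketch below shows one way a consumer might fold such an NDJSON stream into one row per tool call. Only the three event names and run_id come from the plan; the "type" key and the name, input, output, and error fields are hypothetical and used purely for illustration.

```python
import json
from typing import Dict, Iterable, List


def pair_tool_events(ndjson_lines: Iterable[str]) -> List[dict]:
    """Fold tool_start / tool_end / tool_error events into one row per run_id."""
    rows: Dict[str, dict] = {}  # dicts keep insertion order, so rows stay in start order
    for raw in ndjson_lines:
        raw = raw.strip()
        if not raw:
            continue  # tolerate blank lines in the stream
        event = json.loads(raw)
        kind = event.get("type")  # hypothetical discriminator key
        if kind not in ("tool_start", "tool_end", "tool_error"):
            continue  # e.g. streamed text chunks, handled elsewhere
        run_id = event["run_id"]
        row = rows.setdefault(
            run_id,
            {"run_id": run_id, "name": event.get("name"), "status": "running",
             "input": None, "output": None, "error": None},
        )
        if kind == "tool_start":
            row["input"] = event.get("input")
        elif kind == "tool_end":
            row["status"] = "done"
            row["output"] = event.get("output")
        else:
            row["status"] = "error"
            row["error"] = event.get("error")
    return list(rows.values())
```

A row that never receives an end or error event keeps the status "running", which is roughly what a panel would show while a tool call is still in flight.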

0. openkms-cli (document parsing CLI)

1. Document List Integration

Replace mockDocumentsByChannel with backend API
Add GET /api/documents?channel_id=... (filter by channel + descendants)
Wire DocumentChannel page to real document list
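
The channel filter above covers a channel plus its descendants. As a rough illustration, the descendant set can be collected from a flat id-to-parent mapping before filtering documents. The function name and the in-memory mapping are assumptions; the actual backend may resolve the tree in SQL instead.

```python
from collections import defaultdict, deque
from typing import Dict, List, Optional


def channel_with_descendants(
    channel_id: str, parent_of: Dict[str, Optional[str]]
) -> List[str]:
    """Return channel_id followed by all of its descendants, breadth-first."""
    # Invert the child -> parent mapping into parent -> children.
    children = defaultdict(list)
    for cid, pid in parent_of.items():
        children[pid].append(cid)

    ordered: List[str] = []
    seen = set()
    queue = deque([channel_id])
    while queue:
        current = queue.popleft()
        if current in seen:
            continue  # guards against accidental cycles in the data
        seen.add(current)
        ordered.append(current)
        queue.extend(children[current])
    return ordered
```

The returned ids could then drive a filter along the lines of "document channel id is in this list" when serving GET /api/documents?channel_id=....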

2. Channel Management (Rename, Move, Merge, Delete)

Rename channel: Name field in channel settings; backend PUT supports name
Edit description: Description in channel create form and settings; backend supports it
Move channel: parent_id in ChannelUpdate; Move button in manage UI with parent dropdown
Delete channel: DELETE /api/document-channels/{id}; blocked if the channel has documents or sub-channels; confirm UI
Merge channels: POST /api/document-channels/merge; move docs to target, delete source; optional include_descendants
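
The merge rule above (move documents to the target, delete the source, optionally include descendants) can be sketched on plain dictionaries. This is an illustration only: the function signature is hypothetical, and refusing to merge a source that has sub-channels unless include_descendants is set is an assumption borrowed from the delete rule, not something the plan states.

```python
from typing import Dict, List, Optional


def merge_channels(
    channels: Dict[str, Optional[str]],  # channel id -> parent id
    documents: Dict[str, str],           # document id -> channel id
    source_id: str,
    target_id: str,
    include_descendants: bool = False,
) -> List[str]:
    """Move documents from source into target, delete source, return moved ids."""
    # Collect the source subtree, source first. The list grows while we
    # iterate over it, so it doubles as a simple queue.
    subtree = [source_id]
    for cid in subtree:
        subtree.extend(
            c for c, p in channels.items() if p == cid and c not in subtree
        )

    if target_id in subtree:
        raise ValueError("target cannot be the source or one of its descendants")
    if len(subtree) > 1 and not include_descendants:
        # Assumption: mirror the delete rule instead of orphaning sub-channels.
        raise ValueError("source has sub-channels; pass include_descendants=True")

    merged = set(subtree)
    moved = [doc_id for doc_id, cid in documents.items() if cid in merged]
    for doc_id in moved:
        documents[doc_id] = target_id
    for cid in subtree:
        del channels[cid]
    return moved
```

In a real backend the two mutations would presumably run in a single database transaction so that a failure cannot leave documents pointing at a deleted channel.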

3. Document Operations

Move document between channels (PUT /api/documents/{id} with channel_id; Move modal in document list)
Delete document
Document metadata extraction: LLM extracts abstract, author, publish_date, tags, etc.; configurable schema per channel in settings; Extract button on detail page
Search in document list (GET /api/documents?search=...; optional when no channel)
Advanced filter in channel

4a. Objects & Links (Ontology)

4b. Data Sources (Console) & Datasets / schema (Ontology)

4. Authentication

Medium-Term

5. Pipelines

6. Jobs (procrastinate)

6b. Unify Metadata and Labels (2026-03)

Merge labels into metadata; single METADATA concept in DB and UI
Alembic migration: merge labels → metadata, label_keys → metadata_keys, drop labels/label_keys columns
Add object_type and list[object_type] to extraction schema; object_type_extraction_max_instances on channel
Rename Labels tab to Manual Labels; label_config uses type (object_type | list[object_type]) instead of allow_multiple
KB: metadata_keys only; openkms-cli and backend propagation use _propagate_metadata(doc_metadata, metadata_keys)
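
The last item above names _propagate_metadata(doc_metadata, metadata_keys) as the shared propagation helper. Its body is not shown in this plan, so the following is only a guess at the simplest behavior consistent with the description: copy the configured keys from the document's metadata and ignore everything else. The function is deliberately given a different name to make clear it is not the project's implementation.

```python
from typing import Any, Dict, Iterable, Optional


def propagate_metadata_sketch(
    doc_metadata: Optional[Dict[str, Any]],
    metadata_keys: Optional[Iterable[str]],
) -> Dict[str, Any]:
    """Copy only the configured keys from a document's metadata."""
    # Nothing to propagate when either side is missing or empty (assumption).
    if not doc_metadata or not metadata_keys:
        return {}
    # Keys that the document does not carry are skipped rather than set to None.
    return {key: doc_metadata[key] for key in metadata_keys if key in doc_metadata}
```

Under this reading, the resulting dictionary would be what lands in doc_metadata on each FAQ or chunk, so hybrid search filters only ever see keys that the KB explicitly opted into.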

6c. Tech Debt Mitigation (2026-03)

7. Knowledge Bases (RAG)

8. Articles Backend

Article model and API
Article channels (separate from document channels)
Rich text / Markdown editor

Long-Term

Multi-tenancy
Audit logging
Glossary export/import (implemented); document export/import
Plugin/extensibility
Mobile/responsive polish

Conventions

Before commit: Update docs/architecture.md, docs/development_plan.md, docs/functionalities.md to reflect changes. See .cursor/rules/docs-before-commit.mdc.

Open Questions

All documents view – Show documents from all channels when no channel is selected?
Article channels – Same tree model as documents or different?
Default channel – Auto-select first channel or require explicit selection?