factflow-sharepoint
SharePoint ingest and document conversion adapters. Pulls documents from SharePoint via Microsoft Graph, converts them (Word, PDF, etc.) to markdown for downstream processing.
Tier and role
Section titled “Tier and role”- Tier: workflow
- Import name:
factflow_sharepoint - Source:
backend/packages/workflows/factflow-sharepoint/
Closed pipeline producing markdown from SharePoint-hosted Office documents. Slots into pipelines where segmentation / embedding / knowledge extraction run downstream.
Context
Section titled “Context”Two subpackages:
sharepoint
Section titled “sharepoint”Ingestion from Microsoft Graph.
adapter.py— pipeline adapter (type: sharepoint_ingestin YAML)graph_client.py— thin async wrapper over the Microsoft Graph REST APIsettings.py— auth + site / drive selectionmodels.py— SharePoint-specific metadata
document_converter
Section titled “document_converter”Convert the downloaded binary documents to markdown.
adapter.py— pipeline adapter (type: document_converter)models.py— conversion output metadata
Rationale
Section titled “Rationale”- Graph over SOAP. Microsoft Graph is the current API; the legacy SOAP SharePoint API is not supported.
- Converter stage is separate. Ingest and conversion are separate adapters so conversion can be replayed independently if the converter is updated — the binary is persisted between stages.
Public API
Section titled “Public API”Top-level factflow_sharepoint/__init__.py is empty. Consumers:
- Configure both adapters in pipeline YAML
- Set Graph credentials in env (
SHAREPOINT_CLIENT_ID,SHAREPOINT_CLIENT_SECRET,SHAREPOINT_TENANT_ID, or equivalent persettings.py)
Direct imports for custom integrations:
from factflow_sharepoint.sharepoint.adapter import SharePointIngestAdapterfrom factflow_sharepoint.document_converter.adapter import DocumentConverterAdapterDependencies
Section titled “Dependencies”- Workspace:
factflow-protocols,factflow-foundation,factflow-engine - External services: Microsoft Graph (tenant access), storage provider for downloaded binaries + converted markdown
Testing
Section titled “Testing”Tests at backend/packages/workflows/factflow-sharepoint/tests/. Integration tests hit a sandboxed tenant — skipped by default in CI.
Related
Section titled “Related”factflow-markdown— typical downstream (segment the converted markdown)factflow-knowledge— end of the typical SharePoint ingest chain