Skip to main content

Document Intelligence

🟢 Production Service | docsgpt.deltaops.ca

An advanced RAG (Retrieval-Augmented Generation) assistant with project management extensions that provides intelligent document analysis, AI-powered chat, and organizational knowledge management.

Overview​

Document Intelligence combines the open-source DocsGPT platform with custom project management capabilities, creating a comprehensive solution for document analysis, knowledge extraction, and intelligent conversation with organizational data.

Key Capabilities​

  • RAG Document Analysis: Upload and analyze documents with AI-powered insights
  • Project Management Integration: Extended functionality for project intelligence and coordination
  • Multi-tenant Architecture: Organizational isolation with secure data boundaries
  • Advanced Chat Interface: Natural language interaction with document collections
  • Knowledge Graph Integration: Semantic relationships and cross-project intelligence

Architecture​

Core Platform​

  • DocsGPT Base: Open-source RAG assistant for document intelligence
  • Project Manager Extensions: Custom modules for project coordination and management
  • Enhanced Database Service: Multi-table D1 integration with organizational schemas
  • Cloudflare Workers: Edge-native deployment with global performance

Technology Stack​

  • Frontend: React with Vite build system and Tailwind CSS
  • Backend: Cloudflare Workers with Hono framework
  • Database: Cloudflare D1 with organizational base schemas + application-specific tables
  • Vector Store: Multiple options (Faiss, Elasticsearch, Qdrant)
  • LLM Integration: OpenAI, Anthropic, Google AI, and local model support

Features​

📚 Document Intelligence​

Advanced document processing and analysis:

  • Multi-format Support: PDF, DOCX, TXT, Markdown, and more
  • Intelligent Chunking: Semantic document segmentation for optimal retrieval
  • Vector Embeddings: High-quality semantic search across document collections
  • Context-aware Responses: AI answers grounded in uploaded documentation

🎯 Project Management Extensions​

Enhanced project intelligence capabilities:

  • Project Coordination: Multi-project status tracking and reporting
  • Team Engagement: Automated stakeholder communication and updates
  • Cognitive Load Management: Intelligent timing and preference handling
  • Cross-project Intelligence: Knowledge sharing across organizational boundaries

🔄 Integration Ecosystem​

Seamless connectivity with organizational tools:

  • Azure DevOps Integration: Work item tracking and project metrics
  • Neo4j Knowledge Graph: Semantic relationships and organizational intelligence
  • Redis Coordination: Multi-agent state management and coordination
  • Cloudflare Analytics: Performance monitoring and usage insights

🔒 Security & Compliance​

Enterprise-grade security implementation:

  • Multi-tenant Isolation: Database-level tenant separation
  • Cloudflare Access Authentication: SSO integration with organizational directory
  • Data Encryption: End-to-end encryption for sensitive documents
  • Audit Logging: Comprehensive access and activity tracking

Getting Started​

Access Requirements​

  • RESTIV Technology SSO account
  • Cloudflare Access authorization
  • Document upload permissions (role-based)

Quick Start​

  1. Access the Platform

    https://docsgpt.deltaops.ca
  2. Upload Documents

    • Navigate to document management section
    • Upload files via drag-and-drop or file browser
    • Wait for processing and indexing completion
  3. Start Conversations

    • Use the chat interface to ask questions about your documents
    • Reference specific documents or search across collections
    • Leverage AI insights for document analysis and summarization

API Integration​

Document Upload API​

# Upload document for processing
POST /api/documents/upload
Content-Type: multipart/form-data
Authorization: Bearer <token>

{
"file": <binary-data>,
"collection": "project-docs",
"metadata": {
"project": "project-id",
"type": "specification"
}
}

Chat API​

# Send chat message
POST /api/chat
Content-Type: application/json
Authorization: Bearer <token>

{
"message": "What are the key requirements for the new feature?",
"collection": "project-docs",
"conversation_id": "uuid"
}

Response Format​

{
"data": {
"response": "Based on the uploaded documents...",
"sources": [
{
"document": "requirements.pdf",
"page": 3,
"confidence": 0.95
}
],
"conversation_id": "uuid"
},
"meta": {
"timestamp": "2025-01-14T12:00:00Z",
"requestId": "correlation-id",
"model": "gpt-4-turbo"
}
}

Database Architecture​

Organizational Base Schemas​

  • auth-nextauthv5-base-schema.sql: Organizations, Users, Sessions, Accounts
  • messaging-conversations-base-schema.sql: Conversations, Messages, Attachments

Application-Specific Schema​

  • document-rfp-editor-schema.sql: Sources, Prompts, Agents, Token Usage, Documents, Edits

Key Features​

  • Multi-tenant Support: Automatic organization isolation
  • MongoDB-compatible API: Seamless migration from existing systems
  • Field Mapping: Handles base schema differences transparently
  • JSON Field Support: Automatic serialization/deserialization

Deployment Architecture​

Production Environment​

  • Cloudflare Workers: Global edge deployment with intelligent routing
  • Cloudflare D1: Serverless SQLite with organizational schema compliance
  • Vector Storage: Integrated with multiple vector database options
  • Custom Domain: docsgpt.deltaops.ca with SSL/TLS termination

Development Environment​

  • Local MongoDB: Development database with Redis for caching
  • Docker/Podman: Containerized services for consistent development
  • Hot Reload: Automatic reloading during development
  • Testing Suite: Comprehensive unit, integration, and E2E tests

Project Management Extensions​

Enhanced Capabilities​

The DocsGPT platform includes custom project management extensions providing:

  • 92.3% Organizational Standards Compliance (12/13 architectural principles)
  • Enhanced Database Service: Multi-table D1 integration with project management schema
  • Project Intelligence Routes: /api/project/* endpoints for project analysis
  • Cross-repository Coordination: Integration with PPTX system and infrastructure services
  • Natural Language Processing: Advanced chat and document intelligence

Integration Points​

  • PPTX System Integration: Shared authentication and API coordination
  • Infrastructure Services: Redis MCP and Neo4j knowledge graph integration
  • Azure DevOps Coordination: Unified project intelligence across platforms

Support & Troubleshooting​

Common Issues​

Document Upload Failures

  • Check file format compatibility (PDF, DOCX, TXT supported)
  • Verify file size limits (50MB maximum per document)
  • Ensure proper authentication and upload permissions

Chat Response Quality

  • Upload relevant documents to the collection first
  • Use specific questions rather than broad queries
  • Reference document names or sections for better context

Performance Issues

  • Document processing takes 30-60 seconds for large files
  • Chat responses typically return within 5-10 seconds
  • Vector search may be slower for large document collections

Getting Help​

  • Internal Support: Slack #document-intelligence channel
  • Technical Issues: Create GitHub issue in RestivTech/private-docsgpt
  • Feature Requests: Submit via internal feature request process

Roadmap​

Current Development (Q1 2025)​

  • Enhanced project management extensions
  • Advanced multi-agent coordination capabilities
  • Improved cross-project intelligence flows

Planned Features (Q2 2025)​

  • Advanced document workflow automation
  • Enhanced semantic search with knowledge graph integration
  • Custom model fine-tuning for organizational terminology

Status: ✅ Production Ready | Compliance: 92.3% Organizational Standards | Last Updated: January 2025