Document Intelligence
🟢 Production Service | docsgpt.deltaops.ca
An advanced RAG (Retrieval-Augmented Generation) assistant with project management extensions that provides intelligent document analysis, AI-powered chat, and organizational knowledge management.
Overview​
Document Intelligence combines the open-source DocsGPT platform with custom project management capabilities, creating a comprehensive solution for document analysis, knowledge extraction, and intelligent conversation with organizational data.
Key Capabilities​
- RAG Document Analysis: Upload and analyze documents with AI-powered insights
- Project Management Integration: Extended functionality for project intelligence and coordination
- Multi-tenant Architecture: Organizational isolation with secure data boundaries
- Advanced Chat Interface: Natural language interaction with document collections
- Knowledge Graph Integration: Semantic relationships and cross-project intelligence
Architecture​
Core Platform​
- DocsGPT Base: Open-source RAG assistant for document intelligence
- Project Manager Extensions: Custom modules for project coordination and management
- Enhanced Database Service: Multi-table D1 integration with organizational schemas
- Cloudflare Workers: Edge-native deployment with global performance
Technology Stack​
- Frontend: React with Vite build system and Tailwind CSS
- Backend: Cloudflare Workers with Hono framework
- Database: Cloudflare D1 with organizational base schemas + application-specific tables
- Vector Store: Multiple options (Faiss, Elasticsearch, Qdrant)
- LLM Integration: OpenAI, Anthropic, Google AI, and local model support
Features​
📚 Document Intelligence​
Advanced document processing and analysis:
- Multi-format Support: PDF, DOCX, TXT, Markdown, and more
- Intelligent Chunking: Semantic document segmentation for optimal retrieval
- Vector Embeddings: High-quality semantic search across document collections
- Context-aware Responses: AI answers grounded in uploaded documentation
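The chunking step above can be sketched in a few lines. This is a minimal illustration that uses fixed-size word windows with overlap, a simpler stand-in for the semantic segmentation the platform describes. The function name `chunk_words` and its default sizes are hypothetical and are not taken from DocsGPT.

```python
def chunk_words(text: str, size: int = 200, overlap: int = 40) -> list[str]:
    """Split text into overlapping word windows (hypothetical helper)."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("size must be positive and 0 <= overlap < size")
    words = text.split()
    step = size - overlap  # how far each new window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # this window already reaches the end of the text
    return chunks
```

The overlap keeps a sentence that straddles a window boundary retrievable from at least one chunk, at the cost of indexing some words twice.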
🎯 Project Management Extensions​
Enhanced project intelligence capabilities:
- Project Coordination: Multi-project status tracking and reporting
- Team Engagement: Automated stakeholder communication and updates
- Cognitive Load Management: Intelligent timing and preference handling
- Cross-project Intelligence: Knowledge sharing across organizational boundaries
🔄 Integration Ecosystem​
Seamless connectivity with organizational tools:
- Azure DevOps Integration: Work item tracking and project metrics
- Neo4j Knowledge Graph: Semantic relationships and organizational intelligence
- Redis Coordination: Multi-agent state management and coordination
- Cloudflare Analytics: Performance monitoring and usage insights
🔒 Security & Compliance​
Enterprise-grade security implementation:
- Multi-tenant Isolation: Database-level tenant separation
- Cloudflare Access Authentication: SSO integration with organizational directory
- Data Encryption: End-to-end encryption for sensitive documents
- Audit Logging: Comprehensive access and activity tracking
Getting Started​
Access Requirements​
- RESTIV Technology SSO account
- Cloudflare Access authorization
- Document upload permissions (role-based)
Quick Start​
- Access the Platform: https://docsgpt.deltaops.ca
- Upload Documents
  - Navigate to document management section
  - Upload files via drag-and-drop or file browser
  - Wait for processing and indexing completion
- Start Conversations
  - Use the chat interface to ask questions about your documents
  - Reference specific documents or search across collections
  - Leverage AI insights for document analysis and summarization
API Integration​
Document Upload API​
# Upload a document for processing (the fields below are multipart form parts, shown in JSON-like notation)
POST /api/documents/upload
Content-Type: multipart/form-data
Authorization: Bearer <token>
{
"file": <binary-data>,
"collection": "project-docs",
"metadata": {
"project": "project-id",
"type": "specification"
}
}
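A client for the upload endpoint might look like the following sketch, which builds the multipart body by hand using only the Python standard library. The endpoint path, field names, and bearer token header come from the request above. Sending `metadata` as a JSON-encoded string part is an assumption, and `build_upload_request` is a hypothetical helper name.

```python
import json
import urllib.request
import uuid
from typing import Optional

BASE_URL = "https://docsgpt.deltaops.ca"  # production host named on this page


def build_upload_request(token: str, filename: str, content: bytes,
                         collection: str, metadata: dict) -> urllib.request.Request:
    """Build (but do not send) a multipart/form-data upload request."""
    boundary = uuid.uuid4().hex
    lines: list[bytes] = []

    def add_part(name: str, value: bytes, file_name: Optional[str] = None) -> None:
        disposition = f'form-data; name="{name}"'
        if file_name is not None:
            disposition += f'; filename="{file_name}"'
        lines.append(f"--{boundary}".encode())
        lines.append(f"Content-Disposition: {disposition}".encode())
        if file_name is not None:
            lines.append(b"Content-Type: application/octet-stream")
        lines.append(b"")  # blank line separates part headers from the value
        lines.append(value)

    add_part("file", content, file_name=filename)
    add_part("collection", collection.encode())
    # Assumption: the server accepts metadata as a JSON string part.
    add_part("metadata", json.dumps(metadata).encode())
    lines.append(f"--{boundary}--".encode())
    lines.append(b"")

    return urllib.request.Request(
        f"{BASE_URL}/api/documents/upload",
        data=b"\r\n".join(lines),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` sends the request. Keeping construction separate from sending makes the body easy to inspect before anything goes over the network.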
Chat API​
# Send chat message
POST /api/chat
Content-Type: application/json
Authorization: Bearer <token>
{
"message": "What are the key requirements for the new feature?",
"collection": "project-docs",
"conversation_id": "uuid"
}
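The chat call is a plain JSON POST, so a client sketch is short. The path, headers, and the three body fields are taken from the request above; `build_chat_request` is a hypothetical helper name, and whether the client or the server is expected to mint the `conversation_id` is not stated on this page, so the sketch simply takes it as an argument.

```python
import json
import urllib.request

BASE_URL = "https://docsgpt.deltaops.ca"  # production host named on this page


def build_chat_request(token: str, message: str, collection: str,
                       conversation_id: str) -> urllib.request.Request:
    """Build (but do not send) a chat request with a JSON body."""
    payload = {
        "message": message,
        "collection": collection,
        "conversation_id": conversation_id,
    }
    return urllib.request.Request(
        f"{BASE_URL}/api/chat",
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )
```

Reusing the same `conversation_id` across calls is presumably what ties follow-up questions to earlier context.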
Response Format​
{
"data": {
"response": "Based on the uploaded documents...",
"sources": [
{
"document": "requirements.pdf",
"page": 3,
"confidence": 0.95
}
],
"conversation_id": "uuid"
},
"meta": {
"timestamp": "2025-01-14T12:00:00Z",
"requestId": "correlation-id",
"model": "gpt-4-turbo"
}
}
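A consumer of this response will usually want the answer text plus the sources that back it. The sketch below parses the envelope shown above and drops low-confidence sources. The field names come from the sample response; the `Source` dataclass, the `parse_chat_response` function, and the `min_confidence` threshold are hypothetical additions for illustration.

```python
import json
from dataclasses import dataclass


@dataclass
class Source:
    document: str
    page: int
    confidence: float


def parse_chat_response(raw: str, min_confidence: float = 0.0) -> tuple[str, list[Source], str]:
    """Return (answer text, filtered sources, request ID) from a raw JSON body."""
    body = json.loads(raw)
    data = body["data"]
    sources = [
        Source(s["document"], s["page"], s["confidence"])
        for s in data.get("sources", [])
        if s["confidence"] >= min_confidence
    ]
    # requestId is the correlation ID to quote when reporting a problem.
    return data["response"], sources, body["meta"]["requestId"]
```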
Database Architecture​
Organizational Base Schemas​
- auth-nextauthv5-base-schema.sql: Organizations, Users, Sessions, Accounts
- messaging-conversations-base-schema.sql: Conversations, Messages, Attachments
Application-Specific Schema​
- document-rfp-editor-schema.sql: Sources, Prompts, Agents, Token Usage, Documents, Edits
Key Features​
- Multi-tenant Support: Automatic organization isolation
- MongoDB-compatible API: Seamless migration from existing systems
- Field Mapping: Handles base schema differences transparently
- JSON Field Support: Automatic serialization/deserialization
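Two of the features above, organization isolation and JSON field handling, can be illustrated against SQLite, the engine behind D1. This is a sketch under stated assumptions: the `documents` table, its columns, and the helper names are hypothetical and are not taken from the schema files listed above, and the real service runs on Cloudflare D1 rather than Python's `sqlite3` module.

```python
import json
import sqlite3


def open_db() -> sqlite3.Connection:
    """Create an in-memory database with a hypothetical tenant-scoped table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE documents ("
        " id INTEGER PRIMARY KEY,"
        " organization_id TEXT NOT NULL,"
        " name TEXT NOT NULL,"
        " metadata TEXT NOT NULL)"  # JSON stored as text
    )
    return conn


def insert_document(conn: sqlite3.Connection, org_id: str, name: str, metadata: dict) -> int:
    """Serialize metadata to JSON on write and return the new row ID."""
    cur = conn.execute(
        "INSERT INTO documents (organization_id, name, metadata) VALUES (?, ?, ?)",
        (org_id, name, json.dumps(metadata)),
    )
    return cur.lastrowid


def find_documents(conn: sqlite3.Connection, org_id: str) -> list[dict]:
    """Every read is filtered by organization, so tenants never see each other's rows."""
    rows = conn.execute(
        "SELECT name, metadata FROM documents WHERE organization_id = ? ORDER BY id",
        (org_id,),
    ).fetchall()
    return [{"name": name, "metadata": json.loads(meta)} for name, meta in rows]
```

Routing all queries through helpers that require an organization ID means a missing tenant filter fails at the call site instead of silently returning another organization's data.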
Deployment Architecture​
Production Environment​
- Cloudflare Workers: Global edge deployment with intelligent routing
- Cloudflare D1: Serverless SQLite with organizational schema compliance
- Vector Storage: Integrated with multiple vector database options
- Custom Domain: docsgpt.deltaops.ca with SSL/TLS termination
Development Environment​
- Local MongoDB: Development database with Redis for caching
- Docker/Podman: Containerized services for consistent development
- Hot Reload: Automatic reloading during development
- Testing Suite: Comprehensive unit, integration, and E2E tests
Project Management Extensions​
Enhanced Capabilities​
The DocsGPT platform includes custom project management extensions providing:
- 92.3% Organizational Standards Compliance (12/13 architectural principles)
- Enhanced Database Service: Multi-table D1 integration with project management schema
- Project Intelligence Routes: /api/project/* endpoints for project analysis
- Cross-repository Coordination: Integration with PPTX system and infrastructure services
- Natural Language Processing: Advanced chat and document intelligence
Integration Points​
- PPTX System Integration: Shared authentication and API coordination
- Infrastructure Services: Redis MCP and Neo4j knowledge graph integration
- Azure DevOps Coordination: Unified project intelligence across platforms
Support & Troubleshooting​
Common Issues​
Document Upload Failures
- Check file format compatibility (PDF, DOCX, TXT, and Markdown are supported)
- Verify file size limits (50MB maximum per document)
- Ensure proper authentication and upload permissions
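The first two checks can be run locally before an upload is attempted. The sketch below is a hypothetical pre-flight helper: the 50MB limit comes from this page, treating it as 50 × 1024 × 1024 bytes is an assumption, and the extension set is inferred from the formats this page names rather than from the server's actual validation rules.

```python
from pathlib import Path

MAX_BYTES = 50 * 1024 * 1024  # 50MB limit from this page; binary megabytes assumed
ALLOWED_SUFFIXES = {".pdf", ".docx", ".txt", ".md"}  # inferred from the listed formats


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks uploadable."""
    problems = []
    suffix = Path(filename).suffix.lower()
    if suffix not in ALLOWED_SUFFIXES:
        problems.append(f"unsupported file type: {suffix or '(none)'}")
    if size_bytes > MAX_BYTES:
        problems.append(f"file is {size_bytes} bytes, limit is {MAX_BYTES}")
    return problems
```

Authentication and permission problems cannot be detected this way; those surface only in the server's response.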
Chat Response Quality
- Upload relevant documents to the collection first
- Use specific questions rather than broad queries
- Reference document names or sections for better context
Performance Issues
- Document processing takes 30-60 seconds for large files
- Chat responses typically return within 5-10 seconds
- Vector search may be slower for large document collections
Getting Help​
- Internal Support: Slack #document-intelligence channel
- Technical Issues: Create GitHub issue in RestivTech/private-docsgpt
- Feature Requests: Submit via internal feature request process
Roadmap​
Current Development (Q1 2025)​
- Enhanced project management extensions
- Advanced multi-agent coordination capabilities
- Improved cross-project intelligence flows
Planned Features (Q2 2025)​
- Advanced document workflow automation
- Enhanced semantic search with knowledge graph integration
- Custom model fine-tuning for organizational terminology
Status: ✅ Production Ready | Compliance: 92.3% Organizational Standards | Last Updated: January 2025