Complete Platform User Guide
This comprehensive guide will walk you through the entire Graphora platform workflow, from initial setup to successfully managing your knowledge graphs. Follow these steps in order for the best experience.Step 1: Configure Database and API Settings
When you first access Graphora, you’ll need to provide your database details and AI API key on the Config page.Required Configuration
- Navigate to the Config page in your Graphora dashboard
-
Database Configuration: Provide your Neo4j database connection details:
- URI: Your Neo4j database URI (e.g.,
bolt://localhost:7687). - Username: Database username
- Password: Database password
- Environment: Choose between staging and production databases
- URI: Your Neo4j database URI (e.g.,
-
AI API Key: Configure your Gemini API key for AI-powered processing:
- Gemini API Key: Your Google AI Studio API key
- This enables AI-powered document processing and knowledge extraction
Graphora follows a BYOK (Bring Your Own Key) architecture, meaning your API keys and database credentials remain under your control and are never stored on Graphora’s servers.
Validation
After entering your configuration:- Test your database connection to ensure it’s working
- Verify your Gemini API key is valid
- Save your configuration before proceeding
Step 2: Create or Import Your Schema
Next, you’ll need to define the structure of your knowledge graph using the AI Copilot page.For New Users (No Existing Schema)
Use the Schema Copilot — a chat-based AI assistant that builds ontology schemas from natural language descriptions.- Navigate to the AI Copilot page
- Describe your domain: Provide a clear description of what you want to model
- Example: “I want to model a company’s organizational structure with employees, departments, and projects”
- Use AI to generate schema: The copilot uses streaming responses to build your schema in real time
- Refine through conversation: Ask follow-up questions to adjust entities, properties, and relationships
- Save your ontology: The system will validate and store your schema
For Users with Existing Schemas
If you already have an ontology definition, it must adhere to the Ontology Editor YAML format:Ontology YAML Format
Learn about the complete YAML syntax, structure, and requirements for ontology definitions.
- Use camelCase for all property names
- Follow the version 1 format specification
- Include proper entity definitions with properties and relationships
- Define quality rules for data validation
Step 3: Transform Documents into Knowledge Graphs
Once your schema is ready, move to the Transform page to process your documents.Document Upload and Processing
- Select your ontology: Choose the schema you created or imported
- Upload documents: Support for multiple formats:
- PDF files
- Text files (.txt)
- Markdown files (.md)
- Start transformation: The system will:
- Extract text from your documents
- Use AI to identify entities and relationships
- Structure the data according to your ontology
- Generate a knowledge graph
Monitoring Progress
- Real-time status updates: Track the transformation progress
- Resource metrics: Monitor processing time and resource usage
- Error handling: View any processing errors or warnings
Step 4: Quality Assessment and Review
After transformation, the system automatically performs quality validation on your extracted knowledge graph.Understanding Quality Metrics
The quality system provides:- Overall Score: 0-100 scale with letter grades (A-F)
- Violation Summary: Categorized by severity (Error, Warning, Info)
- Rule Types: Format rules, business rules, and completeness checks
- Detailed Reports: Entity-level and property-level validation results
Quality Rules Include:
- Format Rules: Data type validation, format compliance
- Business Rules: Domain-specific constraints and validations
- Completeness Rules: Required property validation, relationship verification
- Consistency Rules: Cross-entity validation and constraint checking
Quality Decision Points
Based on the quality results, you have several options: High Quality (Score ≥ 90, No Errors):- ✅ Approve and Proceed: Automatically move to merge process
- Quality is sufficient for production use
- ⚠️ Manual Review Required: Examine violations and make informed decision
- Consider acceptable risk vs. data value
- Add feedback comments for future improvements
- ❌ Reject and Retry: Fix source data or adjust ontology
- Address critical errors before proceeding
- Provide detailed feedback for system learning
Step 5: Provide Feedback and Iterate
The quality assessment stage allows you to provide valuable feedback to improve future processing.Feedback Options
- Approval with Comments: Explain why you’re accepting despite issues
- Rejection with Reasons: Detail what needs to be fixed
- Quality Suggestions: Recommend improvements for similar data
Retry Process
If you reject the quality results:- Address root causes: Fix source documents or adjust ontology
- Re-run transformation: Process the corrected data
- Review improved results: Check if quality issues are resolved
Step 6: Merge Process and Conflict Resolution
If quality is approved, the system proceeds to the Merge page to integrate your new data with existing production data.Automatic Conflict Detection
The system automatically detects potential conflicts:- Duplicate entities: Same entity with different properties
- Conflicting values: Different values for the same property
- Relationship conflicts: Inconsistent relationship definitions
- Schema mismatches: Entity structure differences
Human-in-the-Loop Resolution
When conflicts are detected, human involvement is required:Conflict Resolution Options
- Accept New Data: Use the newly extracted information
- Keep Existing: Maintain current production data
- Merge Values: Combine information from both sources
- Custom Resolution: Create a hybrid solution
Learning Comments
For each resolution, you can provide learning comments:- Explain why you chose a particular resolution
- Provide context for similar future conflicts
- Help the system learn your preferences
- Enable automated resolution of similar issues
Future Automation
Your learning comments enable the system to:- Automatically resolve similar conflicts in the future
- Apply your business rules consistently
- Reduce manual intervention over time
- Improve overall system intelligence
Step 7: Final Persistence and Versioning
After all conflicts are resolved, the final step persists your data to the production database.Data Persistence
- Clean Integration: Resolved data is stored in production Neo4j
- Relationship Creation: All entity relationships are properly linked
- Index Updates: Search indexes are updated for optimal performance
Versioning Strategy
Graphora implements a sophisticated versioning system:Version Management
- New Nodes: Completely new entities get version 1
- Updated Nodes: Modified entities get incremented versions
- Historical Preservation: Previous versions are retained, not deleted
- Version Marking: Old nodes are marked as previous versions
Benefits of Versioning
- Audit Trail: Complete history of all changes
- Rollback Capability: Ability to revert to previous versions
- Temporal Queries: Query data as it existed at specific points in time
- Change Tracking: Monitor how your knowledge graph evolves
Version Resolution Example
Step 8: Monitoring and Maintenance
After successful data integration, monitor your knowledge graph’s ongoing performance.Available Monitoring
- Quality Trends: Track quality improvements over time
- Conflict Patterns: Identify common conflict types
- Processing Metrics: Monitor transformation performance
- Data Growth: Track knowledge graph expansion
Best Practices
- Regular Quality Reviews: Periodically review quality thresholds
- Ontology Evolution: Update schemas as your domain evolves
- Feedback Analysis: Review learning comments for optimization opportunities
- Performance Monitoring: Watch for processing bottlenecks
Troubleshooting Common Issues
Configuration Problems
- Database Connection Failed: Verify URI, credentials, and network access
- API Key Invalid: Check Gemini API key format and permissions
- Permission Errors: Ensure database user has required permissions
Schema Issues
- Validation Errors: Review YAML syntax and property naming (use camelCase)
- Missing Relationships: Ensure all entity relationships are properly defined
- Quality Rule Conflicts: Check for contradictory validation rules
Processing Problems
- Transformation Stuck: Check document format and file size limits
- Low Quality Scores: Review source data quality and ontology alignment
- Memory Issues: Consider breaking large documents into smaller chunks
Merge Conflicts
- Too Many Conflicts: Review entity matching criteria and uniqueness rules
- Resolution Failures: Ensure learning comments are clear and actionable
- Performance Issues: Monitor conflict resolution processing time
Getting Help
If you encounter issues during any step:- Check Documentation: Review relevant sections of this guide
- Quality Reports: Use detailed quality violation reports for debugging
- System Logs: Monitor processing logs for error details
- Support Contact: Reach out to support@graphora.io for assistance
Next Steps
Once you’ve completed your first successful workflow:- Explore Advanced Features: Learn about bulk processing and automation
- API Integration: Consider using the GraphoraClient library for programmatic access
- Quality Optimization: Fine-tune your ontology and quality rules
- Scale Your Operations: Process larger document sets and complex ontologies
Python Client Library
Learn how to integrate Graphora programmatically using our Python client library.
This guide covers the complete end-to-end workflow for Graphora platform users. Each step builds on the previous one to ensure a smooth and successful knowledge graph creation experience.
