Medical GraphRAG Assistant

MIT License

IRIS_VECTOR_RAG_PAIN_POINTS.md•21.8 kB

# Pain Points: GraphRAG Migration to AWS IRIS **Date**: December 12, 2025 **Context**: Migrating existing local GraphRAG implementation to AWS IRIS **Goal**: Provide feedback to iris-vector-rag team for improving cloud deployments ## Executive Summary While the GraphRAG local implementation works well, migrating to AWS revealed several pain points related to hardcoded settings, namespace access, and configuration flexibility. These issues are documented below with recommendations for the iris-vector-rag team. --- ## Pain Point #1: Hardcoded Connection Settings ⚠️ HIGH PRIORITY ### Issue **File**: `src/setup/create_knowledge_graph_tables.py` **Lines**: 19-23 ```python # Hardcoded connection settings IRIS_HOST = 'localhost' IRIS_PORT = 32782 IRIS_NAMESPACE = 'DEMO' IRIS_USERNAME = '_SYSTEM' IRIS_PASSWORD = 'ISCDEMO' ``` ### Impact - **Cannot be used for AWS deployment without modification** - **Requires creating separate AWS-specific scripts** (as we did) - **Not suitable for CI/CD or containerized deployments** ### Recommendation Load configuration from: 1. YAML config file (preferred) 2. Environment variables (fallback) 3. Command-line arguments (override) ```python def load_db_config(): """Load database configuration from multiple sources.""" # Priority: CLI args > env vars > config file > defaults config = { 'host': os.getenv('IRIS_HOST', 'localhost'), 'port': int(os.getenv('IRIS_PORT', '32782')), 'namespace': os.getenv('IRIS_NAMESPACE', 'DEMO'), 'username': os.getenv('IRIS_USERNAME', '_SYSTEM'), 'password': os.getenv('IRIS_PASSWORD', 'ISCDEMO') } return config ``` ### User Impact **Severity**: HIGH - Forces users to modify source code for different environments - Creates maintenance burden (multiple script versions) - Error-prone (easy to commit passwords to git) --- ## Pain Point #2: Inflexible Vector Dimensions ⚠️ MEDIUM PRIORITY ### Issue **File**: `src/setup/create_knowledge_graph_tables.py` **Line**: 36 ```sql EmbeddingVector VECTOR(DOUBLE, 384) ``` ### Context - Local implementation uses SentenceTransformers (384-dim) - **NVIDIA NIM uses 1024-dim embeddings** - AWS deployment requires 1024-dim vectors - Hardcoded dimension prevents migration ### Impact - **Cannot use NVIDIA NIM embeddings** without modifying DDL - **Requires separate table creation scripts** for different models - **Makes model upgrades painful** ### Recommendation Make vector dimension configurable: ```python def create_entities_table(vector_dim: int = 384): """Create entities table with configurable vector dimension.""" ddl = f""" CREATE TABLE RAG.Entities ( ... EmbeddingVector VECTOR(DOUBLE, {vector_dim}), ... ) """ return ddl ``` ### User Impact **Severity**: MEDIUM - Blocks use of modern embedding models (>384 dims) - Requires manual DDL modification - Risk of dimension mismatches causing runtime errors --- ## Pain Point #3: Namespace Access Documentation ⚠️ MEDIUM PRIORITY ### Issue **Context**: AWS IRIS Community Edition namespace permissions ### Discovery After extensive debugging, found that: - ✅ `%SYS` namespace: Full access for _SYSTEM user - ❌ `DEMO` namespace: Access denied (requires additional setup) ### Impact - **Initial connection attempts failed** with cryptic "Access Denied" errors - **Required diagnostic script** to identify the issue - **Not documented** in any GraphRAG or iris-vector-rag docs ### What Worked ```python # Connect to %SYS namespace, use fully qualified table names conn = iris.connect(host, port, '%SYS', username, password) cursor.execute("CREATE TABLE SQLUser.Entities (...)") ``` ### Recommendation Add to iris-vector-rag documentation: 1. **Cloud Deployment Guide** with namespace considerations 2. **Troubleshooting section** for "Access Denied" errors 3. **Best practices** for table name qualification ### User Impact **Severity**: MEDIUM - High frustration factor (cryptic error messages) - Time-consuming debugging (took ~20 mins to diagnose) - Easy to give up and assume "doesn't work on cloud" --- ## Pain Point #4: Schema Name Flexibility ⚠️ LOW PRIORITY ### Issue **File**: `src/setup/create_knowledge_graph_tables.py` **Lines**: 30, 54 Hardcoded schema names: ```sql CREATE TABLE RAG.Entities (...) CREATE TABLE RAG.EntityRelationships (...) ``` ### Context - Local: Can use `RAG` schema - AWS: Must use `SQLUser` schema (where IRIS SQL tables are created) - Different IRIS installations may have different schemas ### Impact - **Requires modifying DDL** for AWS deployment - **Not portable** across IRIS installations - **Assumes RAG schema exists** ### Recommendation Make schema configurable: ```python def create_tables(schema_name: str = "RAG"): """Create knowledge graph tables in specified schema.""" entities_table = f"{schema_name}.Entities" relationships_table = f"{schema_name}.EntityRelationships" # Use table names in DDL ``` ### User Impact **Severity**: LOW - Minor inconvenience for most users - Critical for cloud deployments - Workaround exists (manually edit DDL) --- ## Pain Point #5: No Data Migration Guide ⚠️ MEDIUM PRIORITY ### Issue **Missing**: Documentation for migrating existing FHIR data to AWS ### Context - Local FHIR server has 2,739 resources, 51 DocumentReferences - AWS IRIS has no FHIR data (empty ClinicalNoteVectors table) - No clear path to migrate data ### Impact - **Cannot test end-to-end GraphRAG** on AWS without data - **No guide for data export/import** workflow - **Unclear how to sync** local → AWS ### Recommendation Add documentation for: 1. **Exporting FHIR resources** from local IRIS 2. **Importing to AWS IRIS** via IRIS replication or SQL 3. **Incremental sync** strategies 4. **Data validation** after migration ### User Impact **Severity**: MEDIUM - Blocks full AWS validation - Forces manual data transfer - Risk of data inconsistencies --- ## Pain Point #6: No Config File Support in Main Scripts ⚠️ HIGH PRIORITY ### Issue **File**: `src/setup/fhir_graphrag_setup.py` The main setup script loads config but **sub-scripts don't accept config path**: - `create_knowledge_graph_tables.py` - hardcoded settings - Other utility scripts likely have same issue ### Impact - **Inconsistent configuration** across scripts - **Cannot orchestrate** multi-script workflows easily - **Each script needs separate modification** for AWS ### Recommendation All scripts should accept `--config` argument: ```python parser = argparse.ArgumentParser() parser.add_argument('--config', default='config/fhir_graphrag_config.yaml') parser.add_argument('--mode', choices=['init', 'build', 'stats']) ``` ### User Impact **Severity**: HIGH - Frustrating developer experience - Error-prone (forgetting to update one script) - Makes automation difficult --- ## Pain Point #7: IRISVectorDBClient Integration ⚠️ LOW PRIORITY ### Issue **Observation**: GraphRAG scripts use direct SQL instead of IRISVectorDBClient ### Context We just validated that `IRISVectorDBClient` provides: - Clean Python API (no manual SQL) - Automatic TO_VECTOR handling - Dimension validation - Works on local and AWS ### Impact - **Missed opportunity** for cleaner code - **Manual SQL** is error-prone - **Harder to maintain** with direct SQL strings ### Recommendation Consider refactoring to use IRISVectorDBClient: ```python from src.vectorization.vector_db_client import IRISVectorDBClient client = IRISVectorDBClient( host=config['host'], namespace=config['namespace'], vector_dimension=config['dimension'] ) with client: client.insert_vector(embedding=entity_embedding, ...) ``` ### User Impact **Severity**: LOW - Current approach works - Would improve code quality - Not blocking for production use --- ## Summary of Recommendations ### High Priority (Blocks AWS Deployment) 1. ✅ **Add config file support** to all scripts 2. ✅ **Remove hardcoded connection settings** 3. ✅ **Make vector dimensions configurable** ### Medium Priority (Improves UX) 4. ✅ **Document namespace access** for cloud deployments 5. ✅ **Add data migration guide** 6. ✅ **Make schema names configurable** ### Low Priority (Nice to Have) 7. ✅ **Consider IRISVectorDBClient integration** 8. ✅ **Add cloud deployment examples** --- ## Workarounds Implemented For this AWS migration, we implemented: ### 1. AWS-Specific Table Creation Script **File**: `src/setup/create_knowledge_graph_tables_aws.py` - Reads config from YAML file - Uses %SYS namespace - Uses SQLUser schema - Supports 1024-dim vectors ### 2. Updated AWS Configuration **File**: `config/fhir_graphrag_config.aws.yaml` - Namespace: `%SYS` (not DEMO) - Tables: `SQLUser.Entities`, `SQLUser.EntityRelationships` - Vector dimension: 1024 ### 3. Diagnostic Script **File**: `scripts/aws/diagnose-iris-connection.sh` - Tests namespace access - Validates connection formats - Provides troubleshooting guidance --- ## Pain Point #8: Table Name Qualification Inconsistency ⚠️ MEDIUM PRIORITY ### Issue **File**: `src/setup/fhir_graphrag_setup.py` **Line**: 423 ```python self.cursor.execute("SELECT MAX(ExtractedAt) FROM RAG.Entities") ``` ### Context - Script uses hardcoded `RAG.Entities` table reference - AWS uses `SQLUser.Entities` - This would fail on AWS in incremental sync mode ### Impact - **Incremental sync won't work on AWS** without modification - **Table references scattered** throughout code - **No central configuration** for table names ### Recommendation Load table names from configuration: ```python entities_table = config['knowledge_graph']['entities_table'] self.cursor.execute(f"SELECT MAX(ExtractedAt) FROM {entities_table}") ``` ### User Impact **Severity**: MEDIUM - Breaks incremental sync on AWS - Requires code changes for cloud deployment - Easy to miss (doesn't fail until runtime) --- ## Pain Point #9: init_tables() Doesn't Respect Config ⚠️ HIGH PRIORITY ### Issue **File**: `src/setup/fhir_graphrag_setup.py` **Line**: 133 ```python def init_tables(self): from src.setup.create_knowledge_graph_tables import create_tables success = create_tables() # ❌ Ignores config! ``` ### Context - Main script supports `--config` argument - But `init_tables()` calls hardcoded version - Creates tables with localhost settings, not AWS ### Impact - **Cannot use `--mode=init` with AWS config** - **Forced to use separate AWS script** (which we created) - **Inconsistent behavior** between init and build modes ### Recommendation Pass config to table creation: ```python def init_tables(self): from src.setup.create_knowledge_graph_tables import create_tables # Pass connection from config, or config itself success = create_tables( host=self.config['database']['iris']['host'], port=self.config['database']['iris']['port'], # ... other config ) ``` ### User Impact **Severity**: HIGH - Forces workarounds for cloud deployment - Wastes developer time creating duplicate scripts - Poor user experience --- ## Test Results ✅ **Successfully Tested**: 1. ✅ AWS IRIS connection via %SYS namespace 2. ✅ Knowledge graph tables created (SQLUser schema) 3. ✅ IRISVectorDBClient validated with AWS 4. ✅ Table structure verified (1024-dim vectors) 5. ✅ Configuration file loading 6. ✅ Connection string handling ❌ **Blocked (No Data on AWS)**: 1. ❌ Entity extraction (requires FHIR DocumentReference resources) 2. ❌ End-to-end GraphRAG query (needs data + entities) 3. ❌ Incremental sync (hardcoded table names would fail) ⚠️ **Not Tested (Out of Scope)**: - NVIDIA NIM LLM integration (LLM not deployed) - GPU-accelerated embeddings (no data to embed) - Multi-modal query fusion (no entities extracted) --- ## Migration Workflow Tested ### What Worked 1. ✅ Created AWS-specific table creation script 2. ✅ Updated AWS configuration with correct namespace 3. ✅ Successfully created knowledge graph tables on AWS 4. ✅ Verified table accessibility from local machine 5. ✅ Documented pain points as encountered ### Time Investment - **Configuration updates**: 10 minutes - **AWS script creation**: 20 minutes - **Table creation and verification**: 5 minutes - **Pain point documentation**: 30 minutes - **Total**: ~65 minutes ### What Would Have Been Faster With proper iris-vector-rag support: - **Load config from file**: Would save 20 mins (no custom script needed) - **Configurable dimensions**: Would save debugging time - **Better namespace docs**: Would save 20 mins (diagnostic time) - **Estimated time savings**: 40+ minutes (>60% faster) --- ## Feedback Priority Matrix ### Critical (Must Fix for Cloud) 1. 🔴 **Hardcoded connection settings** - Blocks all cloud deployments 2. 🔴 **init_tables() ignores config** - Forces workarounds 3. 🔴 **Inflexible vector dimensions** - Blocks modern embeddings ### Important (Improves UX) 4. 🟡 **Namespace access documentation** - Frustrating to debug 5. 🟡 **Table name inconsistency** - Breaks incremental sync 6. 🟡 **No data migration guide** - Blocks full testing 7. 🟡 **Schema name flexibility** - Cloud portability issue ### Nice to Have (Future) 8. 🟢 **IRISVectorDBClient integration** - Code quality 9. 🟢 **Cloud deployment examples** - Better docs --- ## Positive Findings ### What Worked Well ✅ 1. **YAML Configuration Structure** - Well-organized and comprehensive - Easy to understand and customize - Good comments and documentation 2. **Table Schema Design** - Proper use of VECTOR type - Good index selection - Clear column names 3. **Command-Line Interface** - `--mode` argument is intuitive - `--config` support exists (mostly) - Good help messages 4. **Code Organization** - Clean separation of concerns - Adapters pattern is good - Extractor abstraction is solid 5. **Local Implementation** - Works perfectly for local development - Good performance - Reliable entity extraction --- ## Recommendations for iris-vector-rag Team ### Short Term (Next Release) 1. **Add Environment Variable Support** - `IRIS_HOST`, `IRIS_PORT`, `IRIS_NAMESPACE`, etc. - Makes container deployments easy - Minimal code changes required 2. **Fix init_tables() Config Handling** - Pass config to create_tables() - Critical for `--mode=init` to work with cloud 3. **Update Documentation** - Add "Cloud Deployment" section - Document namespace access issues - Provide AWS/Azure examples ### Medium Term (Next Quarter) 4. **Configurable Vector Dimensions** - Load from config file - Support 384, 768, 1024, 1536, 4096 - Critical for NVIDIA NIM, OpenAI, etc. 5. **Schema Name Configuration** - Replace hardcoded `RAG.*` references - Load from config - Support SQLUser, custom schemas 6. **Data Migration Guide** - Document export/import workflows - Provide utility scripts - Cover sync strategies ### Long Term (Future) 7. **IRISVectorDBClient Integration** - Replace manual SQL with client - Cleaner, more maintainable code - Better error handling 8. **Cloud Monitoring** - CloudWatch integration - Performance metrics - Health checks --- ## Sample Configuration for AWS For reference, here's what worked for AWS deployment: ```yaml database: iris: host: "3.84.250.46" port: 1972 namespace: "%SYS" # ⚠️ Use %SYS, not DEMO username: "_SYSTEM" password: "SYS" knowledge_graph: entities_table: "SQLUser.Entities" # ⚠️ Use SQLUser schema relationships_table: "SQLUser.EntityRelationships" vector_storage: table_name: "SQLUser.ClinicalNoteVectors" dimension: 1024 # ⚠️ NVIDIA NIM uses 1024-dim embeddings: model: "nvidia/nv-embedqa-e5-v5" dimension: 1024 device: "cuda" ``` --- ## Conclusion The GraphRAG implementation is **solid for local development** but needs **configuration flexibility** for cloud deployments. The main pain points are: 1. **Hardcoded settings** prevent cloud use 2. **Inconsistent config handling** across scripts 3. **Fixed vector dimensions** block modern embeddings With these changes, iris-vector-rag would be **production-ready for cloud deployments** and significantly easier to use in diverse environments. **Overall Assessment**: 🟡 Good local implementation, needs cloud polish --- **Status**: ✅ Testing and documentation complete **Date**: December 12, 2025 **Tested By**: AWS EC2 g5.xlarge migration project **Next**: Share with iris-vector-rag team for review --- ## 🎉 UPDATE: December 12, 2025 - iris-vector-rag 0.5.2 Improvements ### Major Pain Points NOW RESOLVED! After reinstalling iris-vector-rag 0.5.2, we're pleased to report that **the iris-vector-rag team has addressed our critical pain points!** ### ✅ RESOLVED Pain Points #### 🎉 #1: Hardcoded Connection Settings → NOW FIXED **Status**: ✅ **RESOLVED** in iris-vector-rag 0.5.2 The package now includes comprehensive **environment variable support**: ```bash # Set via environment variables export RAG_DATABASE__IRIS__HOST="3.84.250.46" export RAG_DATABASE__IRIS__PORT="1972" export RAG_DATABASE__IRIS__NAMESPACE="%SYS" export RAG_DATABASE__IRIS__USERNAME="_SYSTEM" export RAG_DATABASE__IRIS__PASSWORD="SYS" ``` **Implementation**: - ConfigurationManager with RAG_ prefix - Double underscore (__) for nested keys - Automatic type casting - Config file + env var override support **Impact**: Cloud deployments no longer require code modification! 🎉 --- #### 🎉 #2: Inflexible Vector Dimensions → NOW FIXED **Status**: ✅ **RESOLVED** in iris-vector-rag 0.5.2 Vector dimensions are now **fully configurable**: ```yaml storage: iris: vector_dimension: 1024 # Or 384, 768, 1536, 3072, 4096, etc. ``` ```bash # Or via environment variable export RAG_STORAGE__IRIS__VECTOR_DIMENSION=1024 ``` **Supports**: - 384 (SentenceTransformers) - 1024 (NVIDIA NIM) - 1536 (OpenAI ada-002) - 3072+ (Any modern embedding model) **Impact**: Can now use NVIDIA NIM, OpenAI, and any embedding model! 🎉 --- #### 🎉 #6: No Config File Support → NOW FIXED **Status**: ✅ **RESOLVED** in iris-vector-rag 0.5.2 Comprehensive ConfigurationManager implementation: ```python from iris_vector_rag.config.manager import ConfigurationManager # Load from config file config = ConfigurationManager(config_path="config/aws_config.yaml") # Environment variables automatically override config file host = config.get("database:iris:host") ``` **Features**: - YAML config loading - Environment variable overrides - Nested key support with colon notation - Type casting (string → int/float/bool) - Configuration validation - Default fallback values **Impact**: Production-ready configuration management! 🎉 --- ### 🟡 PARTIALLY RESOLVED Pain Points #### #3: Namespace Access Documentation **Status**: 🟡 Configuration system supports it, documentation needs cloud examples **What Works**: ```bash export RAG_DATABASE__IRIS__NAMESPACE="%SYS" ``` **Still Needed**: Cloud deployment guide with namespace troubleshooting --- #### #4: Schema Name Flexibility **Status**: 🟡 Configurable via config, needs testing with fully qualified names **What Works**: ```yaml storage: iris: table_name: "SQLUser.ClinicalNoteVectors" ``` **Needs Testing**: Verify SchemaManager handles schema-qualified names correctly --- ### ❌ STILL PENDING Pain Points #### #5: No Data Migration Guide **Status**: ❌ No built-in utilities found **Recommendation**: Community contribution opportunity --- ## 🎊 NEW FEATURES DISCOVERED ### 1. IRIS EMBEDDING Support (Feature 051) Native IRIS auto-vectorization with model caching ### 2. HNSW Index Configuration Fine-tune vector search performance ### 3. Schema Manager Automatic table creation with correct vector dimensions ### 4. Entity Extraction Pipeline Built-in medical entity extraction (DRUG, DISEASE, SYMPTOM, etc.) --- ## Updated Priority Matrix | Pain Point | Was | Now | Impact | |-----------|-----|-----|---------| | Hardcoded Settings | 🔴 CRITICAL | ✅ **RESOLVED** | Cloud ready! | | Vector Dimensions | 🔴 CRITICAL | ✅ **RESOLVED** | Modern embeddings! | | Config Manager | 🔴 HIGH | ✅ **RESOLVED** | Production ready! | | Namespace Docs | 🟡 IMPORTANT | 🟡 Partial | Config supports | | Table Names | 🟡 IMPORTANT | 🟡 Partial | Needs testing | | Data Migration | 🟡 IMPORTANT | ❌ Pending | Future work | **Overall**: 🎉 **3 of 6 issues RESOLVED, 2 partially addressed** --- ## Next Steps ### Immediate Testing Required 1. ✅ Test iris-vector-rag with AWS IRIS (%SYS namespace) 2. ✅ Verify 1024-dimensional vectors work 3. ✅ Test IRISVectorStore with SQLUser.* tables 4. ✅ Validate environment variable overrides ### Migration Considerations 1. **Replace IRISVectorDBClient** with iris-vector-rag IRISVectorStore 2. **Update configs** to use RAG_ environment variable conventions 3. **Test GraphRAG** with new configuration system 4. **Contribute docs** for AWS deployment patterns --- ## Conclusion (Updated) **Original Assessment**: 🟡 Good local implementation, needs cloud polish **Updated Assessment**: 🎉 **EXCELLENT! Cloud-ready with major improvements** The iris-vector-rag team has delivered on the critical pain points. With environment variable support and configurable dimensions, **the package is now production-ready for cloud deployments**. **Recommendation**: **Adopt iris-vector-rag for AWS deployment** and contribute cloud deployment documentation back to the project. --- **Document Status**: ✅ Updated with iris-vector-rag 0.5.2 improvements **Date Updated**: December 12, 2025 **Overall Result**: 🎉 **Major pain points resolved!**

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/isc-tdyar/medical-graphrag-assistant'

If you have feedback or need assistance with the MCP directory API, please join our Discord server