Spaces: ybchen928/oncall-guide-ai

YanBoChen committed on Jul 31

Commit

9e8cbc8

1 Parent(s): 5888ce4

feat and fix: update medical advice extraction logic to prioritize raw response

Files changed (3)

README.md +264 -0
app.py +517 -0
src/generation.py +3 -3

README.md ADDED

	@@ -0,0 +1,264 @@

+# OnCall.ai - Medical Emergency Assistant
+A RAG-based medical assistant system that provides evidence-based clinical guidance for emergency medical situations using real medical guidelines and advanced language models.
+## 🎯 Project Overview
+OnCall.ai helps healthcare professionals by:
+- Processing medical queries through multi-level validation
+- Retrieving relevant medical guidelines from curated datasets
+- Generating evidence-based clinical advice using specialized medical LLMs
+- Providing transparent, traceable medical guidance
+## ✅ Current Implementation Status
+### **🎉 COMPLETED MODULES (2025-07-31)**
+#### **1. Multi-Level Query Processing System**
+- ✅ **UserPromptProcessor** (`src/user_prompt.py`)
+  - Level 1: Predefined medical condition mapping (instant response)
+  - Level 2: LLM-based condition extraction (Llama3-Med42-70B)
+  - Level 3: Semantic search fallback
+  - Level 4: Medical query validation (100% non-medical rejection)
+  - Level 5: Generic medical search for rare conditions
+#### **2. Dual-Index Retrieval System**
+- ✅ **BasicRetrievalSystem** (`src/retrieval.py`)
+  - Emergency medical guidelines index (emergency.ann)
+  - Treatment protocols index (treatment.ann)
+  - Vector-based similarity search using PubMedBERT embeddings
+  - Intelligent deduplication and result ranking
+#### **3. Medical Knowledge Base**
+- ✅ **MedicalConditions** (`src/medical_conditions.py`)
+  - Predefined condition-keyword mappings
+  - Medical terminology validation
+  - Extensible condition database
+#### **4. LLM Integration**
+- ✅ **Med42-70B Client** (`src/llm_clients.py`)
+  - Specialized medical language model integration
+  - Dual-layer rejection detection for non-medical queries
+  - Robust error handling and timeout management
+#### **5. Medical Advice Generation**
+- ✅ **MedicalAdviceGenerator** (`src/generation.py`)
+  - RAG-based prompt construction
+  - Intention-aware chunk selection (treatment/diagnosis)
+  - Confidence scoring and response formatting
+  - Integration with Med42-70B for clinical advice generation
+#### **6. Data Processing Pipeline**
+- ✅ **Processed Medical Guidelines** (`src/data_processing.py`)
+  - ~4000 medical guidelines from EPFL-LLM dataset
+  - Emergency subset: ~2000-2500 records
+  - Treatment subset: ~2000-2500 records
+  - PubMedBERT embeddings (768 dimensions)
+  - ANNOY vector indices for fast retrieval
+## 📊 **System Performance (Validated)**
+### **Test Results Summary**
+```
+🎯 Multi-Level Fallback Validation: 69.2% success rate
+   - Level 1 (Predefined): 100% success (instant response)
+   - Level 4a (Non-medical rejection): 100% success
+   - Level 4b→5 (Rare medical): 100% success
+📈 End-to-End Pipeline: 100% technical completion
+   - Condition extraction: 2.6s average
+   - Medical guideline retrieval: 0.3s average
+   - Total pipeline: 15.5s average (including generation)
+```
+### **Quality Metrics**
+```
+🔍 Retrieval Performance:
+   - Guidelines retrieved: 8-9 per query
+   - Relevance scores: 0.245-0.326 (good for medical domain)
+   - Emergency/Treatment balance: Correctly maintained
+🧠 Generation Quality:
+   - Confidence scores: 0.90 for successful generations
+   - Evidence-based responses with specific guideline references
+   - Appropriate medical caution and clinical judgment emphasis
+```
+## 🛠️ **Technical Architecture**
+### **Data Flow**
+```
+User Query → Multi-Level Processing → Dual-Index Retrieval → RAG Generation
+     ↓              ↓                      ↓                    ↓
+  Validation    Condition Mapping    Guidelines Search    Medical Advice
+```
+### **Core Technologies**
+- **Embeddings**: NeuML/pubmedbert-base-embeddings (768D)
+- **Vector Search**: ANNOY indices with angular distance
+- **LLM**: m42-health/Llama3-Med42-70B (medical specialist)
+- **Dataset**: EPFL-LLM medical guidelines (~4000 documents)
+### **Fallback Mechanism**
+```
+Level 1: Predefined Mapping (0.001s) → Success: Direct return
+Level 2: LLM Extraction (8-15s) → Success: Condition mapping
+Level 3: Semantic Search (1-2s) → Success: Sliding window chunks
+Level 4: Medical Validation (8-10s) → Fail: Return rejection
+Level 5: Generic Search (1s) → Final: General medical guidance
+```
+## 🚀 **NEXT PHASE: Interactive Interface**
+### **🎯 Immediate Goals (Next 1-2 Days)**
+#### **Phase 1: Gradio Interface Development**
+- [ ] **Create `app.py`** - Interactive web interface
+  - [ ] Complete pipeline integration
+  - [ ] Multi-output display (advice + guidelines + technical details)
+  - [ ] Environment-controlled debug mode
+  - [ ] User-friendly error handling
+#### **Phase 2: Local Validation Testing**
+- [ ] **Manual testing** with 20-30 realistic medical queries
+  - [ ] Emergency scenarios (cardiac arrest, stroke, MI)
+  - [ ] Diagnostic queries (chest pain, respiratory distress)
+  - [ ] Treatment protocols (medication management, procedures)
+  - [ ] Edge cases (rare conditions, complex symptoms)
+#### **Phase 3: HuggingFace Spaces Deployment**
+- [ ] **Create requirements.txt** for deployment
+- [ ] **Deploy to HF Spaces** for public testing
+- [ ] **Production mode configuration** (limited technical details)
+- [ ] **Performance monitoring** and user feedback collection
+### **🔮 Future Enhancements (Next 1-2 Weeks)**
+#### **Audio Input Integration**
+- [ ] **Whisper ASR integration** for voice queries
+- [ ] **Audio preprocessing** and quality validation
+- [ ] **Multi-modal interface** (text + audio input)
+#### **Evaluation & Metrics**
+- [ ] **Faithfulness scoring** implementation
+- [ ] **Automated evaluation pipeline**
+- [ ] **Clinical validation** with medical professionals
+- [ ] **Performance benchmarking** against target metrics
+#### **Dataset Expansion (Future)**
+- [ ] **Dataset B integration** (symptom/diagnosis subsets)
+- [ ] **Multi-dataset RAG** architecture
+- [ ] **Enhanced medical knowledge** coverage
+## 📋 **Target Performance Metrics**
+### **Response Quality**
+- [ ] Physician satisfaction: ≥ 4/5
+- [ ] RAG content coverage: ≥ 80%
+- [ ] Retrieval precision (P@5): ≥ 0.7
+- [ ] Medical advice faithfulness: ≥ 0.8
+### **System Performance**
+- [ ] Total response latency: ≤ 30 seconds
+- [ ] Condition extraction: ≤ 5 seconds
+- [ ] Guideline retrieval: ≤ 2 seconds
+- [ ] Medical advice generation: ≤ 25 seconds
+### **User Experience**
+- [ ] Non-medical query rejection: 100%
+- [ ] System availability: ≥ 99%
+- [ ] Error handling: Graceful degradation
+- [ ] Interface responsiveness: Immediate feedback
+## 🏗️ **Project Structure**
+```
+OnCall.ai/
+├── src/                          # Core modules (✅ Complete)
+│   ├── user_prompt.py           # Multi-level query processing
+│   ├── retrieval.py             # Dual-index vector search
+│   ├── generation.py            # RAG-based advice generation
+│   ├── llm_clients.py           # Med42-70B integration
+│   ├── medical_conditions.py    # Medical knowledge configuration
+│   └── data_processing.py       # Dataset preprocessing
+├── models/                       # Pre-processed data (✅ Complete)
+│   ├── embeddings/              # Vector embeddings and chunks
+│   └── indices/                 # ANNOY vector indices
+├── tests/                        # Validation tests (✅ Complete)
+│   ├── test_multilevel_fallback_validation.py
+│   ├── test_end_to_end_pipeline.py
+│   └── test_userinput_userprompt_medical_*.py
+├── docs/                         # Documentation and planning
+│   ├── next/                    # Current implementation docs
+│   └── next_gradio_evaluation/  # Interface planning
+├── app.py                        # 🎯 NEXT: Gradio interface
+├── requirements.txt              # 🎯 NEXT: Deployment dependencies
+└── README.md                     # This file
+```
+## 🧪 **Testing Validation**
+### **Completed Tests**
+- ✅ **Multi-level fallback validation**: 13 test cases, 69.2% success (9 of 13)
+- ✅ **End-to-end pipeline testing**: 6 scenarios, 100% technical completion
+- ✅ **Component integration**: All modules working together
+- ✅ **Error handling**: Graceful degradation and user-friendly messages
+### **Key Findings**
+- **Predefined mapping**: Instant response for known conditions
+- **LLM extraction**: Reliable for complex symptom descriptions
+- **Non-medical rejection**: 100% rejection in validation tests after prompt updates
+- **Retrieval quality**: High-relevance medical guidelines (0.2-0.4 relevance scores)
+- **Generation capability**: Evidence-based advice with proper medical caution
+## 🤝 **Contributing & Development**
+### **Environment Setup**
+```bash
+# Clone repository
+git clone [repository-url]
+cd OnCall.ai
+# Setup virtual environment
+python -m venv genAIvenv
+source genAIvenv/bin/activate  # On Windows: genAIvenv\Scripts\activate
+# Install dependencies
+pip install -r requirements.txt
+# Run tests
+python tests/test_end_to_end_pipeline.py
+# Start Gradio interface (coming soon)
+python app.py
+```
+### **API Configuration**
+```bash
+# Set up HuggingFace token for LLM access
+export HF_TOKEN=your_huggingface_token
+# Enable debug mode for development
+export ONCALL_DEBUG=true
+```
+## ⚠️ **Important Notes**
+### **Medical Disclaimer**
+This system is designed for **research and educational purposes only**. It should not replace professional medical consultation, diagnosis, or treatment. Always consult qualified healthcare providers for medical decisions.
+### **Current Limitations**
+- **API Dependencies**: Requires HuggingFace API access for LLM functionality
+- **Dataset Scope**: Currently focused on emergency and treatment guidelines
+- **Language Support**: English medical terminology only
+- **Validation Stage**: System under active development and testing
+## 📞 **Contact & Support**
+**Development Team**: OnCall.ai Team
+**Last Updated**: 2025-07-31
+**Version**: 0.9.0 (Pre-release)
+**Status**: 🚧 Ready for Interactive Testing Phase
+---
+*Built with ❤️ for healthcare professionals*
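
Reviewer sketch (not part of the commit): the "Fallback Mechanism" table in the README describes a chain in which each level either resolves the query or hands it to the next one. The block below is a minimal illustration of that control flow only. Every name in it (`run_fallback_chain`, `predefined_mapping`, `reject_non_medical`, `generic_search`, the toy `PREDEFINED` mapping and the keyword hints) is hypothetical and stands in for the real logic in `src/user_prompt.py`, which this diff does not show.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A level returns a result dict when it can settle the query, or None to
# pass the query on to the next level. This contract is an assumption.
Level = Callable[[str], Optional[Dict]]

PREDEFINED = {"acute stroke": {"condition": "acute stroke"}}  # toy mapping


def predefined_mapping(query: str) -> Optional[Dict]:
    """Level 1 stand-in: substring match against known conditions."""
    q = query.lower()
    for condition, result in PREDEFINED.items():
        if condition in q:
            return dict(result)
    return None


def reject_non_medical(query: str) -> Optional[Dict]:
    """Level 4 stand-in: keyword hints replace the LLM-based validation."""
    medical_hints = ("pain", "fever", "patient")
    if not any(hint in query.lower() for hint in medical_hints):
        return {"type": "invalid_query"}
    return None


def generic_search(query: str) -> Optional[Dict]:
    """Level 5 stand-in: always yields a generic result."""
    return {"condition": "generic medical query"}


def run_fallback_chain(query: str, levels: List[Tuple[str, Level]]) -> Dict:
    """Try each level in order and return the first non-None result."""
    for name, level in levels:
        result = level(query)
        if result is not None:
            return {"level": name, **result}
    return {"level": "none", "type": "invalid_query"}


LEVELS = [
    ("predefined_mapping", predefined_mapping),
    ("medical_validation", reject_non_medical),
    ("generic_search", generic_search),
]
```

Calling `run_fallback_chain("Emergency protocols for acute stroke", LEVELS)` stops at the first level, while a query with no medical hints is rejected before the generic search runs. Levels 2 and 3 are omitted here for brevity.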

app.py ADDED

	@@ -0,0 +1,517 @@

+#!/usr/bin/env python3
+"""
+OnCall.ai - Interactive Medical Emergency Assistant
+A Gradio-based web interface for the OnCall.ai medical query processing system.
+Provides real-time medical guidance based on evidence from medical guidelines.
+Features:
+- Complete pipeline: Query → Condition Extraction → Retrieval → Generation
+- Multi-level fallback validation system
+- Evidence-based medical advice with source attribution
+- Environment-controlled debug mode
+- Audio input ready (future enhancement)
+Author: OnCall.ai Team
+Date: 2025-07-31
+Version: 0.9.0
+"""
+import os
+import sys
+import gradio as gr
+import json
+import traceback
+from datetime import datetime
+from typing import Dict, List, Any, Tuple, Optional
+from pathlib import Path
+# Add src directory to Python path
+current_dir = Path(__file__).parent
+src_dir = current_dir / "src"
+sys.path.insert(0, str(src_dir))
+# Import OnCall.ai modules
+try:
+    from user_prompt import UserPromptProcessor
+    from retrieval import BasicRetrievalSystem
+    from llm_clients import llm_Med42_70BClient
+    from generation import MedicalAdviceGenerator
+    from medical_conditions import CONDITION_KEYWORD_MAPPING
+except ImportError as e:
+    print(f"❌ Failed to import OnCall.ai modules: {e}")
+    print("Please ensure you're running from the project root directory")
+    sys.exit(1)
+# Configuration
+DEBUG_MODE = os.getenv('ONCALL_DEBUG', 'false').lower() == 'true'
+print(f"🔧 Debug mode: {'ON' if DEBUG_MODE else 'OFF'}")
+class OnCallAIInterface:
+    """
+    Main interface class for OnCall.ai Gradio application
+    """
+    def __init__(self):
+        """Initialize the complete OnCall.ai pipeline"""
+        self.initialized = False
+        self.initialization_error = None
+        # Pipeline components
+        self.llm_client = None
+        self.retrieval_system = None
+        self.user_prompt_processor = None
+        self.medical_generator = None
+        # Initialize pipeline
+        self._initialize_pipeline()
+    def _initialize_pipeline(self):
+        """Initialize all pipeline components with error handling"""
+        try:
+            print("🔧 Initializing OnCall.ai Pipeline...")
+            # Initialize LLM client
+            print("  1. Loading Med42-70B client...")
+            self.llm_client = llm_Med42_70BClient()
+            # Initialize retrieval system
+            print("  2. Loading medical guidelines indices...")
+            self.retrieval_system = BasicRetrievalSystem()
+            # Initialize user prompt processor
+            print("  3. Setting up multi-level query processor...")
+            self.user_prompt_processor = UserPromptProcessor(
+                llm_client=self.llm_client,
+                retrieval_system=self.retrieval_system
+            )
+            # Initialize medical advice generator
+            print("  4. Preparing medical advice generator...")
+            self.medical_generator = MedicalAdviceGenerator(
+                llm_client=self.llm_client
+            )
+            self.initialized = True
+            print("✅ OnCall.ai pipeline initialized successfully!")
+        except Exception as e:
+            self.initialization_error = str(e)
+            print(f"❌ Pipeline initialization failed: {e}")
+            print(f"Traceback: {traceback.format_exc()}")
+    def process_medical_query(self, user_query: str, intention_override: Optional[str] = None) -> Tuple[str, str, str, str]:
+        """
+        Complete medical query processing pipeline
+        Args:
+            user_query: User's medical query
+            intention_override: Optional intention override for testing
+        Returns:
+            Tuple of (medical_advice, processing_steps, retrieved_guidelines, technical_details)
+        """
+        if not self.initialized:
+            error_msg = f"❌ System not initialized: {self.initialization_error}"
+            return error_msg, error_msg, "{}", "{}"
+        if not user_query or not user_query.strip():
+            return "Please enter a medical query to get started.", "", "{}", "{}"
+        processing_start = datetime.now()
+        processing_steps = []
+        technical_details = {}
+        try:
+            # STEP 1: Query Processing and Condition Extraction
+            processing_steps.append("🎯 Step 1: Processing medical query and extracting conditions...")
+            step1_start = datetime.now()
+            condition_result = self.user_prompt_processor.extract_condition_keywords(user_query)
+            step1_time = (datetime.now() - step1_start).total_seconds()
+            processing_steps.append(f"   ✅ Condition: {condition_result.get('condition', 'None')}")
+            processing_steps.append(f"   📋 Emergency Keywords: {condition_result.get('emergency_keywords', 'None')}")
+            processing_steps.append(f"   💊 Treatment Keywords: {condition_result.get('treatment_keywords', 'None')}")
+            processing_steps.append(f"   ⏱️ Processing Time: {step1_time:.3f}s")
+            # Handle non-medical queries
+            if condition_result.get('type') == 'invalid_query':
+                non_medical_msg = condition_result.get('message', 'This appears to be a non-medical query.')
+                processing_steps.append("   🚫 Query identified as non-medical")
+                return non_medical_msg, '\n'.join(processing_steps), "{}", "{}"
+            # STEP 2: User Confirmation (Auto-simulated)
+            processing_steps.append("\n🤝 Step 2: User confirmation (auto-confirmed for demo)")
+            confirmation = self.user_prompt_processor.handle_user_confirmation(condition_result)
+            if not condition_result.get('condition'):
+                no_condition_msg = "Unable to identify a specific medical condition. Please rephrase your query with more specific medical terms."
+                processing_steps.append("   ⚠️ No medical condition identified")
+                return no_condition_msg, '\n'.join(processing_steps), "{}", "{}"
+            processing_steps.append(f"   ✅ Confirmed condition: {condition_result.get('condition')}")
+            # STEP 3: Medical Guidelines Retrieval
+            processing_steps.append("\n🔍 Step 3: Retrieving relevant medical guidelines...")
+            step3_start = datetime.now()
+            # Construct search query
+            search_query = f"{condition_result.get('emergency_keywords', '')} {condition_result.get('treatment_keywords', '')}".strip()
+            if not search_query:
+                search_query = condition_result.get('condition', user_query)
+            retrieval_results = self.retrieval_system.search(search_query, top_k=5)
+            step3_time = (datetime.now() - step3_start).total_seconds()
+            processed_results = retrieval_results.get('processed_results', [])
+            emergency_count = len([r for r in processed_results if r.get('type') == 'emergency'])
+            treatment_count = len([r for r in processed_results if r.get('type') == 'treatment'])
+            processing_steps.append(f"   📊 Found {len(processed_results)} relevant guidelines")
+            processing_steps.append(f"   🚨 Emergency guidelines: {emergency_count}")
+            processing_steps.append(f"   💊 Treatment guidelines: {treatment_count}")
+            processing_steps.append(f"   ⏱️ Retrieval time: {step3_time:.3f}s")
+            # Format retrieved guidelines for display
+            guidelines_display = self._format_guidelines_display(processed_results)
+            # STEP 4: Medical Advice Generation
+            processing_steps.append("\n🧠 Step 4: Generating evidence-based medical advice...")
+            step4_start = datetime.now()
+            # Determine intention (use override if provided, otherwise detect)
+            intention = intention_override or self._detect_query_intention(user_query)
+            medical_advice_result = self.medical_generator.generate_medical_advice(
+                user_query=user_query,
+                retrieval_results=retrieval_results,
+                intention=intention
+            )
+            step4_time = (datetime.now() - step4_start).total_seconds()
+            # Extract medical advice
+            medical_advice = medical_advice_result.get('medical_advice', 'Unable to generate medical advice.')
+            confidence_score = medical_advice_result.get('confidence_score', 0.0)
+            processing_steps.append(f"   🎯 Intention: {intention}")
+            processing_steps.append(f"   📈 Confidence: {confidence_score:.2f}")
+            processing_steps.append(f"   ⏱️ Generation time: {step4_time:.3f}s")
+            # STEP 5: Final Summary
+            total_time = (datetime.now() - processing_start).total_seconds()
+            processing_steps.append(f"\n✅ Complete pipeline finished in {total_time:.3f}s")
+            # Prepare technical details
+            technical_details = {
+                "condition_extraction": {
+                    "method": self._determine_extraction_source(condition_result),
+                    "condition": condition_result.get('condition', ''),
+                    "processing_time": step1_time
+                },
+                "retrieval": {
+                    "search_query": search_query if DEBUG_MODE else "[Hidden in production]",
+                    "total_results": len(processed_results),
+                    "emergency_results": emergency_count,
+                    "treatment_results": treatment_count,
+                    "processing_time": step3_time
+                },
+                "generation": {
+                    "intention": intention,
+                    "confidence_score": confidence_score,
+                    "chunks_used": medical_advice_result.get('query_metadata', {}).get('total_chunks_used', 0),
+                    "processing_time": step4_time
+                },
+                "performance": {
+                    "total_pipeline_time": total_time,
+                    "debug_mode": DEBUG_MODE
+                }
+            }
+            # Apply security filtering for production
+            if not DEBUG_MODE:
+                technical_details = self._sanitize_technical_details(technical_details)
+            return (
+                medical_advice,
+                '\n'.join(processing_steps),
+                guidelines_display,
+                json.dumps(technical_details, indent=2)
+            )
+        except Exception as e:
+            error_msg = f"❌ System error: {str(e)}"
+            processing_steps.append(f"\n❌ Error occurred: {str(e)}")
+            error_details = {
+                "error": str(e),
+                "timestamp": datetime.now().isoformat(),
+                "query": user_query
+            }
+            return (
+                "I apologize, but I encountered an error while processing your medical query. Please try rephrasing your question or contact technical support.",
+                '\n'.join(processing_steps),
+                "{}",
+                json.dumps(error_details, indent=2)
+            )
+    def _format_guidelines_display(self, processed_results: List[Dict]) -> str:
+        """Format retrieved guidelines for user-friendly display"""
+        if not processed_results:
+            return json.dumps({"message": "No guidelines retrieved"}, indent=2)
+        guidelines = []
+        for i, result in enumerate(processed_results[:6], 1):  # Show top 6
+            guideline = {
+                "guideline_id": i,
+                "source_type": result.get('type', 'unknown').title(),
+                "relevance_score": f"{1 - result.get('distance', 1):.3f}",
+                "content_preview": result.get('text', '')[:200] + "..." if len(result.get('text', '')) > 200 else result.get('text', ''),
+                "matched_keywords": result.get('matched', '') if DEBUG_MODE else "[Keywords used for matching]"
+            }
+            guidelines.append(guideline)
+        return json.dumps({
+            "total_guidelines": len(processed_results),
+            "displayed_guidelines": guidelines
+        }, indent=2)
+    def _detect_query_intention(self, user_query: str) -> str:
+        """Simple intention detection based on query content"""
+        query_lower = user_query.lower()
+        treatment_indicators = ['treat', 'treatment', 'manage', 'therapy', 'protocol', 'how to']
+        diagnosis_indicators = ['diagnos', 'differential', 'symptoms', 'signs', 'what is']
+        treatment_score = sum(1 for indicator in treatment_indicators if indicator in query_lower)
+        diagnosis_score = sum(1 for indicator in diagnosis_indicators if indicator in query_lower)
+        if treatment_score > diagnosis_score:
+            return "treatment"
+        elif diagnosis_score > treatment_score:
+            return "diagnosis"
+        else:
+            return "treatment"  # Default to treatment for emergency scenarios
+    def _determine_extraction_source(self, condition_result: Dict) -> str:
+        """Determine how the condition was extracted"""
+        if condition_result.get('semantic_confidence') is not None:
+            return "semantic_search"
+        elif condition_result.get('generic_confidence') is not None:
+            return "generic_search"
+        elif condition_result.get('condition') in CONDITION_KEYWORD_MAPPING:
+            return "predefined_mapping"
+        else:
+            return "llm_extraction"
+    def _sanitize_technical_details(self, technical_details: Dict) -> Dict:
+        """Remove sensitive technical information for production mode"""
+        sanitized = {
+            "processing_summary": {
+                "total_time": technical_details["performance"]["total_pipeline_time"],
+                "confidence": technical_details["generation"]["confidence_score"],
+                "guidelines_found": technical_details["retrieval"]["total_results"]
+            },
+            "medical_context": {
+                "condition_identified": bool(technical_details["condition_extraction"]["condition"]),
+                "intention_detected": technical_details["generation"]["intention"],
+                "evidence_sources": f"{technical_details['retrieval']['emergency_results']} emergency + {technical_details['retrieval']['treatment_results']} treatment"
+            },
+            "system_status": {
+                "all_components_operational": True,
+                "processing_mode": "production"
+            }
+        }
+        return sanitized
+def create_oncall_interface():
+    """Create and configure the Gradio interface"""
+    # Initialize OnCall.ai system
+    oncall_system = OnCallAIInterface()
+    # Define interface theme and styling
+    theme = gr.themes.Soft(
+        primary_hue="blue",
+        secondary_hue="green",
+        neutral_hue="slate"
+    )
+    # Create Gradio interface
+    with gr.Blocks(
+        theme=theme,
+        title="OnCall.ai - Medical Emergency Assistant",
+        css="""
+        .main-container { max-width: 1200px; margin: 0 auto; }
+        .medical-advice { font-size: 16px; line-height: 1.6; }
+        .processing-steps { font-family: monospace; font-size: 14px; }
+        .guidelines-display { max-height: 400px; overflow-y: auto; }
+        """
+    ) as interface:
+        # Header
+        gr.Markdown("""
+        # 🏥 OnCall.ai - Medical Emergency Assistant
+        **Evidence-based clinical guidance for healthcare professionals**
+        ⚠️ **Medical Disclaimer**: This system is for research and educational purposes only.
+        Always consult qualified healthcare providers for medical decisions.
+        """)
+        # Main interface
+        with gr.Row():
+            with gr.Column(scale=1):
+                # Input section
+                gr.Markdown("## 📝 Medical Query Input")
+                user_input = gr.Textbox(
+                    label="Enter your medical query",
+                    placeholder="Example: How to treat acute myocardial infarction in emergency department?",
+                    lines=3,
+                    max_lines=5
+                )
+                # Optional intention override for testing
+                if DEBUG_MODE:
+                    intention_override = gr.Dropdown(
+                        choices=[None, "treatment", "diagnosis"],
+                        label="🎯 Override Intention (Debug Mode)",
+                        value=None
+                    )
+                else:
+                    intention_override = gr.State(None)
+                submit_btn = gr.Button("🔍 Get Medical Guidance", variant="primary", size="lg")
+                # Example queries
+                gr.Markdown("""
+                ### 💡 Example Queries
+                - "How to treat acute myocardial infarction?"
+                - "Patient with severe chest pain and shortness of breath"
+                - "Emergency protocols for acute stroke management"
+                - "Differential diagnosis for sudden onset chest pain"
+                """)
+        # Output sections
+        gr.Markdown("## 📋 Medical Guidance Results")
+        with gr.Row():
+            with gr.Column(scale=2):
+                # Primary output - Medical Advice
+                medical_advice_output = gr.Textbox(
+                    label="🩺 Medical Advice",
+                    lines=10,
+                    max_lines=15,
+                    elem_classes="medical-advice"
+                )
+                # Processing steps
+                processing_steps_output = gr.Textbox(
+                    label="📊 Processing Steps",
+                    lines=8,
+                    max_lines=12,
+                    elem_classes="processing-steps"
+                )
+            with gr.Column(scale=1):
+                # Retrieved guidelines
+                guidelines_output = gr.JSON(
+                    label="📚 Retrieved Medical Guidelines",
+                    elem_classes="guidelines-display"
+                )
+                # Technical details (collapsible in production)
+                if DEBUG_MODE:
+                    technical_output = gr.JSON(
+                        label="⚙️ Technical Details (Debug Mode)",
+                        elem_classes="technical-details"
+                    )
+                else:
+                    with gr.Accordion("🔧 System Information", open=False):
+                        technical_output = gr.JSON(
+                            label="Processing Information",
+                            elem_classes="technical-details"
+                        )
+        # Audio input section (placeholder for future)
+        with gr.Accordion("🎙️ Audio Input (Coming Soon)", open=False):
+            gr.Markdown("""
+            **Future Enhancement**: Voice input capability will be available soon.
+            You'll be able to:
+            - Record audio queries directly in the interface
+            - Upload audio files for processing
+            - Receive audio responses (Text-to-Speech)
+            """)
+            # Placeholder components for audio (inactive)
+            audio_input = gr.Audio(
+                label="Audio Query (Not yet functional)",
+                type="filepath",
+                interactive=False
+            )
+        # Event handlers
+        submit_btn.click(
+            fn=oncall_system.process_medical_query,
+            inputs=[user_input, intention_override] if DEBUG_MODE else [user_input],
+            outputs=[medical_advice_output, processing_steps_output, guidelines_output, technical_output]
+        )
+        # Enter key support
+        user_input.submit(
+            fn=oncall_system.process_medical_query,
+            inputs=[user_input, intention_override] if DEBUG_MODE else [user_input],
+            outputs=[medical_advice_output, processing_steps_output, guidelines_output, technical_output]
+        )
+        # Footer
+        gr.Markdown("""
+        ---
+        **OnCall.ai v0.9.0** | Built with ❤️ for healthcare professionals |
+        [GitHub](https://github.com/your-username/oncall-ai) |
+        **⚠️ Research Use Only**
+        """)
+    return interface
+def main():
+    """Main application entry point"""
+    print("🏥 Starting OnCall.ai Interactive Interface...")
+    print(f"🔧 Debug Mode: {'ON' if DEBUG_MODE else 'OFF'}")
+    try:
+        # Create interface
+        interface = create_oncall_interface()
+        # Launch configuration
+        launch_config = {
+            "server_name": "0.0.0.0",  # Allow external connections
+            "server_port": 7860,       # Standard Gradio port
+            "share": False,            # Set to True for public links
+            "debug": DEBUG_MODE,
+            "show_error": DEBUG_MODE
+        }
+        print("🚀 Launching OnCall.ai interface...")
+        print("🌐 Interface will be available at: http://localhost:7860")
+        if DEBUG_MODE:
+            print("🔧 Debug mode active - Technical details will be visible")
+        else:
+            print("🛡️ Production mode - Limited technical information displayed")
+        # Launch interface
+        interface.launch(**launch_config)
+    except Exception as e:
+        print(f"❌ Failed to launch interface: {e}")
+        print(f"Traceback: {traceback.format_exc()}")
+        return 1
+    return 0
+if __name__ == "__main__":
+    exit_code = main()
+    sys.exit(exit_code)
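
Reviewer sketch (not part of the commit): the keyword scoring in `_detect_query_intention` can be exercised on its own. The standalone function below mirrors that method under the assumption that it has no other dependencies; the module-level constant names and the function name `detect_query_intention` are introduced here for illustration. One thing the sketch makes visible is that matching is by substring, so a query containing "treatment" scores twice (once for "treat", once for "treatment").

```python
TREATMENT_INDICATORS = ("treat", "treatment", "manage", "therapy", "protocol", "how to")
DIAGNOSIS_INDICATORS = ("diagnos", "differential", "symptoms", "signs", "what is")


def detect_query_intention(user_query: str) -> str:
    """Return "diagnosis" only when diagnosis keywords strictly outnumber
    treatment keywords; ties and zero matches default to "treatment"."""
    query_lower = user_query.lower()
    # Substring matching: "manage" also matches "management".
    treatment_score = sum(ind in query_lower for ind in TREATMENT_INDICATORS)
    diagnosis_score = sum(ind in query_lower for ind in DIAGNOSIS_INDICATORS)
    if diagnosis_score > treatment_score:
        return "diagnosis"
    return "treatment"
```

With the example queries from the interface, "Differential diagnosis for sudden onset chest pain" is classified as diagnosis, while "Patient with severe chest pain and shortness of breath" matches no indicator and falls through to the treatment default.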

src/generation.py CHANGED

@@ -348,10 +348,10 @@ Your response should be concise but comprehensive, suitable for immediate clinic
         Returns:
             Structured medical advice response
         """
-        # Extract generated content
-        advice_content = generated_advice.get('extracted_condition', '')
+        # Extract generated content - use raw_response for complete medical advice
+        advice_content = generated_advice.get('raw_response', '')
         if not advice_content:
-            advice_content = generated_advice.get('raw_response', 'Unable to generate medical advice.')
+            advice_content = generated_advice.get('extracted_condition', 'Unable to generate medical advice.')
         # Calculate confidence based on available factors
         confidence_score = self._calculate_confidence_score(generated_advice, chunks_used)
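
Reviewer sketch (not part of the commit): the effect of swapping the two lookups can be checked in isolation. The helper below reproduces the new ordering as a free function; the names `select_advice_content` and `DEFAULT_ADVICE` are hypothetical, and the input is assumed to be a plain dict with optional `raw_response` and `extracted_condition` keys, as the hunk suggests.

```python
DEFAULT_ADVICE = "Unable to generate medical advice."


def select_advice_content(generated_advice: dict) -> str:
    """Prefer the full raw response; fall back to the extracted condition."""
    advice_content = generated_advice.get("raw_response", "")
    if not advice_content:
        # Reached when raw_response is missing, empty, or None.
        advice_content = generated_advice.get("extracted_condition", DEFAULT_ADVICE)
    return advice_content
```

Note that the default message is returned only when the `extracted_condition` key is absent. If that key is present but empty, the empty string is returned as-is, which matches the behaviour of the lines in the hunk.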