Commit History

Re-check multilevel fallback validation test suite for OnCall.ai
fbad237

YanBoChen commited on

feat(user_prompt): Enhance Medical Query Processing Pipeline
f24fd2b

YanBoChen commited on

feat(user_prompt): update UserPromptProcessor to integrate Llama3-Med42-70B and enhance query validation; add unit tests for condition extraction and matching mechanisms
30fc9ee

YanBoChen commited on

feat(retrieval): add sliding window search method for enhanced semantic search
a1e2d00

YanBoChen commited on

feat(user_prompt): add keyword index checking and enhance medical query validation (add validate_medical_query() and _check_keyword_in_index(), based on implementation_todo_20250730_user_prompt.md
fa23be2

YanBoChen commited on

feat(llm_clients): enhance MeditronClient to support local model loading and improve error handling
4c919d2

YanBoChen commited on

feat(.gitignore): add cache directory to ignored files
acc25ea

YanBoChen commited on

WIP(llm_clients): add MeditronClient for medical query processing with Hugging Face integration
7282410

YanBoChen commited on

🚀 Implement Advanced Condition Extraction for Medical Query Processing
c414f60

YanBoChen commited on

refactor(deduplication): change deduplication logic from distance-based to exact text matching
37c6713

YanBoChen commited on

feat(retrieval) 1st MVP: enhance search logging and deduplication logic with distance threshold
890989b

YanBoChen commited on

WIP: feat(retrieval): implement basic vector retrieval system for medical documents
6c249e5

YanBoChen commited on

refactor(data_processing): enhance chunking and embedding generation
69b7911

YanBoChen commited on

WIP: Enhance dual keyword chunking to include pre-calculated metadata for treatment chunks
c0317b2

YanBoChen commited on

Add comprehensive tests for chunk quality analysis and embedding validation
f3ac7d9

YanBoChen commited on

Add requirements.txt for test dependencies
8942859

YanBoChen commited on

chore(.gitignore): update ignored files for environment and editor settings
d3ad7d4

YanBoChen commited on

Add new optimized subsets comparison reports in CSV and JSONL formats
fcd6404

YanBoChen commited on

chore(.gitignore): add json and png file extensions to ignored files
35ef528

YanBoChen commited on

chore: remove obsolete analysis files and plots for emergency and treatment datasets
95a7e44

YanBoChen commited on

refactor(.gitignore): reorganize and expand ignored files for clarity and completeness
2e1a43b

YanBoChen commited on

Remove obsolete embedding and index files; add comprehensive embedding test analysis and validation suite
775f8ea

YanBoChen commited on

feat(data_processing): Implement token length control with semantic preservation
922ed80

YanBoChen commited on

refactor(data_processing): add token-based chunking strategy for improved keyword context
6083d96

YanBoChen commited on

refactor(data_processing): optimize chunking strategy with token-based approach
87dcd9d

YanBoChen commited on

fix: add .vscode/ to .gitignore
e72f098

YanBoChen commited on

fix: update .gitignore to include docs/ and *.pyc
3a40790

YanBoChen commited on

feat(data-processing): implement data processing pipeline with embeddings
68cfce0

YanBoChen commited on

Merge pull request #1 from YanBoChen0928/dataprocessing
cd2cfdd

Yan-Bo Chen commited on

Update dataset/scripts/data_explorer_treatment.py
1e4fdbf

Yan-Bo Chen Copilot commited on

Update dataset/scripts/data_explorer_treatment_opt.py
f58b914

Yan-Bo Chen Copilot commited on

feat: add emergency subset analysis plots and statistics report
d72142e

YanBoChen commited on

refactor: migrate special terms to JSON configuration
8de0937

YanBoChen commited on

feat: implement special_term (added) emergency keyword matching and metadata extraction
9829a46

YanBoChen commited on

feat: update treatment analysis with keyword density calculations and enhanced visualization(test previous 2 dataset, especially treatment_subset)
654aa66

YanBoChen commited on

feat: add integrity check report for dataset analysis (previous commit test result)
04a03be

YanBoChen commited on

WIP: during pre-process dataset, when doing dataset_treatment exploration, some abnormality happen, thus we now create certain test script to identify the problem
d37f4b2

YanBoChen commited on

feat: add comprehensive treatment analysis statistics and update treatment keywords
2ee61dc

YanBoChen commited on

WIP: add dual keyword and text length distribution plots for treatment subset analysis
a5bcfa7

YanBoChen commited on

WIP: Try to analysis treatment_subset enhance emergency and treatment filtering scripts with metadata and analysis functionality
7d8970e

YanBoChen commited on

feat: update emergency subset analysis scripts and visualizations, enhance statistics, and fix keyword files
ee06c0f

YanBoChen commited on

WIP: add emergency subset analysis scripts and visualizations
b37084c

YanBoChen commited on

refactor: remove outdated emergency subset preprocessing documentation
c778aa6

YanBoChen commited on

feat(dataset): Implement emergency subset extraction with enhanced matching
988dac9

YanBoChen commited on

fix: correct dataset path in .gitignore
6845aa8

YanBoChen commited on

feat: update requirements.txt to include additional dependencies
b46ceed

YanBoChen commited on

feat: add requirements.txt with initial package dependencies
21e5bc6

YanBoChen commited on

docs: update .gitignore to include virtual environment and docs directory
990fd22

YanBoChen commited on

docs: add .gitignore and init
db42a56

YanBoChen commited on