feat: enhance pack ingestion logging and implement advanced summarization
a090266
Bellokcommited on
Merge branch 'main' of https://huggingface.co/spaces/Bellok/warbler-cda
75a37d8
Bellokcommited on
refactor(app): improve code formatting and add background ingestion status display
ec38897
Bellokcommited on
refactor(app): improve code formatting and add background ingestion status display
d754cd5
Bellokcommited on
fix(embeddings): enhance numerical stability in chunk entropy calculation
a2c1773
Bellokcommited on
chore: update .gitignore to ignore Node.js node_modules
4124d89
Bellokcommited on
disabled parallelism
09adbc1
Bellokcommited on
`Refactored background ingestion tracking and function`
a0dbf73
Bellokcommited on
again
bfcb0d4
Bellokcommited on
trying to allow ingesting in the background. Current throughput on ingest is 17+ documents per secon, that means only 25k documents out of over a million will load within HuggingFace's required time limit. If we want to have access to all 1.5m documents, we need to run the ingest in the background and allow Warbler-CDA to report healthy before the time-limit is up.
d90df2b
Bellokcommited on
`Refactor Warbler pack ingestion to process in batches for memory efficiency and add progress updates.`
08609d9
Bellokcommited on
again
c2695d4
Bellokcommited on
another try.
06c76fb
Bellokcommited on
trying updated datasets ran through our ingest
b2a1583
Bellokcommited on
again. i am so bad at this. lol.
91d1f81
Bellokcommited on
another one
172b3ba
Bellokcommited on
more
c68ca27
Bellokcommited on
again
4e49639
Bellokcommited on
again
edc1dd9
Bellokcommited on
again
74e02ab
Bellokcommited on
Merge branch 'main' of https://huggingface.co/spaces/Bellok/warbler-cda
ec21ffb
Bellokcommited on
again
8328a04
Bellokcommited on
again
92afc13
Bellokcommited on
again
695a6b5
Bellokcommited on
some more f-string fixes
86b7b28
Bellokcommited on
manually edited this time to try something... I noticed the \n\n at any f multi-line-strings were red... conversion to single-line turned them normal color... hoping this works.
16664af
Bellokcommited on
Merge branch 'main' of https://huggingface.co/spaces/Bellok/warbler-cda
5bd8469
Bellokcommited on
`Removed newline before assembly quality metric`
4143d66
Bellokcommited on
`Removed newline before assembly quality metric`
d0f04eb
Bellokcommited on
Fix syntax errors in app.py preventing Gradio app startup on HF Spaces
2133289
jmeyer1980commited on
Merge branch 'main' of https://huggingface.co/spaces/Bellok/warbler-cda
9380cea
Bellokcommited on
app.py hotfix
d7ce980
Bellokcommited on
app.py hotfix
6b0ac11
Bellokcommited on
staged changes are still showing even after forced push.
55d584b
Bellokcommited on
Initial clean commit with jsonl packs and updated .gitignore