4.5 KiB
4.5 KiB
Agent Guidelines
Git Workflow
- Commit frequently: After completing a meaningful unit of work (feature, fix, or file update), stage changes with
git addand create a commit with a clear, concise message in the imperative mood (e.g., "Add parser", "Fix timeout"). - Push to remote: Once the local commit(s) are ready, push them to the remote repository. Use
git push -u origin mainif the upstream branch is not yet tracked; otherwise usegit push. - No uncommitted changes left behind: Before finishing a task, ensure all intended changes are committed and pushed to avoid losing work.
- No empty commits: Avoid creating empty or placeholder commits.
Native RAG Engine
The project uses a native Python RAG engine (no external daemons, no Node.js): hybrid BM25 (SQLite FTS5) + vector (sqlite-vec with numpy fallback) + LLM rerank through OpenCode.
Layout
src/rag/engine/— the engine itself:db.py—Database(SQLite + sqlite-vec + FTS5 schema, fallback detection).chunker.py— markdown-aware recursive splitter (~900 chars, 15% overlap).embeddings.py— singleton sentence-transformers model (lazy load).bm25.py— FTS5 BM25 withrank_bm25fallback.vector.py— sqlite-vec with numpy cosine fallback.hybrid.py— RRF fusion (k=60).rerank.py— LLM rerank through OpenCode.engine.py— public facade:index_file,index_text,search,vsearch,query,get,status,warmup.
src/rag/qmd/— compatibility layer preserving the oldqmd_*API:qmd_query,qmd_chat,qmd_chat_stream,qmd_index_meeting,qmd_index_document.main.py/queue.py/ingest_worker.pyuse these.src/ingest/stub_writer.py—.mdstubs for binary files (videos, archives).
Conventions
- Коллекция =
processed/<org>/qmd_collections/<project_slug>/(или_global/) — внутри лежитindex.sqlite. - Перед изменением
src/rag/engine/— прочитайopenspec/changes/native-rag-engine/design.md. - При добавлении нового retrieval-режима — обнови
LEGACY_MODE_MAPвsrc/rag/qmd/query.py. - При добавлении нового LLM-вызова — обнови
CHAT_MODESвsrc/rag/qmd/query.py.
Tests
- Все новые модули
src/rag/engine/обязаны иметь unit-тест вtests/test_native_engine.py. - Реальные данные: 3–5
.mdфайлов вtempfile.TemporaryDirectory(). - Запуск:
python -m pytest tests/ -q(46 passed на момент написания). - E2E:
tests/test_native_engine_e2e.py— ingest → search → chat-stream с подменой OpenCode.
Fallback-стратегии
- FTS5 недоступен →
rank_bm25in-memory. - sqlite-vec недоступен → numpy cosine in-memory.
- Embedding-модель не загрузилась → BM25-only режим.
Deploy
Архитектура: код в git, модели отдельно (rsync).
Makefile— основные цели:init,pull-models,up,down,restart,logs,status,test,deploy.scripts/pull_models.sh—MODELS_SOURCE=user@host:path→ rsync моделей; fallback наdownload_models.pyиз интернета.scripts/deploy.sh— rsync кода +.env+make pull-models && make upна удалённом сервере..env.example— коммитится..env— нет (в.gitignore).
Что в git / что нет
| В git (~2 MB) | НЕ в git |
|---|---|
src/, backend/, tests/, scripts/, Makefile |
models/ (5.83 GB) |
Dockerfile*, docker-compose*.yml, config.yaml |
processed/, uploads/, data/ (рантайм) |
AGENTS.md, README.md, .env.example |
migrate/*.tar.gz (9.5 GB) |
openspec/ (спецификации) |
.env (секреты) |
Деплой одной командой (с исходной машины)
git add -A && git commit -m "..." && git push origin main
./scripts/deploy.sh user@server /opt/transcription
Первый запуск на новом сервере
git clone https://gts.meratalk.online/keboss/transcription.git /opt/transcription
cd /opt/transcription
make init # .env из .env.example
nano .env # вписать HF_TOKEN, OPENCODE_API_KEY
make pull-models MODELS_SOURCE=user@dev-host:/opt/transcription/models/
make up