transcription/.gitignore
keboss-m eee8f4c8a4 Replace LightRAG with native Python RAG engine + add deploy tooling
- New: src/rag/engine/ — in-process hybrid search (FTS5 BM25 + sqlite-vec + LLM rerank)
- New: src/rag/qmd/ — compatibility layer (qmd_query, qmd_chat, qmd_chat_stream, qmd_index_*)
- New: src/ingest/stub_writer.py — .md stubs for binary files (videos, archives)
- New: scripts/deploy.sh + scripts/pull_models.sh + Makefile + .env.example
- Removed: LightRAG, sentence-transformers embedding via separate package, rag_standalone/
- Removed: @nousresearch/qmd npm dep (package not published); Node.js from Dockerfile
- Updated: tests/ (46 passed), docker-compose, .dockerignore, config.yaml, README

Engine: in-process Python (no daemon, no npm), sentence-transformers 384-dim,
RRF fusion (k=60), BM25 + vector with numpy fallback. WebSocket API unchanged.

Deploy: 'git clone' + 'make init' + 'make pull-models MODELS_SOURCE=...' + 'make up'.
Models (5.83 GB) live outside git; pulled via rsync from dev host.
2026-06-10 14:24:01 +03:00

94 lines
994 B
Plaintext

**/__pycache__/
*.py[cod]
*$py.class
*.so
.Python
env/
venv/
ENV/
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
*.egg-info/
.installed.cfg
*.egg
# Модели и кэш
models/
*.bin
*.pt
*.pth
*.onnx
checkpoints/
# Временные и выходные файлы
tmp/
temp/
output/
uploads/
processed/
*.wav
*.mp3
*.m4a
*.ogg
*.flac
*.docx
*.md
!README.md
!AGENTS.md
*.txt
!requirements.txt
!.gitkeep
# IDE
.vscode/
.idea/
*.swp
*.swo
*~
# OS
.DS_Store
Thumbs.db
# Secrets (даже в приватном Gitea лучше держать .env вне репо)
.env
.env.local
*.key
*.secret
# User data
video/
*.mp4
# Server logs
server.log
*.log
# Auth database
data/
# Migration archives (do not commit — see scripts/migrate_lightrag_to_qmd.py)
migrate/
*.tar.gz
*.tar
# OpenSpec local state (changes/specs are committed; archive metadata is fine)
.opencode/
# Build artifacts
_docker_out/
_*.log
_build*.log
_smoke_*.py
_debug*.py