Cerebro - Enterprise Knowledge Extraction Platform
Identityβ
- Product name: Cerebro
- What it is: Enterprise Knowledge Extraction Platform β code analysis, RAG, and GCP integration
- What it is NOT: A "credit burn" tool. GCP credit management is ONE feature, not the identity
Naming Conventionβ
| Scope | Name |
|---|---|
| Product / CLI | Cerebro (cerebro command) |
| Python package dir | src/cerebro/ |
| CLI entrypoints | Both cerebro and phantom work (backward compat) |
| Imports in code | from cerebro.core... |
Rename Historyβ
The package was renamed from src/phantom/ to src/cerebro/ in March 2026.
The phantom CLI entrypoint is kept as a backward-compat alias in pyproject.toml.
Language Rulesβ
- All user-facing strings (CLI output, TUI labels, docstrings) MUST be in English
- Internal code comments: English preferred
- No Portuguese in any user-facing surface
GCP Configurationβ
- Never hardcode GCP project IDs β use
os.getenv("GCP_PROJECT_ID") - Never hardcode Data Store IDs β use
os.getenv("DATA_STORE_ID") - Placeholder in docs/help:
<your-gcp-project-id>
Project Structureβ
src/cerebro/ # Python package
cli.py # CLI entrypoint (Typer)
tui/ # Terminal UI (Textual)
core/ # Business logic
commands/ # CLI command groups
dashboard/ # React web dashboard
docs/
architecture/ # Architecture docs, ADR summary
guides/ # Setup guides, cheatsheets
features/ # Feature-specific docs (gcp-credits, intelligence, strategy)
project/ # Project status, plans, audits
phases/ # Historical phase implementation docs
i18n/ # Portuguese translations
commands/ # CLI command docs
Companion Servicesβ
| Service | Repo | Role |
|---|---|---|
| cerebro-reranker | ~/master/cerebro-reranker | Primary reranker (HybridEngine: MiniLM β Electra β DeBERTa). API: POST /v1/rerank. GCP Vertex Search is the fallback. |
Integration point: src/cerebro/core/rerank_client.py β CerebroRerankerClient.
Endpoint: http://localhost:8090 (env: CEREBRO_RERANKER_URL).
Dashboard Backendβ
- Launcher (
src/cerebro/launcher.py) startscerebro.api.server:appon port 8000. src/cerebro/dashboard_server.pyis a legacy stub β do NOT use or extend it.- All new dashboard endpoints go in
src/cerebro/api/server.py.
Key ADRsβ
- ADR-0027: Phantom Phase 1 API (accepted)
- ADR-0030: Enterprise Repositioning & Identity Resolution (accepted)