| name | codealive-context-engine |
|---|---|
| description | Semantic code search and AI-powered codebase Q&A across indexed repositories. Use when understanding code beyond local files, exploring dependencies, discovering cross-project patterns, planning features, debugging, or onboarding. Queries like "How does X work?", "Show me Y patterns", "How is library Z used?". The default path is semantic search plus grep search; chat-with-codebase is slower, more expensive, and usually secondary. |
Semantic code intelligence across your entire code ecosystem — current project, organizational repos, dependencies, and any indexed codebase.
All scripts require a CodeAlive API key. If any script fails with "API key not configured", help the user set it up:
Option 1 (recommended): Run the interactive setup and wait for the user to complete it:
python setup.pyOption 2 (not recommended — key visible in chat history): If the user pastes their API key directly in chat, save it via:
python setup.py --key THE_KEYDo NOT retry the failed script until setup completes successfully.
| Tool | Script | Speed | Cost | Best For |
|---|---|---|---|---|
| List Data Sources | datasources.py |
Instant | Free | Discovering indexed repos and workspaces |
| Semantic Search | search.py |
Fast | Low | Finding relevant artifacts by meaning |
| Grep Search | grep.py |
Fast | Low | Exact text and regex matches with line previews |
| Fetch Artifacts | fetch.py |
Fast | Low | Retrieving full content for search results |
| Artifact Relationships | relationships.py |
Fast | Low | Drilling into call graph, inheritance, references for one artifact |
| Chat with Codebase | chat.py |
Slow | High | Synthesized answers, architectural explanations |
Cost guidance: semantic_search and grep_search are the default starting point. Chat with Codebase invokes an LLM on the server side, can take up to 30 seconds, and is significantly more expensive per call — use it only when you need a synthesized, ready-to-use answer rather than raw search results.
Highest-confidence guidance: If your agent supports subagents and the task needs maximum reliability or depth, prefer a subagent-driven workflow that combines search.py, grep.py, fetch.py, relationships.py, and local file reads. chat.py is optional synthesis, not the default path.
Three-step workflow (search → triage → load real content):
- Search — find relevant code locations with descriptions and identifiers
- Triage — use
descriptionONLY to decide which results are worth a closer look. It is a pointer, NOT the source of truth. Do not draw conclusions from it. - Get real content — for every artifact you decide is relevant:
- External repos (no local access):
python fetch.py <identifier> - Current working repo: read the file at the shown path with your editor's
file-read tool
Treat only that real
contentas ground truth.
- External repos (no local access):
Optional drill-down: once you know an artifact matters, run
python relationships.py <identifier> to expand its call graph, inheritance,
or references.
Use this skill for semantic understanding:
- "How is authentication implemented?"
- "Show me error handling patterns across services"
- "How does this library work internally?"
- "Find similar features to guide my implementation"
Use local file tools instead for:
- Finding specific files by name or pattern
- Exact keyword search in the current directory
- Reading known file paths
- Searching uncommitted changes
python scripts/datasources.pypython scripts/search.py "JWT token validation" my-backend
python scripts/search.py "authentication flow" my-repo --path src/auth --ext .py
python scripts/grep.py "AuthService" my-repo
python scripts/grep.py "auth\\(" my-repo --regexpython scripts/fetch.py "my-org/backend::src/auth.py::AuthService.login()"# Full call graph (default)
python scripts/relationships.py "my-org/backend::src/auth.py::AuthService.login()"
# Inheritance hierarchy for a class
python scripts/relationships.py "my-org/backend::src/models.py::User" --profile inheritanceOnly
# Calls + inheritance, raise the per-type cap
python scripts/relationships.py "my-org/backend::src/svc.py::Service" --profile allRelevant --max-count 200python scripts/chat.py "Explain the authentication flow" my-backend
python scripts/chat.py "What about security considerations?" --continue CONV_IDpython scripts/datasources.py # Ready-to-use sources
python scripts/datasources.py --all # All (including processing)
python scripts/datasources.py --json # JSON outputReturns file paths, line numbers, descriptions, identifiers, and content sizes. Fast and cheap.
python scripts/search.py <query> <data_sources...> [options]| Option | Description |
|---|---|
--max-results N |
Optional cap for the number of returned artifacts |
--path PATH |
Repo-relative path or directory scope (repeatable) |
--ext EXT |
File extension scope such as .py or .ts (repeatable) |
description is a triage pointer ONLY — it tells you which artifacts are
worth a closer look. It is NOT the source of truth and you must NOT draw
conclusions from it. For every result you consider relevant, load the real
source: use fetch.py <identifier> for external repos, or your editor's
file-read tool on the path for repos in the current working directory. Treat
only that real content as ground truth.
Returns artifact-level matches with line previews. Use this when the pattern itself matters more than semantic similarity.
python scripts/grep.py <query> <data_sources...> [--regex] [--max-results N] [--path PATH] [--ext EXT]| Option | Description |
|---|---|
--regex |
Interpret the query as a regex pattern |
--max-results N |
Optional cap for the number of returned artifacts |
--path PATH |
Repo-relative path or directory scope (repeatable) |
--ext EXT |
File extension scope such as .py or .ts (repeatable) |
Line previews are still search evidence, not source of truth. Use fetch.py
or your local file-read tool before drawing conclusions about behavior.
Retrieves the full source code content for artifacts found via search. Use this for external repositories you cannot access locally.
python scripts/fetch.py <identifier1> [identifier2...]| Constraint | Value |
|---|---|
| Max identifiers per request | 20 |
| Identifiers source | identifier field from search results |
| Identifier format | {owner/repo}::{path}::{symbol} (symbols), {owner/repo}::{path} (files) |
For function-like artifacts the response includes a small relationships
preview (up to 3 outgoing/incoming calls per direction). To see the full
call graph, inheritance, or references, run relationships.py with the
artifact's identifier.
Returns the full call graph (incoming/outgoing calls), inheritance hierarchy
(ancestors/descendants), or symbol references for a single artifact. This is
the drill-down tool — use it AFTER search.py or fetch.py once you have an
identifier and want to understand how the artifact relates to the rest of the
codebase.
python scripts/relationships.py <identifier> [--profile PROFILE] [--max-count N]| Option | Description |
|---|---|
--profile callsOnly |
Default. Outgoing + incoming calls |
--profile inheritanceOnly |
Ancestors + descendants |
--profile allRelevant |
Calls + inheritance (4 groups) |
--profile referencesOnly |
Symbol references |
--max-count N |
Max related artifacts per relationship type (1–1000, default 50) |
--json |
Emit the raw JSON response instead of the formatted view |
Sends your question to an AI consultant that has full context of the indexed codebase. Returns synthesized, ready-to-use answers. Supports conversation continuity for follow-ups.
This is more expensive than search because it runs an LLM inference on the server side. Prefer search when you just need to locate code. Use chat when you need explanations, comparisons, or architectural analysis after search. It can take up to 30 seconds.
python scripts/chat.py <question> <data_sources...> [options]| Option | Description |
|---|---|
--continue <id> |
Continue a previous conversation (saves context and cost) |
Conversation continuity: Every response includes a conversation_id. Pass it with --continue for follow-up questions — this preserves context and is cheaper than starting fresh.
Repository — single codebase, for targeted searches:
python scripts/search.py "query" my-backend-apiWorkspace — multiple repos, for cross-project patterns:
python scripts/search.py "query" workspace:backend-teamMultiple repositories:
python scripts/search.py "query" repo-a repo-b repo-c- Python 3.8+ (no third-party packages required — uses only stdlib)
The skill needs a CodeAlive API key. Resolution order:
CODEALIVE_API_KEYenvironment variable- OS credential store (macOS Keychain / Linux secret-tool / Windows Credential Manager)
Environment variable (all platforms):
export CODEALIVE_API_KEY="your_key_here"macOS Keychain:
security add-generic-password -a "$USER" -s "codealive-api-key" -w "YOUR_API_KEY"Linux (freedesktop secret-tool):
secret-tool store --label="CodeAlive API Key" service codealive-api-keyWindows Credential Manager:
cmdkey /generic:codealive-api-key /user:codealive /pass:"YOUR_API_KEY"Base URL (optional, defaults to https://app.codealive.ai):
export CODEALIVE_BASE_URL="https://your-instance.example.com"For self-hosted CodeAlive, use your deployment origin. https://your-instance.example.com is preferred, but https://your-instance.example.com/api is also accepted and normalized automatically.
Get API keys at: https://app.codealive.ai/settings/api-keys
This skill works standalone, but delivers the best experience when combined with the CodeAlive MCP server. The MCP server provides direct tool access via the Model Context Protocol, while this skill provides the workflow knowledge and query patterns to use those tools effectively.
| Component | What it provides |
|---|---|
| This skill | Query patterns, workflow guidance, cost-aware tool selection |
| MCP server | Direct semantic_search, grep_search, fetch_artifacts, get_artifact_relationships, chat, get_data_sources tools plus deprecated aliases |
When both are installed, prefer the MCP server's tools for direct operations and this skill's scripts for guided workflows.
For advanced usage, see reference files:
- Query Patterns — effective query writing, anti-patterns, language-specific examples
- Workflows — step-by-step workflows for onboarding, debugging, feature planning, and more