Upload folder using huggingface_hub
- .dockerignore +1 -0
- .github/copilot-instructions.md +4 -2
- .github/workflows/update_space.yml +12 -1
- .specify/memory/constitution.md +3 -3
- Dockerfile +9 -2
- TESTING.md +24 -5
- pyproject.toml +45 -0
- pyrightconfig.json +6 -0
- specs/001-personified-ai-agent/spec.md +4 -4
- specs/002-linkedin-profile-extractor/INTEGRATION_GUIDE.md +4 -3
- src/agent.py +299 -146
- src/app.py +12 -11
- src/config.py +18 -18
- src/data.py +86 -62
- src/notebooks/experiments.ipynb +7 -10
- tests/data/README.md +194 -0
- tests/data/projects.md +61 -0
- tests/data/team.md +49 -0
- tests/integration/spec-001.py +507 -0
- tests/unit/test_config.py +36 -0
- tests/unit/test_data.py +175 -0
- uv.lock +39 -1
.dockerignore
CHANGED
@@ -1,6 +1,7 @@
 # Environment and secrets
 .env
 .env.*
+.venv
 
 # Git
 .git
.github/copilot-instructions.md
CHANGED
@@ -22,9 +22,11 @@ The constitution covers:
 
 ## Quick Development Checklist
 
-- Run tests after refactoring: `uv run pytest
+- Run tests after refactoring: `uv run pytest tests/ -v`
 - Always update notebooks when changing function signatures
-- Use `uv` for all code execution (never `pip` directly)
+- Use `uv` for all code execution (never `pip` directly, never manually activate venv)
+- Use `uv run` to execute scripts and commands, never bare `python` or shell activation
+- **NEVER use `tail`, `head`, `grep`, or similar output filters** — show full output always so you can see everything that's happening
 - See `TESTING.md` for detailed test setup
 
 ## Common Gotchas & Reminders
.github/workflows/update_space.yml
CHANGED
@@ -23,13 +23,24 @@ jobs:
           GROQ_API_KEY: ${{ secrets.GROQ_API_KEY }}
           GITHUB_PERSONAL_ACCESS_TOKEN: ${{ secrets.GITHUB_TOKEN }}
         run: |
+          rm -rf ${{ github.workspace }}/htmlcov
           docker run --rm \
             -e OPENAI_API_KEY="${OPENAI_API_KEY}" \
             -e GROQ_API_KEY="${GROQ_API_KEY}" \
            -e GITHUB_PERSONAL_ACCESS_TOKEN="${GITHUB_PERSONAL_ACCESS_TOKEN}" \
+            -v "${{ github.workspace }}/htmlcov:/app/htmlcov" \
+            --user 0 \
             --entrypoint uv \
             ai-me:test \
-            run pytest
+            run pytest tests/ -v --cov=src --cov-report=html --cov-report=term-missing
+
+      - name: Upload coverage reports as artifact
+        uses: actions/upload-artifact@v4
+        if: always()
+        with:
+          name: coverage-report
+          path: htmlcov/
+          retention-days: 30
 
   deploy:
     needs: test
.specify/memory/constitution.md
CHANGED
@@ -104,7 +104,7 @@ All software must maintain complete traceability between requirements, implement
 2. **Local Development**:
    - Use `docs/` directory for markdown (won't deploy unless pushed to GitHub repo)
    - Test locally: `uv run src/app.py` (Gradio on port 7860)
-   - Run tests: `uv run pytest
+   - Run tests: `uv run pytest tests/ -v`
    - Edit notebooks then validate changes don't break tests
 
 3. **Docker/Notebook Development**:
@@ -123,10 +123,10 @@ All software must maintain complete traceability between requirements, implement
 - `src/data.py` - DataManager class, complete document pipeline
 - `src/agent.py` - AIMeAgent class, MCP setup, agent creation
 - `src/app.py` - Gradio interface, session management
-- `src/test.py` - Integration tests with pytest-asyncio
 - `src/notebooks/experiments.ipynb` - Development sandbox (test all APIs here first)
+- `tests/integration/spec-001.py` - Integration tests for spec 001 (personified AI agent)
+- `tests/data/` - Test fixtures and sample data
 - `docs/` - Local markdown for RAG development
-- `test_data/` - Test fixtures and sample data
 - `.github/copilot-instructions.md` - Detailed AI assistant guidance
 - `.specify/` - Spec-Driven Development templates and memory
 
Dockerfile
CHANGED
@@ -22,12 +22,19 @@ RUN mkdir -p /app/bin \
 
 WORKDIR /app
 
-#
+# Copy only dependency specifications for layer caching
 COPY pyproject.toml uv.lock ./
-RUN uv sync
 
+# Create virtual environment and sync dependencies from lock file
+# --no-install-project defers building the local package until source is copied
+RUN uv venv && uv sync --locked --no-install-project
+
+# Now copy the complete source code
 COPY . /app
 
+# Sync again to install the local package (now that source is present)
+RUN uv sync --locked
+
 # Non-root user with access to /app
 RUN adduser -u 5678 --disabled-password --gecos "" appuser && chown -R appuser /app
 USER appuser
TESTING.md
CHANGED
@@ -2,7 +2,7 @@
 
 ## Overview
 
-The test suite (`src/test.py`) validates the ai-me agent system including:
+The test suite (`tests/integration/spec-001.py`) validates the ai-me agent system including:
 - Vectorstore setup and document loading
 - Agent configuration and initialization
 - RAG (Retrieval Augmented Generation) functionality
@@ -29,13 +29,32 @@ The test suite (`src/test.py`) validates the ai-me agent system including:
 From project root:
 ```bash
 # All tests
-uv run pytest
+uv run pytest tests/ -v
 
 # With detailed output
-uv run pytest
+uv run pytest tests/ -v -o log_cli=true --log-cli-level=INFO --capture=no
 
 # Specific test
-uv run pytest
+uv run pytest tests/integration/spec-001.py::test_rear_knowledge_contains_it245 -v -o log_cli=true --log-cli-level=INFO --capture=no
+```
+
+### Run Tests with Code Coverage
+
+```bash
+# Run tests with coverage report
+uv run pytest tests/ --cov=src --cov-report=term-missing -v
+
+# Generate HTML coverage report
+uv run pytest tests/ --cov=src --cov-report=html -v
+
+# View HTML report (opens in browser)
+open htmlcov/index.html
+
+# Integration tests only with coverage
+uv run pytest tests/integration/ --cov=src --cov-report=term-missing -v
+
+# Show only uncovered lines
+uv run pytest tests/ --cov=src --cov-report=term:skip-covered -v
 ```
 
 ## Test Architecture
@@ -49,7 +68,7 @@ uv run pytest src/test.py::test_rear_knowledge_contains_it245 -v -o log_cli=true
 **Configuration**:
 - **Temperature**: Set to 0.0 for deterministic, reproducible responses
 - **Model**: Uses model specified in config (default: `openai/openai/gpt-oss-120b` via Groq)
-- **Data Source**: `
+- **Data Source**: `tests/data/` directory (configured via `doc_root` parameter)
 - **GitHub Repos**: Disabled (`GITHUB_REPOS=""`) for faster test execution
 
 The temperature of 0 ensures that the agent's responses are consistent across test runs, making assertions more reliable.
pyproject.toml
CHANGED
@@ -1,3 +1,7 @@
+[build-system]
+requires = ["hatchling"]
+build-backend = "hatchling.build"
+
 [project]
 name = "ai-me"
 version = "0.1.0"
@@ -34,4 +38,45 @@ dev = [
     "ipywidgets~=8.1",
     "pytest~=8.0",
     "pytest-asyncio~=0.24",
+    "pytest-cov~=6.0",
 ]
+
+[tool.hatch.build.targets.wheel]
+packages = ["src"]
+
+[tool.hatch.build.targets.editable]
+packages = ["src"]
+
+[tool.pytest.ini_options]
+testpaths = ["tests"]
+python_files = ["test_*.py", "*_test.py", "spec-*.py"]
+python_classes = ["Test*", "*Tests"]
+python_functions = ["test_*"]
+pythonpath = ["src"]
+
+[tool.coverage.run]
+source = ["src"]
+omit = [
+    "*/tests/*",
+    "*/test_*",
+    "*/__pycache__/*",
+    "*/notebooks/*",
+]
+
+[tool.coverage.report]
+exclude_lines = [
+    "pragma: no cover",
+    "def __repr__",
+    "raise AssertionError",
+    "raise NotImplementedError",
+    "if __name__ == .__main__.:",
+    "if TYPE_CHECKING:",
+    "class .*\\bProtocol\\):",
+    "@(abc\\.)?abstractmethod",
+]
+show_missing = true
+precision = 2
+fail_under = 85
+
+[tool.coverage.html]
+directory = "htmlcov"
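
The new `[tool.pytest.ini_options]` block is what lets the relocated tests resolve `src` imports without packaging tricks: `testpaths` points collection at `tests/`, `python_files` adds the `spec-*.py` pattern used by `tests/integration/spec-001.py`, and `pythonpath = ["src"]` puts the source tree on `sys.path`. Below is a minimal sketch of a unit test that relies on those settings; the import mirrors the one used in `src/agent.py`, while the test body is illustrative and not copied from `tests/unit/test_config.py`.

```python
# Hypothetical sketch only: shows how the [tool.pytest.ini_options] settings
# (testpaths=["tests"], pythonpath=["src"]) let a test import src modules directly.
from config import setup_logger  # resolvable because pythonpath includes "src"


def test_setup_logger_returns_logger():
    # Illustrative assertion; the real tests/unit/test_config.py may check more.
    logger = setup_logger("example")
    assert logger is not None
```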
pyrightconfig.json
ADDED
@@ -0,0 +1,6 @@
+{
+    "typeCheckingMode": "standard",
+    "include": ["src", "tests"],
+    "exclude": ["**/node_modules", "**/__pycache__", ".venv"],
+    "extraPaths": ["src"]
+}
specs/001-personified-ai-agent/spec.md
CHANGED
@@ -205,16 +205,16 @@ A user asks the agent a question, and the agent provides a response with clear r
 | Requirement | User Stories | Success Criteria | Implementation Modules | Tests |
 |---|---|---|---|---|
 | [**FR-001**](#fr-001-chat-interface) (Chat Interface) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1), [US2](#user-story-2---interact-across-multiple-conversation-topics-priority-p2) | [SC-001](#sc-001-validates-fr-003), [SC-005](#sc-005-validates-nfr-001), [SC-006](#sc-006-validates-nfr-002) | [`src/app.py::initialize_session()`](../../src/app.py), [`src/app.py::chat()`](../../src/app.py) | [`test_user_story_2_multi_topic_consistency()`](../../src/test.py) |
-| [**FR-002**](#fr-002-knowledge-retrieval) (Knowledge Retrieval) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1) | [SC-002](#sc-002-validates-fr-002-fr-004), [SC-003](#sc-003-validates-fr-006) | [`src/data.py::load_local_documents()`](../../src/data.py), [`src/data.py::load_github_documents()`](../../src/data.py), [`src/data.py::chunk_documents()`](../../src/data.py), [`src/data.py::create_vectorstore()`](../../src/data.py) | [`test_rear_knowledge_contains_it245()`](../../src/test.py), [`test_carol_knowledge_contains_product()`](../../src/test.py), [`
+| [**FR-002**](#fr-002-knowledge-retrieval) (Knowledge Retrieval) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1) | [SC-002](#sc-002-validates-fr-002-fr-004), [SC-003](#sc-003-validates-fr-006) | [`src/data.py::load_local_documents()`](../../src/data.py), [`src/data.py::load_github_documents()`](../../src/data.py), [`src/data.py::chunk_documents()`](../../src/data.py), [`src/data.py::create_vectorstore()`](../../src/data.py) | [`test_rear_knowledge_contains_it245()`](../../src/test.py), [`test_carol_knowledge_contains_product()`](../../src/test.py), [`test_github_relative_links_converted_to_absolute_urls()`](../../src/test.py) |
 | [**FR-003**](#fr-003-first-person-persona) (First-Person Persona) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1), [US2](#user-story-2---interact-across-multiple-conversation-topics-priority-p2) | [SC-001](#sc-001-validates-fr-003) | [`src/agent.py::create_ai_me_agent()`](../../src/agent.py), [`src/agent.py::run()`](../../src/agent.py) | [`test_rear_knowledge_contains_it245()`](../../src/test.py), [`test_carol_knowledge_contains_product()`](../../src/test.py), [`test_user_story_2_multi_topic_consistency()`](../../src/test.py) |
-| [**FR-004**](#fr-004-source-attribution) (Source Attribution) | [US3](#user-story-3---access-sourced-information-with-attribution-priority-p2) | [SC-002](#sc-002-validates-fr-002-fr-004) | [`src/data.py::process_documents()`](../../src/data.py), [`src/agent.py::get_local_info_tool()`](../../src/agent.py) | [`test_github_relative_links_converted_to_absolute_urls()`](../../src/test.py)
+| [**FR-004**](#fr-004-source-attribution) (Source Attribution) | [US3](#user-story-3---access-sourced-information-with-attribution-priority-p2) | [SC-002](#sc-002-validates-fr-002-fr-004) | [`src/data.py::process_documents()`](../../src/data.py), [`src/agent.py::get_local_info_tool()`](../../src/agent.py) | [`test_github_relative_links_converted_to_absolute_urls()`](../../src/test.py) |
 | [**FR-005**](#fr-005-session-history) (Session History) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1), [US2](#user-story-2---interact-across-multiple-conversation-topics-priority-p2) | [SC-005](#sc-005-validates-nfr-001) | [`src/app.py::chat()`](../../src/app.py) | [`test_user_story_2_multi_topic_consistency()`](../../src/test.py) |
 | [**FR-006**](#fr-006-knowledge-gap-handling) (Knowledge Gap Handling) | [US2](#user-story-2---interact-across-multiple-conversation-topics-priority-p2) | [SC-003](#sc-003-validates-fr-006), [SC-004](#sc-004-validates-fr-006) | [`src/agent.py::run()`](../../src/agent.py) | [`test_unknown_person_contains_negative_response()`](../../src/test.py) |
 | [**FR-007**](#fr-007-session-isolation) (Session Isolation) | [US2](#user-story-2---interact-across-multiple-conversation-topics-priority-p2) | [SC-006](#sc-006-validates-nfr-002) | [`src/app.py::initialize_session()`](../../src/app.py), [`src/app.py::get_session_status()`](../../src/app.py), [`src/app.py::chat()`](../../src/app.py) | [`test_user_story_2_multi_topic_consistency()`](../../src/test.py) |
 | [**FR-008**](#fr-008-output-normalization) (Output Normalization) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1) | [SC-001](#sc-001-validates-fr-003) | [`src/agent.py::run()`](../../src/agent.py) | All tests (implicit in response validation) |
 | [**FR-009**](#fr-009-mandatory-tools) (Mandatory Tools) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1) | [SC-005](#sc-005-validates-nfr-001), [SC-007](#sc-007-validates-fr-012) | [`src/agent.py::mcp_time_params`](../../src/agent.py), [`src/agent.py::get_mcp_memory_params()`](../../src/agent.py), [`src/agent.py::setup_mcp_servers()`](../../src/agent.py) | [`test_mcp_time_server_returns_current_date()`](../../src/test.py), [`test_mcp_memory_server_remembers_favorite_color()`](../../src/test.py) |
 | [**FR-010**](#fr-010-optional-tools) (Optional Tools) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1) | [SC-007](#sc-007-validates-fr-012) | [`src/agent.py::mcp_github_params`](../../src/agent.py), [`src/agent.py::setup_mcp_servers()`](../../src/agent.py), [`src/data.py::load_github_documents()`](../../src/data.py) | [`test_github_commits_contains_shas()`](../../src/test.py) |
-| [**FR-011**](#fr-011-conflict-resolution--logging) (Conflict Resolution) | [US3](#user-story-3---access-sourced-information-with-attribution-priority-p2) | [SC-002](#sc-002-validates-fr-002-fr-004) | [`src/data.py::load_github_documents()`](../../src/data.py), [`src/agent.py::get_local_info_tool()`](../../src/agent.py) | [`
+| [**FR-011**](#fr-011-conflict-resolution--logging) (Conflict Resolution) | [US3](#user-story-3---access-sourced-information-with-attribution-priority-p2) | [SC-002](#sc-002-validates-fr-002-fr-004) | [`src/data.py::load_github_documents()`](../../src/data.py), [`src/agent.py::get_local_info_tool()`](../../src/agent.py) | [`test_github_relative_links_converted_to_absolute_urls()`](../../src/test.py) |
 | [**FR-012**](#fr-012-tool-error-handling) (Tool Error Handling) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1), [US3](#user-story-3---access-sourced-information-with-attribution-priority-p2) | [SC-007](#sc-007-validates-fr-012) | [`src/agent.py::setup_mcp_servers()`](../../src/agent.py), [`src/agent.py::run()`](../../src/agent.py), [`src/agent.py::cleanup()`](../../src/agent.py) | [`test_tool_failure_error_messages_are_friendly()`](../../src/test.py) |
 | [**FR-013**](#fr-013-memory-tool) (Memory Tool) | [US2](#user-story-2---interact-across-multiple-conversation-topics-priority-p2) | [SC-008](#sc-008-validates-fr-013) | [`src/agent.py::get_mcp_memory_params()`](../../src/agent.py) | [`test_mcp_memory_server_remembers_favorite_color()`](../../src/test.py) |
 
@@ -224,7 +224,7 @@ A user asks the agent a question, and the agent provides a response with clear r
 |---|---|---|---|---|
 | [**NFR-001**](#nfr-001-sub-5s-response) (Sub-5s Response) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1), [US2](#user-story-2---interact-across-multiple-conversation-topics-priority-p2) | [SC-005](#sc-005-validates-nfr-001) | [`src/agent.py::run()`](../../src/agent.py), [`src/data.py::create_vectorstore()`](../../src/data.py) | [`test_mcp_time_server_returns_current_date()`](../../src/test.py) |
 | [**NFR-002**](#nfr-002-10-concurrent-sessions) (10+ Concurrent Sessions) | [US2](#user-story-2---interact-across-multiple-conversation-topics-priority-p2) | [SC-006](#sc-006-validates-nfr-002) | [`src/app.py::initialize_session()`](../../src/app.py), [`src/app.py::chat()`](../../src/app.py), [`src/agent.py::AIMeAgent`](../../src/agent.py) | [`test_user_story_2_multi_topic_consistency()`](../../src/test.py), [`test_mcp_memory_server_remembers_favorite_color()`](../../src/test.py) |
-| [**NFR-003**](#nfr-003-structured-logging) (Structured Logging) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1), [US3](#user-story-3---access-sourced-information-with-attribution-priority-p2) | [SC-007](#sc-007-validates-fr-012) | [`src/config.py::setup_logger()`](../../src/config.py), [`src/agent.py::run()`](../../src/agent.py), [`src/app.py::chat()`](../../src/app.py) | [`
+| [**NFR-003**](#nfr-003-structured-logging) (Structured Logging) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1), [US3](#user-story-3---access-sourced-information-with-attribution-priority-p2) | [SC-007](#sc-007-validates-fr-012) | [`src/config.py::setup_logger()`](../../src/config.py), [`src/agent.py::run()`](../../src/agent.py), [`src/app.py::chat()`](../../src/app.py) | [`test_github_relative_links_converted_to_absolute_urls()`](../../src/test.py), [`test_tool_failure_error_messages_are_friendly()`](../../src/test.py) |
 | [**NFR-004**](#nfr-004-unicode-normalization) (Unicode Normalization) | [US1](#user-story-1---chat-with-personified-agent-about-expertise-priority-p1) | [SC-001](#sc-001-validates-fr-003) | [`src/agent.py::run()`](../../src/agent.py) | All tests (implicit in response validation) |
 | [**NFR-005**](#nfr-005-session-isolation) (Session Isolation) | [US2](#user-story-2---interact-across-multiple-conversation-topics-priority-p2) | [SC-006](#sc-006-validates-nfr-002) | [`src/app.py::initialize_session()`](../../src/app.py), [`src/agent.py::cleanup()`](../../src/agent.py) | [`test_user_story_2_multi_topic_consistency()`](../../src/test.py) |
 
specs/002-linkedin-profile-extractor/INTEGRATION_GUIDE.md
CHANGED
@@ -301,9 +301,10 @@ Only **publicly visible** LinkedIn data:
 **Solution**:
 ```bash
 # Test data loading
-python -c "from src.data import DataManager;
+python -c "from src.data import DataManager, DataManagerConfig;
+config = DataManagerConfig();
+dm = DataManager(config=config);
+docs = dm.load_local_documents();
 print(f'Loaded {len(docs)} documents')"
 ```
 
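
The same check reads more easily as a short script than as a `python -c` one-liner. A minimal sketch, assuming the API shown in the corrected snippet above (`DataManagerConfig()` defaults and `DataManager(config=...).load_local_documents()` returning a list of documents):

```python
# Equivalent of the corrected one-liner, run from the project root.
from src.data import DataManager, DataManagerConfig

config = DataManagerConfig()          # default configuration, as in the guide
dm = DataManager(config=config)
docs = dm.load_local_documents()      # loads the local markdown documents
print(f"Loaded {len(docs)} documents")
```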
src/agent.py
CHANGED
|
@@ -3,16 +3,19 @@ Agent configuration and MCP server setup.
|
|
| 3 |
Handles agent-specific configuration like MCP servers and prompts.
|
| 4 |
"""
|
| 5 |
import json
|
|
|
|
|
|
|
| 6 |
import traceback
|
| 7 |
-
from typing import List, Dict, Any, Optional
|
| 8 |
|
| 9 |
from pydantic import BaseModel, Field, computed_field, ConfigDict, SecretStr
|
| 10 |
from agents import Agent, Tool, function_tool, Runner
|
| 11 |
from agents.result import RunResult
|
| 12 |
from agents.run import RunConfig
|
| 13 |
-
from agents.mcp import MCPServerStdio
|
|
|
|
| 14 |
from config import setup_logger
|
| 15 |
-
|
| 16 |
logger = setup_logger(__name__)
|
| 17 |
|
| 18 |
# Unicode normalization translation table - built once, reused for all responses
|
|
@@ -51,6 +54,173 @@ class AIMeAgent(BaseModel):
|
|
| 51 |
|
| 52 |
model_config = ConfigDict(arbitrary_types_allowed=True)
|
| 53 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 54 |
@computed_field
|
| 55 |
@property
|
| 56 |
def mcp_github_params(self) -> MCPServerParams:
|
|
@@ -65,8 +235,6 @@ class AIMeAgent(BaseModel):
|
|
| 65 |
The official version supports --toolsets and --read-only flags.
|
| 66 |
We use read-only mode with a limited toolset for safety.
|
| 67 |
"""
|
| 68 |
-
import os
|
| 69 |
-
|
| 70 |
# Use local binary for testing, production path in Docker
|
| 71 |
test_binary = "/tmp/test-github-mcp/github-mcp-server"
|
| 72 |
prod_binary = "/app/bin/github-mcp-server"
|
|
@@ -118,9 +286,6 @@ class AIMeAgent(BaseModel):
|
|
| 118 |
Returns:
|
| 119 |
MCPServerParams configured with session-specific memory file
|
| 120 |
"""
|
| 121 |
-
import tempfile
|
| 122 |
-
import os
|
| 123 |
-
|
| 124 |
# Create session-specific memory file in temp directory
|
| 125 |
temp_dir = tempfile.gettempdir()
|
| 126 |
memory_file = os.path.join(temp_dir, f"mcp_memory_{session_id}.json")
|
|
@@ -132,117 +297,52 @@ class AIMeAgent(BaseModel):
|
|
| 132 |
description="Memory MCP Server"
|
| 133 |
)
|
| 134 |
|
| 135 |
-
@computed_field
|
| 136 |
@property
|
| 137 |
def agent_prompt(self) -> str:
|
| 138 |
-
"""Generate agent prompt
|
| 139 |
return f"""
|
| 140 |
You are acting as somebody who is personifying {self.bot_full_name}.
|
| 141 |
Your primary role is to help users by answering questions about my knowledge,
|
| 142 |
-
experience, and expertise in technology.
|
| 143 |
-
these rules:
|
| 144 |
-
- always refer to yourself as {self.bot_full_name} or "I".
|
| 145 |
-
- When talking about a prior current or prior employer indicate the relationship
|
| 146 |
-
clearly. For example: Neosofia (my current employer) or Medidata (a prior
|
| 147 |
-
employer).
|
| 148 |
-
- You should be personable, friendly, and professional in your responses.
|
| 149 |
-
- You should note information about the user in your memory to improve future
|
| 150 |
-
interactions.
|
| 151 |
-
- You should use the tools available to you to look up information as needed.
|
| 152 |
-
- If the user asks a question ALWAYS USE THE get_local_info tool ONCE to gather
|
| 153 |
-
info from my documentation (this is RAG-based)
|
| 154 |
-
- Format file references as complete GitHub URLs with owner, repo, path, and
|
| 155 |
-
filename
|
| 156 |
-
- Example: https://github.com/owner/repo/blob/main/filename.md
|
| 157 |
-
- Never use shorthand like: filename.md†L44-L53 or source†L44-L53
|
| 158 |
-
- Always strip out line number references
|
| 159 |
-
- CRITICAL: Include source citations in your response to establish credibility
|
| 160 |
-
and traceability. Format citations as:
|
| 161 |
-
- For GitHub sources: "Per my [document_name]..." or "As mentioned in [document_name]..."
|
| 162 |
-
- For local sources: "According to my documentation on [topic]..."
|
| 163 |
-
- Include the source URL in parentheses when available
|
| 164 |
-
- Example: "Per my resume (https://github.com/byoung/ai-me/blob/main/resume.md), I worked at..."
|
| 165 |
-
- Add reference links in a references section at the end of the output if they
|
| 166 |
-
match github.com
|
| 167 |
-
- Below are critical instructions for using your memory and GitHub tools
|
| 168 |
-
effectively.
|
| 169 |
-
|
| 170 |
-
MEMORY USAGE - MANDATORY WORKFLOW FOR EVERY USER MESSAGE:
|
| 171 |
-
1. FIRST ACTION - Read Current Memory:
|
| 172 |
-
- Call read_graph() to see ALL existing entities and their observations
|
| 173 |
-
- This prevents errors when adding observations to entities
|
| 174 |
-
2. User Identification:
|
| 175 |
-
- Assume you are interacting with a user entity (e.g., "user_john" if they
|
| 176 |
-
say "I'm John")
|
| 177 |
-
- If the user entity doesn't exist in the graph yet, you MUST create it first
|
| 178 |
-
3. Gather New Information:
|
| 179 |
-
- Pay attention to new information about the user:
|
| 180 |
-
a) Basic Identity (name, age, gender, location, job title, education, etc.)
|
| 181 |
-
b) Behaviors (interests, habits, activities, etc.)
|
| 182 |
-
c) Preferences (communication style, preferred language, topics of
|
| 183 |
-
interest, etc.)
|
| 184 |
-
d) Goals (aspirations, targets, objectives, etc.)
|
| 185 |
-
e) Relationships (personal and professional connections)
|
| 186 |
-
4. Update Memory - CRITICAL ORDER:
|
| 187 |
-
- STEP 1: Create missing entities using create_entities() for any new
|
| 188 |
-
people, organizations, or events
|
| 189 |
-
- STEP 2: ONLY AFTER entities exist, add facts using add_observations() to
|
| 190 |
-
existing entities
|
| 191 |
-
- STEP 3: Connect related entities using create_relations()
|
| 192 |
-
EXAMPLE - User says "Hi, I'm Alice":
|
| 193 |
-
✓ Correct order:
|
| 194 |
-
1. read_graph() - check if user_alice exists
|
| 195 |
-
2. create_entities(entities=[{{"name": "user_alice", "entityType": "person",
|
| 196 |
-
"observations": ["Name is Alice"]}}])
|
| 197 |
-
3. respond to user
|
| 198 |
-
✗ WRONG - will cause errors:
|
| 199 |
-
1. add_observations(entityName="user_alice",
|
| 200 |
-
observations=["Name is Alice"]) - ERROR: entity not found!
|
| 201 |
-
ALWAYS create entities BEFORE adding observations to them.
|
| 202 |
|
| 203 |
-
|
| 204 |
-
|
| 205 |
-
|
| 206 |
-
|
| 207 |
-
|
| 208 |
-
|
| 209 |
-
|
| 210 |
-
-
|
| 211 |
-
|
| 212 |
-
|
| 213 |
-
|
| 214 |
-
|
| 215 |
-
-
|
| 216 |
-
-
|
| 217 |
-
-
|
| 218 |
-
|
| 219 |
-
|
| 220 |
-
-
|
| 221 |
-
-
|
| 222 |
-
-
|
| 223 |
-
|
| 224 |
-
|
| 225 |
-
|
| 226 |
-
|
| 227 |
-
|
| 228 |
-
|
| 229 |
-
|
| 230 |
-
|
| 231 |
-
|
| 232 |
-
-
|
| 233 |
-
|
| 234 |
-
- get_file_contents(owner="byoung", repo="ai-me", path="README.md")
|
| 235 |
-
EXAMPLES OF INCORRECT get_file_contents USAGE (NEVER DO THIS):
|
| 236 |
-
- get_file_contents(owner="Neosofia", repo="corporate",
|
| 237 |
-
path="website/qms/policies.md", ref="main")
|
| 238 |
-
- get_file_contents(owner="byoung", repo="ai-me", path="README.md",
|
| 239 |
-
ref="master")
|
| 240 |
"""
|
| 241 |
|
| 242 |
async def setup_mcp_servers(self, mcp_params_list: List[MCPServerParams]):
|
| 243 |
-
"""Initialize and connect all MCP servers from provided parameters
|
| 244 |
|
| 245 |
-
Implements FR-009 (Mandatory Tools), FR-010 (Optional Tools),
|
|
|
|
| 246 |
"""
|
| 247 |
|
| 248 |
mcp_servers_local = []
|
|
@@ -254,12 +354,14 @@ EXAMPLES OF INCORRECT get_file_contents USAGE (NEVER DO THIS):
|
|
| 254 |
logger.debug(f"Args: {params.args}")
|
| 255 |
logger.debug(f"Env vars: {list(params.env.keys()) if params.env else 'None'}")
|
| 256 |
|
| 257 |
-
|
|
|
|
|
|
|
| 258 |
await server.connect()
|
| 259 |
logger.info(f"✓ {server_name} connected successfully")
|
| 260 |
mcp_servers_local.append(server)
|
| 261 |
|
| 262 |
-
except Exception as e:
|
| 263 |
logger.error(f"✗ {server_name} failed to connect")
|
| 264 |
logger.error(f" Error type: {type(e).__name__}")
|
| 265 |
logger.error(f" Error message: {e}")
|
|
@@ -318,59 +420,111 @@ EXAMPLES OF INCORRECT get_file_contents USAGE (NEVER DO THIS):
|
|
| 318 |
|
| 319 |
async def create_ai_me_agent(
|
| 320 |
self,
|
| 321 |
-
agent_prompt: str = None,
|
| 322 |
mcp_params: Optional[List[MCPServerParams]] = None,
|
| 323 |
-
additional_tools: Optional[List[Tool]] = None,
|
| 324 |
) -> Agent:
|
| 325 |
-
"""Create the main ai-me agent.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 326 |
|
| 327 |
-
Implements FR-001 (Chat Interface), FR-003 (First-Person Persona), FR-009 (Mandatory Tools),
|
| 328 |
-
FR-010 (Optional Tools).
|
| 329 |
-
|
| 330 |
Args:
|
| 331 |
-
|
| 332 |
-
|
| 333 |
-
|
| 334 |
-
|
| 335 |
-
|
| 336 |
-
additional_tools: Optional list of additional tools to append to
|
| 337 |
-
the default get_local_info tool. The get_local_info tool is
|
| 338 |
-
always included as the first tool.
|
| 339 |
Returns:
|
| 340 |
An initialized Agent instance.
|
| 341 |
"""
|
| 342 |
# Setup MCP servers if any params provided
|
| 343 |
-
mcp_servers = await self.setup_mcp_servers(mcp_params) if mcp_params else
|
| 344 |
|
| 345 |
# Store MCP servers for cleanup
|
| 346 |
-
|
| 347 |
-
self._mcp_servers = mcp_servers
|
| 348 |
|
| 349 |
-
#
|
| 350 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 351 |
logger.debug(f"Creating ai-me agent with prompt: {prompt[:100]}...")
|
| 352 |
|
| 353 |
# Build tools list - get_local_info is always the default first tool
|
| 354 |
tools = [self.get_local_info_tool()]
|
| 355 |
-
|
| 356 |
-
# Append any additional tools provided
|
| 357 |
-
if additional_tools:
|
| 358 |
-
tools.extend(additional_tools)
|
| 359 |
|
| 360 |
logger.info(f"Creating ai-me agent with tools: {[tool.name for tool in tools]}")
|
| 361 |
|
| 362 |
-
#
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 363 |
agent_kwargs = {
|
| 364 |
"model": self.model,
|
| 365 |
"name": "ai-me",
|
| 366 |
"instructions": prompt,
|
| 367 |
"tools": tools,
|
| 368 |
}
|
| 369 |
-
|
| 370 |
-
# Only add mcp_servers if we have them
|
| 371 |
if mcp_servers:
|
| 372 |
agent_kwargs["mcp_servers"] = mcp_servers
|
| 373 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 374 |
ai_me = Agent(**agent_kwargs)
|
| 375 |
|
| 376 |
# Print all available tools after agent initialization
|
|
@@ -394,8 +548,10 @@ EXAMPLES OF INCORRECT get_file_contents USAGE (NEVER DO THIS):
|
|
| 394 |
async def run(self, user_input: str, **runner_kwargs) -> str:
|
| 395 |
"""Run the agent and post-process output to remove Unicode brackets.
|
| 396 |
|
| 397 |
-
Implements FR-001 (Chat Interface), FR-003 (First-Person Persona),
|
| 398 |
-
FR-
|
|
|
|
|
|
|
| 399 |
"""
|
| 400 |
# Log user input with session context
|
| 401 |
session_prefix = f"[Session: {self.session_id[:8]}...] " if self.session_id else ""
|
|
@@ -411,8 +567,8 @@ EXAMPLES OF INCORRECT get_file_contents USAGE (NEVER DO THIS):
|
|
| 411 |
**runner_kwargs)
|
| 412 |
except Exception as e:
|
| 413 |
error_str = str(e).lower()
|
| 414 |
-
|
| 415 |
-
if "rate limit" in error_str or "api rate limit exceeded" in error_str:
|
| 416 |
logger.warning(f"{session_prefix}GitHub rate limit exceeded")
|
| 417 |
return "⚠️ GitHub rate limit exceeded. Try asking me again in 30 seconds"
|
| 418 |
else:
|
|
@@ -438,16 +594,13 @@ EXAMPLES OF INCORRECT get_file_contents USAGE (NEVER DO THIS):
|
|
| 438 |
|
| 439 |
Implements FR-012 (Tool Error Handling), NFR-005 (Session Isolation).
|
| 440 |
"""
|
| 441 |
-
if not self._mcp_servers:
|
| 442 |
-
return
|
| 443 |
-
|
| 444 |
session_prefix = f"[Session: {self.session_id[:8]}...] " if self.session_id else ""
|
| 445 |
logger.debug(f"{session_prefix}Cleaning up {len(self._mcp_servers)} MCP servers...")
|
| 446 |
|
| 447 |
for server in self._mcp_servers:
|
| 448 |
try:
|
| 449 |
await server.cleanup()
|
| 450 |
-
except Exception as e:
|
| 451 |
# Log but don't fail - best effort cleanup
|
| 452 |
logger.debug(f"{session_prefix}Error cleaning up MCP server: {e}")
|
| 453 |
|
|
|
|
| 3 |
Handles agent-specific configuration like MCP servers and prompts.
|
| 4 |
"""
|
| 5 |
import json
|
| 6 |
+
import os
|
| 7 |
+
import tempfile
|
| 8 |
import traceback
|
| 9 |
+
from typing import List, Dict, Any, Optional, ClassVar
|
| 10 |
|
| 11 |
from pydantic import BaseModel, Field, computed_field, ConfigDict, SecretStr
|
| 12 |
from agents import Agent, Tool, function_tool, Runner
|
| 13 |
from agents.result import RunResult
|
| 14 |
from agents.run import RunConfig
|
| 15 |
+
from agents.mcp import MCPServerStdio, MCPServerStdioParams
|
| 16 |
+
|
| 17 |
from config import setup_logger
|
| 18 |
+
|
| 19 |
logger = setup_logger(__name__)
|
| 20 |
|
| 21 |
# Unicode normalization translation table - built once, reused for all responses
|
|
|
|
| 54 |
|
| 55 |
model_config = ConfigDict(arbitrary_types_allowed=True)
|
| 56 |
|
| 57 |
+
# Static prompt sections - these don't need instance data
|
| 58 |
+
MEMORY_AGENT_PROMPT: ClassVar[str] = """
|
| 59 |
+
🚨 MEMORY MANAGEMENT - THIS SECTION MUST BE FOLLOWED EXACTLY 🚨
|
| 60 |
+
|
| 61 |
+
YOU MUST use the read_graph tool at the START of EVERY user interaction.
|
| 62 |
+
The read_graph tool is DIFFERENT from all other tools - it takes NO input parameters.
|
| 63 |
+
|
| 64 |
+
CRITICAL SYNTAX FOR read_graph:
|
| 65 |
+
- read_graph is called with ZERO arguments
|
| 66 |
+
- NO curly braces: read_graph
|
| 67 |
+
- NO parentheses with content: read_graph() ← This is the ONLY correct form
|
| 68 |
+
- NO empty object: read_graph({}) ← WRONG - will cause 400 error
|
| 69 |
+
- NO empty string key: read_graph({"": {}}) ← WRONG - will cause 400 error
|
| 70 |
+
- NO parameters at all: read_graph ← Correct but less clear
|
| 71 |
+
- The correct way: read_graph() with empty parentheses but NO content inside
|
| 72 |
+
|
| 73 |
+
When calling read_graph:
|
| 74 |
+
✅ CORRECT: read_graph() with nothing inside the parentheses
|
| 75 |
+
❌ WRONG: read_graph({}), read_graph({"":""}), read_graph(params={}), read_graph(data=None)
|
| 76 |
+
|
| 77 |
+
WORKFLOW FOR EVERY MESSAGE:
|
| 78 |
+
1. Call read_graph() immediately - retrieve all stored information
|
| 79 |
+
2. Check if "user" entity exists in the returned knowledge graph
|
| 80 |
+
3. If the user shares new information:
|
| 81 |
+
a) If "user" entity doesn't exist: create_entities(
|
| 82 |
+
entities=[{"name":"user","entityType":"person",
|
| 83 |
+
"observations":["..."]}])
|
| 84 |
+
b) If "user" entity exists: add_observations(
|
| 85 |
+
observations=[{"entityName":"user","contents":["..."]}])
|
| 86 |
+
4. If user asks about stored info: search read_graph results and respond
|
| 87 |
+
|
| 88 |
+
TOOLS REFERENCE:
|
| 89 |
+
- read_graph() ← Takes ZERO parameters, returns all stored data
|
| 90 |
+
- create_entities(entities=[...]) ← Takes entities array
|
| 91 |
+
- add_observations(observations=[...]) ← Takes observations array
|
| 92 |
+
- create_relations(relations=[...]) ← Takes relations array
|
| 93 |
+
|
| 94 |
+
EXAMPLES:
|
| 95 |
+
|
| 96 |
+
User says "My favorite color is blue":
|
| 97 |
+
1. read_graph() ← Call with empty parentheses
|
| 98 |
+
2. See if "user" entity exists
|
| 99 |
+
3. If not: create_entities(
|
| 100 |
+
entities=[{"name":"user","entityType":"person",
|
| 101 |
+
"observations":["favorite color is blue"]}])
|
| 102 |
+
4. If yes: add_observations(
|
| 103 |
+
observations=[{"entityName":"user",
|
| 104 |
+
"contents":["favorite color is blue"]}])
|
| 105 |
+
5. Reply: "Got it, I'll remember that your favorite color is blue."
|
| 106 |
+
|
| 107 |
+
User asks "What's my favorite color":
|
| 108 |
+
1. read_graph() ← Call with empty parentheses FIRST
|
| 109 |
+
2. Find "user" entity in returned graph
|
| 110 |
+
3. Look for observation about color
|
| 111 |
+
4. Reply with the stored information
|
| 112 |
+
|
| 113 |
+
MEMORY ENTITY STRUCTURE:
|
| 114 |
+
- Entity name: "user" (the user you're talking to)
|
| 115 |
+
- Entity type: "person"
|
| 116 |
+
- Observations: Array of facts about them (["likes red", "from NYC", "engineer"])
|
| 117 |
+
"""
|
| 118 |
+
|
| 119 |
+
GITHUB_RESEARCHER_PROMPT: ClassVar[str] = """
|
| 120 |
+
You are the GitHub Researcher, responsible for researching the Bot's professional
|
| 121 |
+
portfolio on GitHub.
|
| 122 |
+
|
| 123 |
+
Your responsibilities:
|
| 124 |
+
- Search for code, projects, and commits on GitHub
|
| 125 |
+
- Retrieve file contents from repositories
|
| 126 |
+
- Provide context about technical work and contributions
|
| 127 |
+
|
| 128 |
+
GITHUB TOOLS RESTRICTIONS - IMPORTANT:
|
| 129 |
+
DO NOT USE ANY GITHUB TOOL MORE THAN THREE TIMES PER REQUEST.
|
| 130 |
+
You have access to these GitHub tools ONLY:
|
| 131 |
+
- search_code: to look for code snippets and references supporting your
|
| 132 |
+
answers
|
| 133 |
+
- get_file_contents: for getting source code (NEVER download .md markdown
|
| 134 |
+
files)
|
| 135 |
+
- list_commits: for getting commit history for a specific user
|
| 136 |
+
|
| 137 |
+
CRITICAL RULES FOR search_code TOOL:
|
| 138 |
+
The search_code tool searches ALL of GitHub by default. You MUST add
|
| 139 |
+
owner/repo filters to EVERY search_code query.
|
| 140 |
+
REQUIRED FORMAT: Always include one of these filters in the query parameter:
|
| 141 |
+
- user:byoung (to search byoung's repos)
|
| 142 |
+
- org:Neosofia (to search Neosofia's repos)
|
| 143 |
+
- repo:byoung/ai-me (specific repo)
|
| 144 |
+
- repo:Neosofia/corporate (specific repo)
|
| 145 |
+
|
| 146 |
+
EXAMPLES OF CORRECT search_code USAGE:
|
| 147 |
+
- search_code(query="python user:byoung")
|
| 148 |
+
- search_code(query="docker org:Neosofia")
|
| 149 |
+
- search_code(query="ReaR repo:Neosofia/corporate")
|
| 150 |
+
|
| 151 |
+
EXAMPLES OF INCORRECT search_code USAGE (NEVER DO THIS):
|
| 152 |
+
- search_code(query="python")
|
| 153 |
+
- search_code(query="ReaR")
|
| 154 |
+
- search_code(query="bash script")
|
| 155 |
+
|
| 156 |
+
CRITICAL RULES FOR get_file_contents TOOL:
|
| 157 |
+
The get_file_contents tool accepts ONLY these parameters: owner, repo, path
|
| 158 |
+
DO NOT use 'ref' parameter - it will cause errors. The tool always reads from
|
| 159 |
+
the main/default branch.
|
| 160 |
+
|
| 161 |
+
EXAMPLES OF CORRECT get_file_contents USAGE:
|
| 162 |
+
- get_file_contents(owner="Neosofia", repo="corporate",
|
| 163 |
+
path="website/qms/policies.md")
|
| 164 |
+
- get_file_contents(owner="byoung", repo="ai-me", path="README.md")
|
| 165 |
+
|
| 166 |
+
EXAMPLES OF INCORRECT get_file_contents USAGE (NEVER DO THIS):
|
| 167 |
+
- get_file_contents(owner="Neosofia", repo="corporate",
|
| 168 |
+
path="website/qms/policies.md", ref="main")
|
| 169 |
+
- get_file_contents(owner="byoung", repo="ai-me", path="README.md",
|
| 170 |
+
ref="master")
|
| 171 |
+
"""
|
| 172 |
+
|
| 173 |
+
KB_RESEARCHER_PROMPT: ClassVar[str] = """
|
| 174 |
+
KNOWLEDGE BASE RESEARCH - MANDATORY TOOL USAGE:
|
| 175 |
+
|
| 176 |
+
You MUST use get_local_info tool to answer ANY questions about my background,
|
| 177 |
+
experience, skills, education, projects, or expertise.
|
| 178 |
+
|
| 179 |
+
🚨 CRITICAL RULES:
|
| 180 |
+
1. When user asks about your background, skills, languages, experience → ALWAYS use get_local_info
|
| 181 |
+
2. When you don't know something → use get_local_info before saying "I don't know"
|
| 182 |
+
3. When user asks personal/professional questions → ALWAYS search knowledge base first
|
| 183 |
+
4. Never say "I'm not familiar with that" without first trying get_local_info
|
| 184 |
+
|
| 185 |
+
MANDATORY WORKFLOW:
|
| 186 |
+
1. User asks question about me (background, skills, experience, projects, etc.)
|
| 187 |
+
2. IMMEDIATELY call: get_local_info(query="[user's question]")
|
| 188 |
+
3. Review ALL returned documents carefully
|
| 189 |
+
4. Formulate first-person response from the documents
|
| 190 |
+
5. Include source references (file paths or document titles)
|
| 191 |
+
|
| 192 |
+
TOOL USAGE:
|
| 193 |
+
- get_local_info(query="Python programming languages skills") →
|
| 194 |
+
retrieves all documents about my skills
|
| 195 |
+
- get_local_info(query="background experience") → retrieves background info
|
| 196 |
+
- get_local_info(query="projects I've worked on") → retrieves project info
|
| 197 |
+
|
| 198 |
+
EXAMPLES:
|
| 199 |
+
|
| 200 |
+
User asks: "What programming languages are you skilled in?"
|
| 201 |
+
1. Call: get_local_info(query="programming languages skills")
|
| 202 |
+
2. Search returned docs for language list
|
| 203 |
+
3. Respond: "I'm skilled in Python, Go, TypeScript, Rust, and SQL. I specialize in..."
|
| 204 |
+
4. Include source like: "(from team documentation)"
|
| 205 |
+
|
| 206 |
+
User asks: "What is your background in technology?"
|
| 207 |
+
1. Call: get_local_info(query="background experience technology")
|
| 208 |
+
2. Find relevant background information
|
| 209 |
+
3. Respond in first-person: "I specialize in backend systems and..."
|
| 210 |
+
4. Cite sources
|
| 211 |
+
|
| 212 |
+
CRITICAL - DO NOT:
|
| 213 |
+
❌ Say "I'm not familiar" without trying get_local_info first
|
| 214 |
+
❌ Refuse to answer without searching the knowledge base
|
| 215 |
+
❌ Make up information if get_local_info returns no results
|
| 216 |
+
|
| 217 |
+
Response Format:
|
| 218 |
+
- ALWAYS first-person (I, my, me)
|
| 219 |
+
- ALWAYS include source attribution
|
| 220 |
+
- ALWAYS use information from get_local_info results
|
| 221 |
+
- Format sources like: "(from team.md)" or "(from professional documentation)"
|
| 222 |
+
"""
|
| 223 |
+
|
| 224 |
@computed_field
|
| 225 |
@property
|
| 226 |
def mcp_github_params(self) -> MCPServerParams:
|
|
|
|
| 235 |
The official version supports --toolsets and --read-only flags.
|
| 236 |
We use read-only mode with a limited toolset for safety.
|
| 237 |
"""
|
|
|
|
|
|
|
| 238 |
# Use local binary for testing, production path in Docker
|
| 239 |
test_binary = "/tmp/test-github-mcp/github-mcp-server"
|
| 240 |
prod_binary = "/app/bin/github-mcp-server"
|
|
|
|
| 286 |
Returns:
|
| 287 |
MCPServerParams configured with session-specific memory file
|
| 288 |
"""
|
|
|
|
|
|
|
|
|
|
| 289 |
# Create session-specific memory file in temp directory
|
| 290 |
temp_dir = tempfile.gettempdir()
|
| 291 |
memory_file = os.path.join(temp_dir, f"mcp_memory_{session_id}.json")
|
|
|
|
| 297 |
description="Memory MCP Server"
|
| 298 |
)
|
| 299 |
|
|
|
|
| 300 |
@property
|
| 301 |
def agent_prompt(self) -> str:
|
| 302 |
+
"""Generate main agent prompt."""
|
| 303 |
return f"""
|
| 304 |
You are acting as somebody who is personifying {self.bot_full_name}.
|
| 305 |
Your primary role is to help users by answering questions about my knowledge,
|
| 306 |
+
experience, and expertise in technology.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 307 |
|
| 308 |
+
CRITICAL: You are NOT an all-knowing AI. You are personifying ME, {self.bot_full_name},
|
| 309 |
+
a specific person. You can ONLY answer based on MY documentation OR information about
|
| 310 |
+
the USER stored in memory. Do NOT use your general LLM training data to answer questions.
|
| 311 |
+
|
| 312 |
+
=== CRITICAL WORKFLOW FOR EVERY USER MESSAGE ===
|
| 313 |
+
|
| 314 |
+
1. **USER PERSONAL INFO** (they share or ask about THEIR information):
|
| 315 |
+
- User says "My favorite color is..." → Use memory tools to store in knowledge graph
|
| 316 |
+
- User asks "What's my favorite color?" → Use memory tools to retrieve from knowledge graph
|
| 317 |
+
- Call read_graph() immediately, then create_entities/add_observations for new info
|
| 318 |
+
|
| 319 |
+
2. **GITHUB/CODE QUERIES** (they ask about repositories, code, implementations):
|
| 320 |
+
- User asks "What's in repo X?" → Use GitHub search_code or get_file_contents tools
|
| 321 |
+
- User asks "Show me file Y" → Use get_file_contents to fetch content
|
| 322 |
+
- Use available GitHub tools to search and retrieve
|
| 323 |
+
|
| 324 |
+
3. **YOUR BACKGROUND/KNOWLEDGE** (they ask about you, {self.bot_full_name}):
|
| 325 |
+
- User asks "What's your experience?" → Use get_local_info to retrieve documentation
|
| 326 |
+
- User asks "Do you know Carol?" → Use get_local_info to search knowledge base
|
| 327 |
+
- ALWAYS use get_local_info FIRST before saying you don't know something
|
| 328 |
+
|
| 329 |
+
=== RESPONSE GUIDELINES ===
|
| 330 |
+
|
| 331 |
+
When formulating responses:
|
| 332 |
+
- Always refer to yourself as {self.bot_full_name} or "I"
|
| 333 |
+
- When mentioning employers: "Neosofia (my current employer)" or "Medidata (a prior employer)"
|
| 334 |
+
- Be personable, friendly, and professional
|
| 335 |
+
- Format GitHub URLs as complete paths: https://github.com/owner/repo/blob/main/path/file.md
|
| 336 |
+
- CRITICAL: Include source citations
|
| 337 |
+
- Example: "Per my resume (https://github.com/byoung/ai-me/blob/main/resume.md), I worked at..."
|
| 338 |
+
- Add reference links section at end if GitHub sources referenced
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 339 |
"""
|
| 340 |
|
| 341 |
async def setup_mcp_servers(self, mcp_params_list: List[MCPServerParams]):
|
| 342 |
+
"""Initialize and connect all MCP servers from provided parameters.
|
| 343 |
|
| 344 |
+
Implements FR-009 (Mandatory Tools), FR-010 (Optional Tools),
|
| 345 |
+
FR-012 (Tool Error Handling).
|
| 346 |
"""
|
| 347 |
|
| 348 |
mcp_servers_local = []
|
|
|
|
| 354 |
logger.debug(f"Args: {params.args}")
|
| 355 |
logger.debug(f"Env vars: {list(params.env.keys()) if params.env else 'None'}")
|
| 356 |
|
| 357 |
+
# Construct a strongly-typed MCPServerStdioParams from the Pydantic model dict
|
| 358 |
+
server_params = MCPServerStdioParams(**params.model_dump())
|
| 359 |
+
server = MCPServerStdio(server_params, client_session_timeout_seconds=30)
|
| 360 |
await server.connect()
|
| 361 |
logger.info(f"✓ {server_name} connected successfully")
|
| 362 |
mcp_servers_local.append(server)
|
| 363 |
|
| 364 |
+
except Exception as e: # pragma: no cover
|
| 365 |
logger.error(f"✗ {server_name} failed to connect")
|
| 366 |
logger.error(f" Error type: {type(e).__name__}")
|
| 367 |
logger.error(f" Error message: {e}")
|
|
|
|
| 420 |
|
| 421 |
async def create_ai_me_agent(
|
| 422 |
self,
|
|
|
|
| 423 |
mcp_params: Optional[List[MCPServerParams]] = None,
|
|
|
|
| 424 |
) -> Agent:
|
| 425 |
+
"""Create the main ai-me agent with organized instruction sections.
|
| 426 |
+
|
| 427 |
+
Implements FR-001 (Chat Interface), FR-003 (First-Person Persona),
|
| 428 |
+
FR-009 (Mandatory Tools), FR-010 (Optional Tools).
|
| 429 |
+
|
| 430 |
+
The agent prompt is organized into sections providing specialized
|
| 431 |
+
instructions for different capabilities:
|
| 432 |
+
- Main persona and response guidelines
|
| 433 |
+
- Memory management (always included)
|
| 434 |
+
- GitHub research (always included)
|
| 435 |
+
- Knowledge base research (always included via get_local_info)
|
| 436 |
+
- Time utilities (always included)
|
| 437 |
|
|
|
|
| 438 |
Args:
|
| 439 |
+
mcp_params: List of MCP server parameters to initialize.
|
| 440 |
+
Should include GitHub, Time, and Memory MCP servers.
|
| 441 |
+
Caller must pass get_mcp_memory_params(session_id) with a
|
| 442 |
+
unique session_id for proper session isolation.
|
| 443 |
+
|
|
|
|
| 444 |
Returns:
|
| 445 |
An initialized Agent instance.
|
| 446 |
"""
|
| 447 |
# Setup MCP servers if any params provided
|
| 448 |
+
mcp_servers = await self.setup_mcp_servers(mcp_params) if mcp_params else []
|
| 449 |
|
| 450 |
# Store MCP servers for cleanup
|
| 451 |
+
self._mcp_servers = mcp_servers
|
|
|
|
| 452 |
|
| 453 |
+
# Build comprehensive prompt from sections
|
| 454 |
+
# Start with main agent prompt
|
| 455 |
+
prompt_sections = [self.agent_prompt]
|
| 456 |
+
|
| 457 |
+
# Add KB Researcher instructions (always available)
|
| 458 |
+
prompt_sections.append("\n## Knowledge Base Research")
|
| 459 |
+
prompt_sections.append(self.KB_RESEARCHER_PROMPT)
|
| 460 |
+
|
| 461 |
+
# Add Time utility note (time server is always included)
|
| 462 |
+
prompt_sections.append("\n## Time Information")
|
| 463 |
+
prompt_sections.append(
|
| 464 |
+
"You have access to time tools for getting current "
|
| 465 |
+
"date/time information."
|
| 466 |
+
)
|
| 467 |
+
|
| 468 |
+
prompt = "\n".join(prompt_sections)
|
| 469 |
+
|
| 470 |
logger.debug(f"Creating ai-me agent with prompt: {prompt[:100]}...")
|
| 471 |
|
| 472 |
# Build tools list - get_local_info is always the default first tool
|
| 473 |
tools = [self.get_local_info_tool()]
|
|
|
|
| 474 |
|
| 475 |
logger.info(f"Creating ai-me agent with tools: {[tool.name for tool in tools]}")
|
| 476 |
|
| 477 |
+
# Separate GitHub and memory servers for sub-agent creation
|
| 478 |
+
github_mcp_servers = [s for s in mcp_servers if "github-mcp-server" in str(s)]
|
| 479 |
+
memory_mcp_servers = [s for s in mcp_servers if "server-memory" in str(s)]
|
| 480 |
+
time_mcp_servers = [s for s in mcp_servers if "mcp-server-time" in str(s)]
|
| 481 |
+
|
| 482 |
+
# Create GitHub sub-agent (always included)
|
| 483 |
+
github_agent = Agent(
|
| 484 |
+
name="github_agent",
|
| 485 |
+
handoff_description=(
|
| 486 |
+
"Handles GitHub research and code exploration"
|
| 487 |
+
),
|
| 488 |
+
instructions=self.GITHUB_RESEARCHER_PROMPT,
|
| 489 |
+
tools=[],
|
| 490 |
+
mcp_servers=github_mcp_servers,
|
| 491 |
+
model=self.model,
|
| 492 |
+
)
|
| 493 |
+
logger.info(
|
| 494 |
+
f"✓ GitHub sub-agent created with "
|
| 495 |
+
f"{len(github_mcp_servers)} MCP server(s)"
|
| 496 |
+
)
|
| 497 |
+
|
| 498 |
+
# Create Memory sub-agent (always included)
|
| 499 |
+
memory_agent = Agent(
|
| 500 |
+
name="memory_agent",
|
| 501 |
+
handoff_description="Handles memory management and knowledge graph operations",
|
| 502 |
+
instructions=self.MEMORY_AGENT_PROMPT,
|
| 503 |
+
tools=[],
|
| 504 |
+
mcp_servers=memory_mcp_servers,
|
| 505 |
+
model=self.model,
|
| 506 |
+
)
|
| 507 |
+
logger.info(
|
| 508 |
+
f"✓ Memory sub-agent created with "
|
| 509 |
+
f"{len(memory_mcp_servers)} MCP server(s)"
|
| 510 |
+
)
|
| 511 |
+
|
| 512 |
+
# Create main agent with ALL MCP servers for direct execution
|
| 513 |
+
# Sub-agents have specialized prompts but access same tools for reliability
|
| 514 |
agent_kwargs = {
|
| 515 |
"model": self.model,
|
| 516 |
"name": "ai-me",
|
| 517 |
"instructions": prompt,
|
| 518 |
"tools": tools,
|
| 519 |
}
|
| 520 |
+
|
|
|
|
| 521 |
if mcp_servers:
|
| 522 |
agent_kwargs["mcp_servers"] = mcp_servers
|
| 523 |
+
logger.info(f"✓ {len(mcp_servers)} MCP servers added to main agent")
|
| 524 |
+
|
| 525 |
+
# Add both sub-agents as handoffs (always included)
|
| 526 |
+
agent_kwargs["handoffs"] = [github_agent, memory_agent]
|
| 527 |
+
|
| 528 |
ai_me = Agent(**agent_kwargs)
|
| 529 |
|
| 530 |
# Print all available tools after agent initialization
|
|
|
|
| 548 |
async def run(self, user_input: str, **runner_kwargs) -> str:
|
| 549 |
"""Run the agent and post-process output to remove Unicode brackets.
|
| 550 |
|
| 551 |
+
Implements FR-001 (Chat Interface), FR-003 (First-Person Persona),
|
| 552 |
+
FR-008 (Output Normalization), FR-012 (Tool Error Handling),
|
| 553 |
+
NFR-001 (Sub-5s Response), NFR-003 (Structured Logging),
|
| 554 |
+
NFR-004 (Unicode Normalization).
|
| 555 |
"""
|
| 556 |
# Log user input with session context
|
| 557 |
session_prefix = f"[Session: {self.session_id[:8]}...] " if self.session_id else ""
|
|
|
|
| 567 |
**runner_kwargs)
|
| 568 |
except Exception as e:
|
| 569 |
error_str = str(e).lower()
|
| 570 |
+
|
| 571 |
+
if "rate limit" in error_str or "api rate limit exceeded" in error_str: # pragma: no cover
|
| 572 |
logger.warning(f"{session_prefix}GitHub rate limit exceeded")
|
| 573 |
return "⚠️ GitHub rate limit exceeded. Try asking me again in 30 seconds"
|
| 574 |
else:
|
|
|
|
| 594 |
|
| 595 |
Implements FR-012 (Tool Error Handling), NFR-005 (Session Isolation).
|
| 596 |
"""
|
|
|
|
| 597 |
session_prefix = f"[Session: {self.session_id[:8]}...] " if self.session_id else ""
|
| 598 |
logger.debug(f"{session_prefix}Cleaning up {len(self._mcp_servers)} MCP servers...")
|
| 599 |
|
| 600 |
for server in self._mcp_servers:
|
| 601 |
try:
|
| 602 |
await server.cleanup()
|
| 603 |
+
except Exception as e: # pragma: no cover
|
| 604 |
# Log but don't fail - best effort cleanup
|
| 605 |
logger.debug(f"{session_prefix}Error cleaning up MCP server: {e}")
|
| 606 |
|
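To make the refactored agent wiring above easier to follow, here is a minimal usage sketch. It is not part of the commit; the constructor arguments are inferred from src/app.py and the notebook elsewhere in this diff, so treat the exact field names as assumptions.

```python
# Minimal sketch (assumed API, based on src/app.py and the notebook in this diff):
# one AIMeAgent per session, MCP servers attached at creation time.
from agent import AIMeAgent


async def demo_session(config, vectorstore):
    session_agent = AIMeAgent(
        bot_full_name=config.bot_full_name,   # persona used in the agent prompt
        vectorstore=vectorstore,              # knowledge base behind get_local_info
        github_token=config.github_token,
        session_id="demo-session",            # enables session-scoped logging/memory
    )

    # Spins up the MCP servers and builds the main agent plus GitHub/Memory handoffs
    await session_agent.create_ai_me_agent(
        mcp_params=[
            session_agent.mcp_github_params,
            # session_agent.get_mcp_memory_params("demo-session"),  # per the docstring above
        ]
    )

    # run() logs with the session prefix, normalizes Unicode, and maps rate-limit errors
    return await session_agent.run("Hello!")
```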
src/app.py
CHANGED
|
@@ -1,19 +1,18 @@
|
|
|
|
| 1 |
from config import Config, setup_logger
|
| 2 |
from agent import AIMeAgent
|
| 3 |
from data import DataManager, DataManagerConfig
|
| 4 |
-
import gradio
|
| 5 |
-
from gradio import Request
|
| 6 |
|
| 7 |
logger = setup_logger(__name__)
|
| 8 |
|
| 9 |
-
config = Config()
|
| 10 |
|
| 11 |
# Initialize data manager and vectorstore
|
| 12 |
-
data_config = DataManagerConfig(
|
| 13 |
-
github_repos=config.github_repos
|
| 14 |
-
)
|
| 15 |
data_manager = DataManager(config=data_config)
|
| 16 |
-
vectorstore = data_manager.setup_vectorstore()
|
| 17 |
|
| 18 |
# Per-session agent storage (keyed by Gradio session_hash)
|
| 19 |
# Each session gets its own AIMeAgent instance with session-specific MCP servers
|
|
@@ -39,8 +38,7 @@ async def initialize_session(session_id: str) -> None:
|
|
| 39 |
session_id=session_id # Pass session_id for logging context
|
| 40 |
)
|
| 41 |
|
| 42 |
-
#
|
| 43 |
-
# references. The instructions are verbose because search_code tool is complex.
|
| 44 |
await session_agent.create_ai_me_agent(
|
| 45 |
mcp_params=[
|
| 46 |
session_agent.mcp_github_params,
|
|
@@ -55,7 +53,8 @@ async def initialize_session(session_id: str) -> None:
|
|
| 55 |
# Warmup: establish context and preload tools
|
| 56 |
try:
|
| 57 |
logger.info(f"[Session: {session_id[:8]}...] Running warmup...")
|
| 58 |
-
|
|
|
|
| 59 |
logger.info(f"[Session: {session_id[:8]}...] Warmup complete!")
|
| 60 |
except Exception as e:
|
| 61 |
logger.info(f"[Session: {session_id[:8]}...] Warmup failed: {e}")
|
|
@@ -67,6 +66,7 @@ async def get_session_status(request: Request):
|
|
| 67 |
Implements FR-001 (Chat Interface), FR-007 (Session Isolation).
|
| 68 |
"""
|
| 69 |
session_id = request.session_hash
|
|
|
|
| 70 |
if session_id not in session_agents:
|
| 71 |
await initialize_session(session_id)
|
| 72 |
return ""
|
|
@@ -78,6 +78,7 @@ async def chat(user_input: str, history, request: Request):
|
|
| 78 |
Implements FR-001 (Chat Interface), FR-005 (Session History), FR-007 (Session Isolation).
|
| 79 |
"""
|
| 80 |
session_id = request.session_hash
|
|
|
|
| 81 |
|
| 82 |
# Initialize agent for this session if not already done
|
| 83 |
if session_id not in session_agents:
|
|
@@ -97,7 +98,7 @@ if __name__ == "__main__":
|
|
| 97 |
custom_js = f.read()
|
| 98 |
|
| 99 |
with gradio.Blocks(
|
| 100 |
-
theme=
|
| 101 |
css=custom_css,
|
| 102 |
fill_height=True,
|
| 103 |
js=f"() => {{ {custom_js} }}"
|
|
|
|
| 1 |
+
import gradio
|
| 2 |
+
from gradio import Request, themes
|
| 3 |
+
|
| 4 |
from config import Config, setup_logger
|
| 5 |
from agent import AIMeAgent
|
| 6 |
from data import DataManager, DataManagerConfig
|
|
|
| 7 |
|
| 8 |
logger = setup_logger(__name__)
|
| 9 |
|
| 10 |
+
config = Config() # type: ignore
|
| 11 |
|
| 12 |
# Initialize data manager and vectorstore
|
| 13 |
+
data_config = DataManagerConfig()
|
|
|
| 14 |
data_manager = DataManager(config=data_config)
|
| 15 |
+
vectorstore = data_manager.setup_vectorstore(github_repos=config.github_repos) # type: ignore
|
| 16 |
|
| 17 |
# Per-session agent storage (keyed by Gradio session_hash)
|
| 18 |
# Each session gets its own AIMeAgent instance with session-specific MCP servers
|
|
|
|
| 38 |
session_id=session_id # Pass session_id for logging context
|
| 39 |
)
|
| 40 |
|
| 41 |
+
# Initialize agent with MCP servers for GitHub, Time, and Memory tools
|
|
|
|
| 42 |
await session_agent.create_ai_me_agent(
|
| 43 |
mcp_params=[
|
| 44 |
session_agent.mcp_github_params,
|
|
|
|
| 53 |
# Warmup: establish context and preload tools
|
| 54 |
try:
|
| 55 |
logger.info(f"[Session: {session_id[:8]}...] Running warmup...")
|
| 56 |
+
# Use a greeting that doesn't require tool calls or RAG retrieval
|
| 57 |
+
await session_agent.run("Hello!")
|
| 58 |
logger.info(f"[Session: {session_id[:8]}...] Warmup complete!")
|
| 59 |
except Exception as e:
|
| 60 |
logger.info(f"[Session: {session_id[:8]}...] Warmup failed: {e}")
|
|
|
|
| 66 |
Implements FR-001 (Chat Interface), FR-007 (Session Isolation).
|
| 67 |
"""
|
| 68 |
session_id = request.session_hash
|
| 69 |
+
assert session_id is not None, "session_hash should always be set by Gradio"
|
| 70 |
if session_id not in session_agents:
|
| 71 |
await initialize_session(session_id)
|
| 72 |
return ""
|
|
|
|
| 78 |
Implements FR-001 (Chat Interface), FR-005 (Session History), FR-007 (Session Isolation).
|
| 79 |
"""
|
| 80 |
session_id = request.session_hash
|
| 81 |
+
assert session_id is not None, "session_hash should always be set by Gradio"
|
| 82 |
|
| 83 |
# Initialize agent for this session if not already done
|
| 84 |
if session_id not in session_agents:
|
|
|
|
| 98 |
custom_js = f.read()
|
| 99 |
|
| 100 |
with gradio.Blocks(
|
| 101 |
+
theme=themes.Default(),
|
| 102 |
css=custom_css,
|
| 103 |
fill_height=True,
|
| 104 |
js=f"() => {{ {custom_js} }}"
|
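The session-isolation pattern in src/app.py above reduces to a dictionary keyed by Gradio's session_hash. A condensed, generalized sketch (illustrative only; the real module uses the session_agents dict and initialize_session coroutine shown in the diff):

```python
# Sketch of the per-session pattern: agents are created lazily and cached by session id,
# so each browser session gets its own isolated agent and MCP servers.
from typing import Awaitable, Callable, Dict, Protocol


class SessionAgent(Protocol):
    async def run(self, user_input: str) -> str: ...


session_agents: Dict[str, SessionAgent] = {}


async def chat(user_input: str, session_id: str,
               make_agent: Callable[[str], Awaitable[SessionAgent]]) -> str:
    # First message from a session lazily builds and caches its agent
    if session_id not in session_agents:
        session_agents[session_id] = await make_agent(session_id)
    return await session_agents[session_id].run(user_input)
```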
src/config.py
CHANGED
|
@@ -2,12 +2,12 @@
|
|
| 2 |
Configuration management for ai-me application.
|
| 3 |
Centralizes environment variables, API clients, and application defaults.
|
| 4 |
"""
|
| 5 |
-
import os
|
| 6 |
import logging
|
|
|
|
| 7 |
import socket
|
| 8 |
-
from typing import Optional, List, Union
|
| 9 |
from logging.handlers import QueueHandler, QueueListener
|
| 10 |
from queue import Queue
|
|
|
|
| 11 |
|
| 12 |
from pydantic import Field, field_validator, SecretStr
|
| 13 |
from pydantic_settings import BaseSettings, SettingsConfigDict
|
|
@@ -77,7 +77,7 @@ def setup_logger(name: str) -> logging.Logger:
|
|
| 77 |
loki_username = os.getenv('LOKI_USERNAME')
|
| 78 |
loki_password = os.getenv('LOKI_PASSWORD')
|
| 79 |
|
| 80 |
-
if loki_url and loki_username and loki_password:
|
| 81 |
try:
|
| 82 |
# Create async queue for non-blocking logging
|
| 83 |
log_queue = Queue(maxsize=1000) # Buffer up to 1000 log messages
|
|
@@ -109,18 +109,8 @@ def setup_logger(name: str) -> logging.Logger:
|
|
| 109 |
root_logger.addHandler(queue_handler)
|
| 110 |
|
| 111 |
root_logger.info(f"Grafana Loki logging enabled: {loki_url} (tags: {loki_tags})")
|
| 112 |
-
except Exception as e:
|
| 113 |
root_logger.warning(f"Failed to setup Grafana Loki logging: {e}")
|
| 114 |
-
else:
|
| 115 |
-
missing = []
|
| 116 |
-
if not loki_url:
|
| 117 |
-
missing.append("LOKI_URL")
|
| 118 |
-
if not loki_username:
|
| 119 |
-
missing.append("LOKI_USERNAME")
|
| 120 |
-
if not loki_password:
|
| 121 |
-
missing.append("LOKI_PASSWORD")
|
| 122 |
-
if missing:
|
| 123 |
-
root_logger.info(f"Loki logging disabled (missing: {', '.join(missing)})")
|
| 124 |
|
| 125 |
root_logger.setLevel(log_level)
|
| 126 |
|
|
@@ -134,6 +124,9 @@ class Config(BaseSettings):
|
|
| 134 |
"""Central configuration class for ai-me application with Pydantic validation."""
|
| 135 |
|
| 136 |
# Environment Variables (from .env) - Required
|
|
|
|
|
| 137 |
openai_api_key: SecretStr = Field(...,
|
| 138 |
description="OpenAI API key for tracing")
|
| 139 |
groq_api_key: SecretStr = Field(...,
|
|
@@ -155,6 +148,9 @@ class Config(BaseSettings):
|
|
| 155 |
temperature: float = Field(
|
| 156 |
default=1.0,
|
| 157 |
description="LLM temperature for sampling (0.0-2.0, default 1.0)")
|
|
|
|
|
| 158 |
github_repos: Union[str, List[str]] = Field(
|
| 159 |
default="",
|
| 160 |
description="GitHub repos to load (format: owner/repo), comma-separated in .env")
|
|
@@ -195,10 +191,14 @@ class Config(BaseSettings):
|
|
| 195 |
os.environ["TOKENIZERS_PARALLELISM"] = "false"
|
| 196 |
|
| 197 |
# Initialize Groq client for LLM operations
|
|
|
|
|
|
| 198 |
self.openai_client = AsyncOpenAI(
|
| 199 |
base_url="https://api.groq.com/openai/v1",
|
| 200 |
api_key=self.groq_api_key.get_secret_value(),
|
| 201 |
-
default_query=
|
| 202 |
)
|
| 203 |
set_default_openai_client(self.openai_client)
|
| 204 |
|
|
@@ -206,7 +206,7 @@ class Config(BaseSettings):
|
|
| 206 |
logger.info("Setting tracing export API key for agents.")
|
| 207 |
set_tracing_export_api_key(self.openai_api_key.get_secret_value())
|
| 208 |
|
| 209 |
-
def _safe_repr(self) -> str:
|
| 210 |
"""Helper to generate string representation excluding sensitive fields."""
|
| 211 |
lines = ["Config:"]
|
| 212 |
for field_name in type(self).model_fields:
|
|
@@ -216,14 +216,14 @@ class Config(BaseSettings):
|
|
| 216 |
lines.append(f" {field_name}: {display}")
|
| 217 |
return "\n".join(lines)
|
| 218 |
|
| 219 |
-
def __repr__(self) -> str:
|
| 220 |
"""Return string representation of Config with secrets hidden.
|
| 221 |
|
| 222 |
DEBUG: Debug utility for logging/debugging configuration state.
|
| 223 |
"""
|
| 224 |
return self._safe_repr()
|
| 225 |
|
| 226 |
-
def __str__(self) -> str:
|
| 227 |
"""Return human-readable string representation of Config with secrets hidden.
|
| 228 |
|
| 229 |
DEBUG: Debug utility for logging/debugging configuration state.
|
|
|
|
| 2 |
Configuration management for ai-me application.
|
| 3 |
Centralizes environment variables, API clients, and application defaults.
|
| 4 |
"""
|
|
|
|
| 5 |
import logging
|
| 6 |
+
import os
|
| 7 |
import socket
|
|
|
|
| 8 |
from logging.handlers import QueueHandler, QueueListener
|
| 9 |
from queue import Queue
|
| 10 |
+
from typing import Optional, List, Union
|
| 11 |
|
| 12 |
from pydantic import Field, field_validator, SecretStr
|
| 13 |
from pydantic_settings import BaseSettings, SettingsConfigDict
|
|
|
|
| 77 |
loki_username = os.getenv('LOKI_USERNAME')
|
| 78 |
loki_password = os.getenv('LOKI_PASSWORD')
|
| 79 |
|
| 80 |
+
if loki_url and loki_username and loki_password: # pragma: no cover
|
| 81 |
try:
|
| 82 |
# Create async queue for non-blocking logging
|
| 83 |
log_queue = Queue(maxsize=1000) # Buffer up to 1000 log messages
|
|
|
|
| 109 |
root_logger.addHandler(queue_handler)
|
| 110 |
|
| 111 |
root_logger.info(f"Grafana Loki logging enabled: {loki_url} (tags: {loki_tags})")
|
| 112 |
+
except Exception as e: # pragma: no cover
|
| 113 |
root_logger.warning(f"Failed to setup Grafana Loki logging: {e}")
|
|
|
|
|
| 114 |
|
| 115 |
root_logger.setLevel(log_level)
|
| 116 |
|
|
|
|
| 124 |
"""Central configuration class for ai-me application with Pydantic validation."""
|
| 125 |
|
| 126 |
# Environment Variables (from .env) - Required
|
| 127 |
+
# Note: These have no defaults, so they MUST be in .env or will raise ValidationError
|
| 128 |
+
# We don't provide defaults here because Pydantic will raise an error at runtime
|
| 129 |
+
# if they're missing from the environment, which is the intended behavior.
|
| 130 |
openai_api_key: SecretStr = Field(...,
|
| 131 |
description="OpenAI API key for tracing")
|
| 132 |
groq_api_key: SecretStr = Field(...,
|
|
|
|
| 148 |
temperature: float = Field(
|
| 149 |
default=1.0,
|
| 150 |
description="LLM temperature for sampling (0.0-2.0, default 1.0)")
|
| 151 |
+
seed: Optional[int] = Field(
|
| 152 |
+
default=None,
|
| 153 |
+
description="Random seed for deterministic outputs (optional, for testing)")
|
| 154 |
github_repos: Union[str, List[str]] = Field(
|
| 155 |
default="",
|
| 156 |
description="GitHub repos to load (format: owner/repo), comma-separated in .env")
|
|
|
|
| 191 |
os.environ["TOKENIZERS_PARALLELISM"] = "false"
|
| 192 |
|
| 193 |
# Initialize Groq client for LLM operations
|
| 194 |
+
default_query = {"temperature": self.temperature}
|
| 195 |
+
if self.seed is not None:
|
| 196 |
+
default_query["seed"] = self.seed
|
| 197 |
+
|
| 198 |
self.openai_client = AsyncOpenAI(
|
| 199 |
base_url="https://api.groq.com/openai/v1",
|
| 200 |
api_key=self.groq_api_key.get_secret_value(),
|
| 201 |
+
default_query=default_query
|
| 202 |
)
|
| 203 |
set_default_openai_client(self.openai_client)
|
| 204 |
|
|
|
|
| 206 |
logger.info("Setting tracing export API key for agents.")
|
| 207 |
set_tracing_export_api_key(self.openai_api_key.get_secret_value())
|
| 208 |
|
| 209 |
+
def _safe_repr(self) -> str: # pragma: no cover
|
| 210 |
"""Helper to generate string representation excluding sensitive fields."""
|
| 211 |
lines = ["Config:"]
|
| 212 |
for field_name in type(self).model_fields:
|
|
|
|
| 216 |
lines.append(f" {field_name}: {display}")
|
| 217 |
return "\n".join(lines)
|
| 218 |
|
| 219 |
+
def __repr__(self) -> str: # pragma: no cover
|
| 220 |
"""Return string representation of Config with secrets hidden.
|
| 221 |
|
| 222 |
DEBUG: Debug utility for logging/debugging configuration state.
|
| 223 |
"""
|
| 224 |
return self._safe_repr()
|
| 225 |
|
| 226 |
+
def __str__(self) -> str: # pragma: no cover
|
| 227 |
"""Return human-readable string representation of Config with secrets hidden.
|
| 228 |
|
| 229 |
DEBUG: Debug utility for logging/debugging configuration state.
|
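The temperature/seed plumbing added above is driven entirely by environment variables through pydantic-settings. A minimal sketch of how the integration tests exercise it (the required API keys must still be present in the environment or .env, otherwise Config() raises a ValidationError):

```python
# Sketch: deterministic sampling settings flow from env vars into the Groq-backed client.
import os

os.environ["TEMPERATURE"] = "0"   # maps to Config.temperature
os.environ["SEED"] = "42"         # maps to the new optional Config.seed field

from config import Config

config = Config()  # type: ignore  # also needs OPENAI_API_KEY / GROQ_API_KEY etc.
print(config.temperature, config.seed)  # -> 0.0 42
# Config then forwards these as default_query params when it constructs the AsyncOpenAI client.
```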
src/data.py
CHANGED
|
@@ -3,7 +3,10 @@ Document loading, processing, and vectorstore management for ai-me application.
|
|
| 3 |
from local directories and GitHub repositories, chunking, and creating ChromaDB vector stores.
|
| 4 |
"""
|
| 5 |
import os
|
|
|
|
| 6 |
from typing import List, Optional, Callable
|
|
|
|
| 7 |
from pydantic import BaseModel, Field
|
| 8 |
from langchain_community.document_loaders import (
|
| 9 |
DirectoryLoader,
|
|
@@ -16,8 +19,7 @@ from langchain_huggingface import HuggingFaceEmbeddings
|
|
| 16 |
from langchain_chroma import Chroma
|
| 17 |
import chromadb
|
| 18 |
from chromadb.config import Settings
|
| 19 |
-
|
| 20 |
-
import re
|
| 21 |
from config import setup_logger
|
| 22 |
|
| 23 |
logger = setup_logger(__name__)
|
|
@@ -27,11 +29,17 @@ class DataManagerConfig(BaseModel):
|
|
| 27 |
|
| 28 |
doc_load_local: List[str] = Field(
|
| 29 |
default=["**/*.md"], description="Glob patterns for local docs (e.g., ['*.md'])")
|
| 30 |
-
github_repos: List[str] = Field(
|
| 31 |
-
default=[], description="List of GitHub repos (format: owner/repo)")
|
| 32 |
doc_root: str = Field(
|
| 33 |
-
default=
|
| 34 |
-
|
|
|
|
| 35 |
chunk_size: int = Field(
|
| 36 |
default=2500, description="Character chunk size for splitting")
|
| 37 |
chunk_overlap: int = Field(
|
|
@@ -49,33 +57,19 @@ class DataManager:
|
|
| 49 |
parameters have sensible defaults and can be overridden as needed.
|
| 50 |
"""
|
| 51 |
|
| 52 |
-
def __init__(self, config:
|
| 53 |
"""
|
| 54 |
Initialize data manager with configuration.
|
| 55 |
|
| 56 |
Implements FR-002 (Knowledge Retrieval).
|
| 57 |
|
| 58 |
Args:
|
| 59 |
-
config:
|
| 60 |
-
from kwargs. For backward compatibility, can also pass individual parameters.
|
| 61 |
-
**kwargs: Individual config parameters (doc_load_local, github_repos, etc.)
|
| 62 |
-
Used when config is not provided or to override config values.
|
| 63 |
"""
|
| 64 |
-
|
| 65 |
-
# Create config from kwargs for backward compatibility
|
| 66 |
-
self.config = DataManagerConfig(**kwargs)
|
| 67 |
-
else:
|
| 68 |
-
# Use provided config, but allow kwargs to override
|
| 69 |
-
if kwargs:
|
| 70 |
-
# Merge provided config with overrides
|
| 71 |
-
config_dict = config.model_dump()
|
| 72 |
-
config_dict.update(kwargs)
|
| 73 |
-
self.config = DataManagerConfig(**config_dict)
|
| 74 |
-
else:
|
| 75 |
-
self.config = config
|
| 76 |
|
| 77 |
# Internal state
|
| 78 |
-
self.
|
| 79 |
self._embeddings: Optional[HuggingFaceEmbeddings] = None
|
| 80 |
|
| 81 |
def load_local_documents(self) -> List[Document]:
|
|
@@ -109,7 +103,7 @@ class DataManager:
|
|
| 109 |
documents = loader.load()
|
| 110 |
logger.info(f" Found {len(documents)} documents")
|
| 111 |
all_documents.extend(documents)
|
| 112 |
-
except Exception as e:
|
| 113 |
logger.info(
|
| 114 |
f" Error loading pattern {pattern}: {e}"
|
| 115 |
f" - skipping this pattern"
|
|
@@ -119,8 +113,11 @@ class DataManager:
|
|
| 119 |
logger.info(f"Loaded {len(all_documents)} total local documents.")
|
| 120 |
return all_documents
|
| 121 |
|
| 122 |
-
def
|
| 123 |
-
|
|
|
|
|
| 124 |
) -> List[Document]:
|
| 125 |
"""
|
| 126 |
Load documents from GitHub repositories.
|
|
@@ -129,28 +126,29 @@ class DataManager:
|
|
| 129 |
|
| 130 |
Args:
|
| 131 |
repos: List of repos (owner/repo format). Defaults to github_repos from init.
|
| 132 |
-
file_filter: Optional filter function for files.
|
|
|
|
| 133 |
cleanup_tmp: If True, remove tmp directory before loading.
|
| 134 |
|
| 135 |
Returns:
|
| 136 |
List of loaded documents from all repos.
|
| 137 |
"""
|
| 138 |
-
if repos is None:
|
| 139 |
-
repos = self.config.github_repos
|
| 140 |
|
|
|
|
| 141 |
if file_filter is None:
|
| 142 |
-
def
|
| 143 |
-
"""
|
| 144 |
|
| 145 |
-
Implements FR-002 (Knowledge Retrieval): Filters
|
|
|
| 146 |
"""
|
| 147 |
-
fp_lower = fp.lower()
|
| 148 |
basename = os.path.basename(fp).lower()
|
| 149 |
-
#
|
| 150 |
-
|
| 151 |
-
|
| 152 |
-
|
| 153 |
-
|
| 154 |
|
| 155 |
all_docs = []
|
| 156 |
# Clean up tmp directory before loading
|
|
@@ -160,25 +158,42 @@ class DataManager:
|
|
| 160 |
logger.info(f"Cleaning up existing tmp directory: {tmp_dir}")
|
| 161 |
shutil.rmtree(tmp_dir)
|
| 162 |
|
| 163 |
-
|
| 164 |
-
|
|
|
|
|
|
|
| 165 |
logger.info(f"Loading GitHub repo: {repo}")
|
| 166 |
try:
|
|
|
|
|
|
|
| 167 |
loader = GitLoader(
|
| 168 |
clone_url=f"https://github.com/{repo}",
|
| 169 |
-
repo_path=
|
| 170 |
-
file_filter=file_filter,
|
| 171 |
branch="main",
|
| 172 |
)
|
| 173 |
-
|
|
|
|
|
| 174 |
|
| 175 |
# Add repo metadata to each document
|
| 176 |
for doc in docs:
|
| 177 |
doc.metadata["github_repo"] = repo
|
| 178 |
|
| 179 |
logger.info(f" Loaded {len(docs)} documents from {repo}")
|
| 180 |
-
all_docs.extend(docs)
|
| 181 |
-
except Exception as e:
|
| 182 |
logger.info(f" Error loading repo {repo}: {e} - skipping")
|
| 183 |
continue
|
| 184 |
|
|
@@ -237,6 +252,7 @@ class DataManager:
|
|
| 237 |
strip_headers=False)
|
| 238 |
|
| 239 |
all_chunks = []
|
|
|
|
| 240 |
for doc in documents:
|
| 241 |
# Split by headers first - this returns Documents with header metadata
|
| 242 |
header_chunks = header_splitter.split_text(doc.page_content)
|
|
@@ -245,6 +261,8 @@ class DataManager:
|
|
| 245 |
for chunk in header_chunks:
|
| 246 |
# Create new Document with combined metadata
|
| 247 |
merged_metadata = {**doc.metadata, **chunk.metadata}
|
|
|
| 248 |
new_doc = Document(
|
| 249 |
page_content=chunk.page_content,
|
| 250 |
metadata=merged_metadata,
|
|
@@ -255,10 +273,14 @@ class DataManager:
|
|
| 255 |
size_splitter = MarkdownTextSplitter(chunk_size=self.config.chunk_size)
|
| 256 |
final_chunks = size_splitter.split_documents(all_chunks)
|
| 257 |
|
|
|
|
|
|
| 258 |
logger.info(f"Created {len(final_chunks)} chunks")
|
| 259 |
return final_chunks
|
| 260 |
|
| 261 |
-
def load_and_process_all(self, github_repos: List[str] = None) -> List[Document]:
|
| 262 |
"""
|
| 263 |
Load, process, and chunk all documents. Automatically loads local documents
|
| 264 |
if doc_load_local is set, and GitHub documents if github_repos (or
|
|
@@ -279,10 +301,9 @@ class DataManager:
|
|
| 279 |
if self.config.doc_load_local:
|
| 280 |
all_docs.extend(self.load_local_documents())
|
| 281 |
|
| 282 |
-
# Load GitHub documents if repos are
|
| 283 |
-
|
| 284 |
-
|
| 285 |
-
all_docs.extend(self.load_github_documents(repos=repos_to_load))
|
| 286 |
|
| 287 |
processed_docs = self.process_documents(all_docs)
|
| 288 |
chunks = self.chunk_documents(processed_docs)
|
|
@@ -322,11 +343,11 @@ class DataManager:
|
|
| 322 |
chroma_client = chromadb.EphemeralClient(Settings(anonymized_telemetry=False))
|
| 323 |
|
| 324 |
# Drop existing collection if requested
|
| 325 |
-
if reset:
|
| 326 |
try:
|
| 327 |
chroma_client.delete_collection(self.config.db_name)
|
| 328 |
logger.info(f"Dropped existing collection: {self.config.db_name}")
|
| 329 |
-
except Exception:
|
| 330 |
pass # Collection doesn't exist yet
|
| 331 |
|
| 332 |
logger.info(f"Creating vectorstore with {len(chunks)} chunks...")
|
|
@@ -340,11 +361,11 @@ class DataManager:
|
|
| 340 |
count = vectorstore._collection.count()
|
| 341 |
logger.info(f"Vectorstore created with {count} documents")
|
| 342 |
|
| 343 |
-
self.
|
| 344 |
return vectorstore
|
| 345 |
|
| 346 |
def setup_vectorstore(
|
| 347 |
-
self, github_repos: List[str] = None, reset: bool = True
|
| 348 |
) -> Chroma:
|
| 349 |
"""
|
| 350 |
Complete pipeline: load, process, chunk, and create vectorstore. Automatically
|
|
@@ -365,14 +386,20 @@ class DataManager:
|
|
| 365 |
chunks = self.load_and_process_all(github_repos=github_repos)
|
| 366 |
return self.create_vectorstore(chunks, reset=reset)
|
| 367 |
|
| 368 |
-
def show_docs_for_file(self, filename: str):
|
| 369 |
"""
|
| 370 |
Retrieve and print chunks from the vectorstore whose metadata['file_path'] ends with the
|
| 371 |
given filename. Returns a list of (doc_id, metadata, document).
|
| 372 |
|
| 373 |
DEBUG TOOL: Utility/debugging function - no corresponding FR/NFR.
|
| 374 |
"""
|
| 375 |
-
|
|
|
|
|
| 376 |
logger.info(f"Searching for chunks from file: {filename}")
|
| 377 |
|
| 378 |
ids = all_docs.get("ids", [])
|
|
@@ -393,8 +420,5 @@ class DataManager:
|
|
| 393 |
logger.info("=" * 100)
|
| 394 |
logger.info(content)
|
| 395 |
logger.info("")
|
| 396 |
-
|
| 397 |
-
|
| 398 |
-
def vectorstore(self) -> Optional[Chroma]:
|
| 399 |
-
"""Get the current vectorstore instance."""
|
| 400 |
-
return self._vectorstore
|
|
|
|
| 3 |
from local directories and GitHub repositories, chunking, and creating ChromaDB vector stores.
|
| 4 |
"""
|
| 5 |
import os
|
| 6 |
+
import re
|
| 7 |
+
import shutil
|
| 8 |
from typing import List, Optional, Callable
|
| 9 |
+
|
| 10 |
from pydantic import BaseModel, Field
|
| 11 |
from langchain_community.document_loaders import (
|
| 12 |
DirectoryLoader,
|
|
|
|
| 19 |
from langchain_chroma import Chroma
|
| 20 |
import chromadb
|
| 21 |
from chromadb.config import Settings
|
| 22 |
+
|
|
|
|
| 23 |
from config import setup_logger
|
| 24 |
|
| 25 |
logger = setup_logger(__name__)
|
|
|
|
| 29 |
|
| 30 |
doc_load_local: List[str] = Field(
|
| 31 |
default=["**/*.md"], description="Glob patterns for local docs (e.g., ['*.md'])")
|
|
|
|
|
|
|
| 32 |
doc_root: str = Field(
|
| 33 |
+
default=(
|
| 34 |
+
os.path.abspath(
|
| 35 |
+
os.path.join(
|
| 36 |
+
os.path.dirname(__file__), "..", "docs", "local-testing"
|
| 37 |
+
)
|
| 38 |
+
)
|
| 39 |
+
+ "/"
|
| 40 |
+
),
|
| 41 |
+
description="Root directory for local documents (development/testing only)"
|
| 42 |
+
)
|
| 43 |
chunk_size: int = Field(
|
| 44 |
default=2500, description="Character chunk size for splitting")
|
| 45 |
chunk_overlap: int = Field(
|
|
|
|
| 57 |
parameters have sensible defaults and can be overridden as needed.
|
| 58 |
"""
|
| 59 |
|
| 60 |
+
def __init__(self, config: DataManagerConfig):
|
| 61 |
"""
|
| 62 |
Initialize data manager with configuration.
|
| 63 |
|
| 64 |
Implements FR-002 (Knowledge Retrieval).
|
| 65 |
|
| 66 |
Args:
|
| 67 |
+
config: DataManagerConfig instance with all settings
|
|
|
|
| 68 |
"""
|
| 69 |
+
self.config = config
|
|
|
|
|
|
|
|
| 70 |
|
| 71 |
# Internal state
|
| 72 |
+
self.vectorstore: Optional[Chroma] = None
|
| 73 |
self._embeddings: Optional[HuggingFaceEmbeddings] = None
|
| 74 |
|
| 75 |
def load_local_documents(self) -> List[Document]:
|
|
|
|
| 103 |
documents = loader.load()
|
| 104 |
logger.info(f" Found {len(documents)} documents")
|
| 105 |
all_documents.extend(documents)
|
| 106 |
+
except Exception as e: # pragma: no cover
|
| 107 |
logger.info(
|
| 108 |
f" Error loading pattern {pattern}: {e}"
|
| 109 |
f" - skipping this pattern"
|
|
|
|
| 113 |
logger.info(f"Loaded {len(all_documents)} total local documents.")
|
| 114 |
return all_documents
|
| 115 |
|
| 116 |
+
def _load_github_documents(
|
| 117 |
+
self,
|
| 118 |
+
repos: Optional[List[str]] = None,
|
| 119 |
+
file_filter: Optional[Callable[[str], bool]] = None,
|
| 120 |
+
cleanup_tmp: bool = True
|
| 121 |
) -> List[Document]:
|
| 122 |
"""
|
| 123 |
Load documents from GitHub repositories.
|
|
|
|
| 126 |
|
| 127 |
Args:
|
| 128 |
repos: List of repos (owner/repo format). Defaults to github_repos from init.
|
| 129 |
+
file_filter: Optional filter function for files. If None, uses default filter
|
| 130 |
+
excluding README, CONTRIBUTING, CODE_OF_CONDUCT, and SECURITY files.
|
| 131 |
cleanup_tmp: If True, remove tmp directory before loading.
|
| 132 |
|
| 133 |
Returns:
|
| 134 |
List of loaded documents from all repos.
|
| 135 |
"""
|
|
|
|
| 136 |
|
| 137 |
+
# Default filter excludes common documentation files that degrade RAG quality
|
| 138 |
if file_filter is None:
|
| 139 |
+
def default_file_filter(fp: str) -> bool:
|
| 140 |
+
"""Default filter excludes contributing docs to preserve RAG quality.
|
| 141 |
|
| 142 |
+
Implements FR-002 (Knowledge Retrieval): Filters out common boilerplate
|
| 143 |
+
files (README, CONTRIBUTING, etc.) that aren't representative of
|
| 144 |
+
personified agent knowledge.
|
| 145 |
"""
|
|
|
|
| 146 |
basename = os.path.basename(fp).lower()
|
| 147 |
+
# Exclude common boilerplate that doesn't represent agent's knowledge
|
| 148 |
+
excluded = {"readme.md", "contributing.md", "code_of_conduct.md",
|
| 149 |
+
"security.md"}
|
| 150 |
+
return basename not in excluded
|
| 151 |
+
file_filter = default_file_filter
|
| 152 |
|
| 153 |
all_docs = []
|
| 154 |
# Clean up tmp directory before loading
|
|
|
|
| 158 |
logger.info(f"Cleaning up existing tmp directory: {tmp_dir}")
|
| 159 |
shutil.rmtree(tmp_dir)
|
| 160 |
|
| 161 |
+
# Use provided repos or default to empty list if none specified
|
| 162 |
+
repos_to_load = repos if repos is not None else []
|
| 163 |
+
logger.info(f"Loading GitHub documents from {len(repos_to_load)} repos {repos_to_load}")
|
| 164 |
+
for repo in repos_to_load:
|
| 165 |
logger.info(f"Loading GitHub repo: {repo}")
|
| 166 |
try:
|
| 167 |
+
# Clone repo using GitLoader (even though it doesn't load files)
|
| 168 |
+
repo_path = f"{tmp_dir}/{repo}"
|
| 169 |
loader = GitLoader(
|
| 170 |
clone_url=f"https://github.com/{repo}",
|
| 171 |
+
repo_path=repo_path,
|
|
|
|
| 172 |
branch="main",
|
| 173 |
)
|
| 174 |
+
# GitLoader.load() doesn't return files, but it clones the repo
|
| 175 |
+
# so we use DirectoryLoader to actually load the markdown files
|
| 176 |
+
loader.load()
|
| 177 |
+
|
| 178 |
+
# Now use DirectoryLoader to load markdown files from the cloned repo
|
| 179 |
+
directory_loader = DirectoryLoader(
|
| 180 |
+
repo_path,
|
| 181 |
+
glob="**/*.md",
|
| 182 |
+
loader_cls=TextLoader,
|
| 183 |
+
loader_kwargs={'encoding': 'utf-8'}
|
| 184 |
+
)
|
| 185 |
+
docs = directory_loader.load()
|
| 186 |
+
|
| 187 |
+
# Apply filter (default or custom) to exclude irrelevant files
|
| 188 |
+
docs = [doc for doc in docs if file_filter(doc.metadata['source'])]
|
| 189 |
|
| 190 |
# Add repo metadata to each document
|
| 191 |
for doc in docs:
|
| 192 |
doc.metadata["github_repo"] = repo
|
| 193 |
|
| 194 |
logger.info(f" Loaded {len(docs)} documents from {repo}")
|
| 195 |
+
all_docs.extend(docs)
|
| 196 |
+
except Exception as e: # pragma: no cover
|
| 197 |
logger.info(f" Error loading repo {repo}: {e} - skipping")
|
| 198 |
continue
|
| 199 |
|
|
|
|
| 252 |
strip_headers=False)
|
| 253 |
|
| 254 |
all_chunks = []
|
| 255 |
+
chunk_index = 0 # Track chunk number across all documents
|
| 256 |
for doc in documents:
|
| 257 |
# Split by headers first - this returns Documents with header metadata
|
| 258 |
header_chunks = header_splitter.split_text(doc.page_content)
|
|
|
|
| 261 |
for chunk in header_chunks:
|
| 262 |
# Create new Document with combined metadata
|
| 263 |
merged_metadata = {**doc.metadata, **chunk.metadata}
|
| 264 |
+
merged_metadata['chunk_index'] = chunk_index # Add global chunk index
|
| 265 |
+
chunk_index += 1
|
| 266 |
new_doc = Document(
|
| 267 |
page_content=chunk.page_content,
|
| 268 |
metadata=merged_metadata,
|
|
|
|
| 273 |
size_splitter = MarkdownTextSplitter(chunk_size=self.config.chunk_size)
|
| 274 |
final_chunks = size_splitter.split_documents(all_chunks)
|
| 275 |
|
| 276 |
+
# Re-index after size splitting to maintain sequential chunk indices
|
| 277 |
+
for i, chunk in enumerate(final_chunks):
|
| 278 |
+
chunk.metadata['chunk_index'] = i
|
| 279 |
+
|
| 280 |
logger.info(f"Created {len(final_chunks)} chunks")
|
| 281 |
return final_chunks
|
| 282 |
|
| 283 |
+
def load_and_process_all(self, github_repos: Optional[List[str]] = None) -> List[Document]:
|
| 284 |
"""
|
| 285 |
Load, process, and chunk all documents. Automatically loads local documents
|
| 286 |
if doc_load_local is set, and GitHub documents if github_repos (or
|
|
|
|
| 301 |
if self.config.doc_load_local:
|
| 302 |
all_docs.extend(self.load_local_documents())
|
| 303 |
|
| 304 |
+
# Load GitHub documents if repos are provided (github_repos must come from caller)
|
| 305 |
+
if github_repos:
|
| 306 |
+
all_docs.extend(self._load_github_documents(repos=github_repos))
|
|
|
|
| 307 |
|
| 308 |
processed_docs = self.process_documents(all_docs)
|
| 309 |
chunks = self.chunk_documents(processed_docs)
|
|
|
|
| 343 |
chroma_client = chromadb.EphemeralClient(Settings(anonymized_telemetry=False))
|
| 344 |
|
| 345 |
# Drop existing collection if requested
|
| 346 |
+
if reset: # pragma: no cover
|
| 347 |
try:
|
| 348 |
chroma_client.delete_collection(self.config.db_name)
|
| 349 |
logger.info(f"Dropped existing collection: {self.config.db_name}")
|
| 350 |
+
except Exception: # pragma: no cover
|
| 351 |
pass # Collection doesn't exist yet
|
| 352 |
|
| 353 |
logger.info(f"Creating vectorstore with {len(chunks)} chunks...")
|
|
|
|
| 361 |
count = vectorstore._collection.count()
|
| 362 |
logger.info(f"Vectorstore created with {count} documents")
|
| 363 |
|
| 364 |
+
self.vectorstore = vectorstore
|
| 365 |
return vectorstore
|
| 366 |
|
| 367 |
def setup_vectorstore(
|
| 368 |
+
self, github_repos: Optional[List[str]] = None, reset: bool = True
|
| 369 |
) -> Chroma:
|
| 370 |
"""
|
| 371 |
Complete pipeline: load, process, chunk, and create vectorstore. Automatically
|
|
|
|
| 386 |
chunks = self.load_and_process_all(github_repos=github_repos)
|
| 387 |
return self.create_vectorstore(chunks, reset=reset)
|
| 388 |
|
| 389 |
+
def show_docs_for_file(self, filename: str): # pragma: no cover
|
| 390 |
"""
|
| 391 |
Retrieve and print chunks from the vectorstore whose metadata['file_path'] ends with the
|
| 392 |
given filename. Returns a list of (doc_id, metadata, document).
|
| 393 |
|
| 394 |
DEBUG TOOL: Utility/debugging function - no corresponding FR/NFR.
|
| 395 |
"""
|
| 396 |
+
if self.vectorstore is None:
|
| 397 |
+
logger.warning(
|
| 398 |
+
"Vectorstore not initialized. Call setup_vectorstore() first."
|
| 399 |
+
)
|
| 400 |
+
return []
|
| 401 |
+
|
| 402 |
+
all_docs = self.vectorstore.get()
|
| 403 |
logger.info(f"Searching for chunks from file: {filename}")
|
| 404 |
|
| 405 |
ids = all_docs.get("ids", [])
|
|
|
|
| 420 |
logger.info("=" * 100)
|
| 421 |
logger.info(content)
|
| 422 |
logger.info("")
|
| 423 |
+
|
| 424 |
+
return matched
|
|
|
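To summarize the DataManager refactor above: GitHub repos moved off the config object and are now passed to the pipeline call. A minimal sketch mirroring src/app.py (the repo name and doc_root below are placeholders):

```python
# Sketch of the refactored pipeline: DataManagerConfig -> DataManager -> setup_vectorstore().
from data import DataManager, DataManagerConfig

data_config = DataManagerConfig(
    doc_load_local=["**/*.md"],   # local glob patterns (default)
    doc_root="tests/data/",       # placeholder; defaults to docs/local-testing/
    chunk_size=2500,
)
data_manager = DataManager(config=data_config)

# github_repos is now a call-time argument instead of a config field
vectorstore = data_manager.setup_vectorstore(github_repos=["owner/repo"])  # placeholder repo

# The resulting Chroma vectorstore backs similarity search for get_local_info
hits = vectorstore.similarity_search("disaster recovery", k=3)
```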
src/notebooks/experiments.ipynb
CHANGED
|
@@ -16,10 +16,7 @@
|
|
| 16 |
"outputs": [],
|
| 17 |
"source": [
|
| 18 |
"# Setup configuration\n",
|
| 19 |
-
"import
|
| 20 |
-
"sys.path.append('/Users/benyoung/projects/ai-me')\n",
|
| 21 |
-
"\n",
|
| 22 |
-
"from src.config import Config\n",
|
| 23 |
"from IPython.display import Markdown\n",
|
| 24 |
"from agents import trace, Runner\n",
|
| 25 |
"\n",
|
|
@@ -44,9 +41,9 @@
|
|
| 44 |
"outputs": [],
|
| 45 |
"source": [
|
| 46 |
"from importlib import reload\n",
|
| 47 |
-
"import
|
| 48 |
"reload(_data_module)\n",
|
| 49 |
-
"from
|
| 50 |
"\n",
|
| 51 |
"\n",
|
| 52 |
"# Use consolidated data manager\n",
|
|
@@ -85,7 +82,7 @@
|
|
| 85 |
"metadata": {},
|
| 86 |
"outputs": [],
|
| 87 |
"source": [
|
| 88 |
-
"from
|
| 89 |
"\n",
|
| 90 |
"# Initialize agent config with vectorstore\n",
|
| 91 |
"agent_config = AIMeAgent(\n",
|
|
@@ -95,7 +92,7 @@
|
|
| 95 |
" github_token=config.github_token\n",
|
| 96 |
")\n",
|
| 97 |
"\n",
|
| 98 |
-
"ai_me = await agent_config.create_ai_me_agent()
|
| 99 |
]
|
| 100 |
},
|
| 101 |
{
|
|
@@ -275,9 +272,9 @@
|
|
| 275 |
"outputs": [],
|
| 276 |
"source": [
|
| 277 |
"# Reload agent module to pick up latest changes\n",
|
| 278 |
-
"import
|
| 279 |
"reload(_agent_module)\n",
|
| 280 |
-
"from
|
| 281 |
"\n",
|
| 282 |
"# Recreate agent config with updated module\n",
|
| 283 |
"agent_config = AIMeAgent(\n",
|
|
|
|
| 16 |
"outputs": [],
|
| 17 |
"source": [
|
| 18 |
"# Setup configuration\n",
|
| 19 |
+
"from config import Config\n",
|
|
|
|
|
| 20 |
"from IPython.display import Markdown\n",
|
| 21 |
"from agents import trace, Runner\n",
|
| 22 |
"\n",
|
|
|
|
| 41 |
"outputs": [],
|
| 42 |
"source": [
|
| 43 |
"from importlib import reload\n",
|
| 44 |
+
"import data as _data_module\n",
|
| 45 |
"reload(_data_module)\n",
|
| 46 |
+
"from data import DataManager, DataManagerConfig\n",
|
| 47 |
"\n",
|
| 48 |
"\n",
|
| 49 |
"# Use consolidated data manager\n",
|
|
|
|
| 82 |
"metadata": {},
|
| 83 |
"outputs": [],
|
| 84 |
"source": [
|
| 85 |
+
"from agent import AIMeAgent\n",
|
| 86 |
"\n",
|
| 87 |
"# Initialize agent config with vectorstore\n",
|
| 88 |
"agent_config = AIMeAgent(\n",
|
|
|
|
| 92 |
" github_token=config.github_token\n",
|
| 93 |
")\n",
|
| 94 |
"\n",
|
| 95 |
+
"ai_me = await agent_config.create_ai_me_agent()"
|
| 96 |
]
|
| 97 |
},
|
| 98 |
{
|
|
|
|
| 272 |
"outputs": [],
|
| 273 |
"source": [
|
| 274 |
"# Reload agent module to pick up latest changes\n",
|
| 275 |
+
"import agent as _agent_module\n",
|
| 276 |
"reload(_agent_module)\n",
|
| 277 |
+
"from agent import AIMeAgent\n",
|
| 278 |
"\n",
|
| 279 |
"# Recreate agent config with updated module\n",
|
| 280 |
"agent_config = AIMeAgent(\n",
|
tests/data/README.md
ADDED
|
@@ -0,0 +1,194 @@
|
|
| 1 |
+
# Test Data Directory
|
| 2 |
+
|
| 3 |
+
This directory contains controlled test data for RAG (Retrieval Augmented Generation) testing in the ai-me project.
|
| 4 |
+
|
| 5 |
+
## Purpose
|
| 6 |
+
|
| 7 |
+
These markdown files provide known content for deterministic testing of:
|
| 8 |
+
1. Document loading and chunking (from local files)
|
| 9 |
+
2. Vector embeddings and storage (ChromaDB)
|
| 10 |
+
3. Retrieval quality (similarity search)
|
| 11 |
+
4. Agent response accuracy (RAG output validation)
|
| 12 |
+
|
| 13 |
+
## Files
|
| 14 |
+
|
| 15 |
+
| File | Purpose | Key Content | Used By Tests |
|
| 16 |
+
|------|---------|-------------|---------------|
|
| 17 |
+
| **rear_info.md** | ReaR disaster recovery info | Project ID: IT-245 | test_rear_knowledge_contains_it245 |
|
| 18 |
+
| **projects.md** | Project listings | IT-245, IT-300, APP-101, DATA-500 | General project queries |
|
| 19 |
+
| **team_info.md** | Team structure (fictional) | Alice, Bob, Carol + departments | Person/team queries |
|
| 20 |
+
| **faq.md** | FAQ with tech stack, workflows | IT-245 references, dev processes | General knowledge queries |
|
| 21 |
+
| **README.md** | This documentation | Test data guide | - |
|
| 22 |
+
|
| 23 |
+
## Statistics
|
| 24 |
+
|
| 25 |
+
- **Total Files**: 5 markdown files
|
| 26 |
+
- **Total Chunks**: ~38 (after splitting with CharacterTextSplitter)
|
| 27 |
+
- **Chunk Size**: 2500 characters (default)
|
| 28 |
+
- **Chunk Overlap**: 0 characters (default)
|
| 29 |
+
- **Embedding Model**: sentence-transformers/all-MiniLM-L6-v2
|
| 30 |
+
|
| 31 |
+
## Usage in Tests
|
| 32 |
+
|
| 33 |
+
The test suite (`tests/integration/spec-001.py`) automatically uses this directory:
|
| 34 |
+
|
| 35 |
+
```python
|
| 36 |
+
# Configuration in tests/integration/spec-001.py
|
| 37 |
+
os.environ["GITHUB_REPOS"] = "" # Disable GitHub loading
|
| 38 |
+
test_data_dir = os.path.join(project_root, "tests", "data")
|
| 39 |
+
|
| 40 |
+
# DataManager initialization with required config
|
| 41 |
+
config = DataManagerConfig(
|
| 42 |
+
doc_load_local=["**/*.md"],
|
| 43 |
+
github_repos=[],
|
| 44 |
+
doc_root=test_data_dir # Points to this directory
|
| 45 |
+
)
|
| 46 |
+
data_manager = DataManager(config=config)
|
| 47 |
+
```
|
| 48 |
+
|
| 49 |
+
## Test Cases
|
| 50 |
+
|
| 51 |
+
### ✅ Test 1: ReaR Knowledge (IT-245)
|
| 52 |
+
**Query**: "What do you know about ReaR?"
|
| 53 |
+
|
| 54 |
+
**Source**: `rear_info.md`
|
| 55 |
+
|
| 56 |
+
**Validates**:
|
| 57 |
+
- Document retrieval works correctly
|
| 58 |
+
- Agent finds and extracts specific project information
|
| 59 |
+
- Response contains "IT-245" identifier
|
| 60 |
+
|
| 61 |
+
**Expected Output**: Response mentions ReaR, disaster recovery, and IT-245 project.
|
| 62 |
+
|
| 63 |
+
### ⏭️ Test 2: GitHub Commits (Skipped)
|
| 64 |
+
**Note**: Requires MCP servers (disabled for test speed).
|
| 65 |
+
|
| 66 |
+
### ✅ Test 3: Unknown Person (Negative Test)
|
| 67 |
+
**Query**: "Who is slartibartfast?"
|
| 68 |
+
|
| 69 |
+
**Source**: None (intentionally missing)
|
| 70 |
+
|
| 71 |
+
**Validates**:
|
| 72 |
+
- Agent handles missing information gracefully
|
| 73 |
+
- No hallucination or fabricated responses
|
| 74 |
+
- Proper "don't have information" response
|
| 75 |
+
|
| 76 |
+
**Expected Output**: Response contains negative indicators like "don't have", "no information", etc.
|
| 77 |
+
|
| 78 |
+
## Benefits vs. Loading from GitHub
|
| 79 |
+
|
| 80 |
+
| Aspect | Test Data Directory | GitHub Loading |
|
| 81 |
+
|--------|-------------------|----------------|
|
| 82 |
+
| **Speed** | ~10 seconds total | Minutes per test run |
|
| 83 |
+
| **Network** | None required | API calls needed |
|
| 84 |
+
| **Determinism** | Fully controlled | May change over time |
|
| 85 |
+
| **Setup** | Already included | Requires GitHub token |
|
| 86 |
+
| **Isolation** | Completely isolated | External dependency |
|
| 87 |
+
|
| 88 |
+
## Key Implementation Details
|
| 89 |
+
|
| 90 |
+
### Local Document Metadata
|
| 91 |
+
|
| 92 |
+
Unlike GitHub documents, local documents have simplified metadata:
|
| 93 |
+
|
| 94 |
+
```python
|
| 95 |
+
# GitHub documents have:
|
| 96 |
+
doc.metadata['github_repo'] = 'owner/repo'
|
| 97 |
+
doc.metadata['file_path'] = 'path/to/file.md'
|
| 98 |
+
|
| 99 |
+
# Local documents have:
|
| 100 |
+
doc.metadata['source'] = '/full/path/to/file.md'
|
| 101 |
+
# NO github_repo field
|
| 102 |
+
```
|
| 103 |
+
|
| 104 |
+
The `get_local_info` tool in `src/agent.py` was updated to handle both cases.
|
| 105 |
+
|
| 106 |
+
### Unicode Handling
|
| 107 |
+
|
| 108 |
+
Test assertions handle Unicode variants:
|
| 109 |
+
|
| 110 |
+
- **Hyphens**: `IT-245` (regular) vs `IT‑245` (non-breaking)
|
| 111 |
+
- **Apostrophes**: `don't` (regular) vs `don’t` (smart quote)
|
| 112 |
+
- **Spaces**: Regular space vs non-breaking space (`\u00a0`)
|
| 113 |
+
|
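A minimal illustrative sketch of tolerating these variants in an assertion (the actual tests may normalize differently):

```python
# Normalize common Unicode look-alikes before asserting on LLM output.
def normalize(text: str) -> str:
    return (
        text.replace("\u2011", "-")   # non-breaking hyphen -> ASCII hyphen
            .replace("\u2019", "'")   # right single quotation mark -> apostrophe
            .replace("\u00a0", " ")   # non-breaking space -> regular space
    )


assert "IT-245" in normalize("Project IT\u2011245 covers ReaR disaster recovery")
```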
| 114 |
+
## Adding New Test Data
|
| 115 |
+
|
| 116 |
+
To add new test content:
|
| 117 |
+
|
| 118 |
+
1. **Create markdown file** in this directory:
|
| 119 |
+
```bash
|
| 120 |
+
touch tests/data/my_topic.md
|
| 121 |
+
# Add relevant content with known facts
|
| 122 |
+
```
|
| 123 |
+
|
| 124 |
+
2. **Add test case** in `tests/integration/spec-001.py`:
|
| 125 |
+
```python
|
| 126 |
+
@pytest.mark.asyncio
|
| 127 |
+
async def test_my_topic_knowledge(ai_me_agent):
|
| 128 |
+
query = "What do you know about [topic]?"
|
| 129 |
+
result = await Runner.run(ai_me_agent, query, max_turns=30)
|
| 130 |
+
assert "[expected_content]" in result.final_output
|
| 131 |
+
```
|
| 132 |
+
|
| 133 |
+
3. **Document** in this README
|
| 134 |
+
|
| 135 |
+
4. **Verify** chunks created:
|
| 136 |
+
```bash
|
| 137 |
+
uv run pytest tests/ -v -s | grep "Created.*chunks"
|
| 138 |
+
```
|
| 139 |
+
|
| 140 |
+
## Maintenance Guidelines
|
| 141 |
+
|
| 142 |
+
### What TO Include
|
| 143 |
+
|
| 144 |
+
✅ Fictional but realistic data
|
| 145 |
+
✅ Specific identifiers for testing (e.g., IT-245)
|
| 146 |
+
✅ Structured markdown with clear headings
|
| 147 |
+
✅ Cross-references between documents
|
| 148 |
+
✅ Both positive and negative test cases
|
| 149 |
+
|
| 150 |
+
### What NOT to Include
|
| 151 |
+
|
| 152 |
+
❌ Real personal information or PII
|
| 153 |
+
❌ Sensitive company data
|
| 154 |
+
❌ Large binary files or images
|
| 155 |
+
❌ External dependencies
|
| 156 |
+
❌ Dynamic/time-sensitive content
|
| 157 |
+
|
| 158 |
+
## Troubleshooting
|
| 159 |
+
|
| 160 |
+
### "Vectorstore setup complete with 0 documents"
|
| 161 |
+
**Cause**: Files not loading from tests/data directory
|
| 162 |
+
|
| 163 |
+
**Fix**: Verify `doc_root` parameter and file patterns
|
| 164 |
+
|
| 165 |
+
### "Expected 'IT-245' in response but got..."
|
| 166 |
+
**Cause**: LLM used Unicode non-breaking hyphen
|
| 167 |
+
|
| 168 |
+
**Fix**: Test already handles both variants, check for other formatting
|
| 169 |
+
|
| 170 |
+
### Test execution is slow (> 30 seconds)
|
| 171 |
+
**Cause**: May be loading from GitHub instead of tests/data
|
| 172 |
+
|
| 173 |
+
**Fix**: Verify `GITHUB_REPOS=""` in test environment setup
|
| 174 |
+
|
| 175 |
+
## Performance Benchmarks
|
| 176 |
+
|
| 177 |
+
Measured on M1 MacBook Pro:
|
| 178 |
+
|
| 179 |
+
- **Vectorstore Setup**: 2-3 seconds (includes embedding model loading)
|
| 180 |
+
- **Test 1 (ReaR)**: 3-4 seconds (includes LLM calls)
|
| 181 |
+
- **Test 3 (Unknown)**: 3-4 seconds
|
| 182 |
+
- **Total Runtime**: ~10 seconds for all passing tests
|
| 183 |
+
|
| 184 |
+
Compare to production setup with GitHub repos: 2-5 minutes
|
| 185 |
+
|
| 186 |
+
## Future Enhancements
|
| 187 |
+
|
| 188 |
+
- [ ] Add more domain-specific test documents
|
| 189 |
+
- [ ] Create test cases for multi-document synthesis
|
| 190 |
+
- [ ] Add edge cases (empty files, malformed markdown)
|
| 191 |
+
- [ ] Performance regression tests
|
| 192 |
+
- [ ] Quality metrics (retrieval precision/recall)
|
| 193 |
+
|
| 194 |
+
For more details, see `/TESTING.md` in the project root.
|
tests/data/projects.md
ADDED
|
@@ -0,0 +1,61 @@
|
|
|
| 1 |
+
# Active Projects
|
| 2 |
+
|
| 3 |
+
## Infrastructure Projects
|
| 4 |
+
|
| 5 |
+
### IT-245: Disaster Recovery
|
| 6 |
+
**Status**: In Progress
|
| 7 |
+
**Owner**: Infrastructure Team
|
| 8 |
+
**Timeline**: Q4 2024 - Q1 2025
|
| 9 |
+
|
| 10 |
+
Implementation of Relax-and-Recover (ReaR) backup solution across production servers. This provides automated disaster recovery capabilities with bare metal restoration.
|
| 11 |
+
|
| 12 |
+
**Key Milestones**:
|
| 13 |
+
- ✅ Proof of concept completed
|
| 14 |
+
- ✅ Test environment deployment
|
| 15 |
+
- 🔄 Production rollout (in progress)
|
| 16 |
+
- ⏳ Documentation and training
|
| 17 |
+
|
| 18 |
+
### IT-300: Network Segmentation
|
| 19 |
+
**Status**: Planning
|
| 20 |
+
**Owner**: Security Team
|
| 21 |
+
**Timeline**: Q1 2025
|
| 22 |
+
|
| 23 |
+
Network redesign to implement zero-trust architecture with micro-segmentation.
|
| 24 |
+
|
| 25 |
+
## Application Projects
|
| 26 |
+
|
| 27 |
+
### APP-101: Customer Portal v2
|
| 28 |
+
**Status**: Active Development
|
| 29 |
+
**Owner**: Frontend Team
|
| 30 |
+
**Timeline**: Q4 2024 - Q2 2025
|
| 31 |
+
|
| 32 |
+
Complete redesign of customer-facing portal with modern React architecture.
|
| 33 |
+
|
| 34 |
+
### APP-150: API Gateway Upgrade
|
| 35 |
+
**Status**: In Progress
|
| 36 |
+
**Owner**: Backend Team
|
| 37 |
+
**Timeline**: Q4 2024
|
| 38 |
+
|
| 39 |
+
Migrating from legacy API gateway to Kong with enhanced security and monitoring.
|
| 40 |
+
|
| 41 |
+
## Data Projects
|
| 42 |
+
|
| 43 |
+
### DATA-500: Analytics Platform
|
| 44 |
+
**Status**: Active Development
|
| 45 |
+
**Owner**: Data Engineering Team
|
| 46 |
+
**Timeline**: Q3 2024 - Q2 2025
|
| 47 |
+
|
| 48 |
+
Building modern data warehouse on Snowflake with real-time analytics capabilities.
|
| 49 |
+
|
| 50 |
+
### DATA-510: ML Pipeline
|
| 51 |
+
**Status**: Planning
|
| 52 |
+
**Owner**: Data Science Team
|
| 53 |
+
**Timeline**: Q1 2025 - Q3 2025
|
| 54 |
+
|
| 55 |
+
Automated machine learning pipeline for predictive analytics and recommendation systems.
|
| 56 |
+
|
| 57 |
+
## Completed Projects
|
| 58 |
+
|
| 59 |
+
- **IT-200**: VMware to KVM migration (Completed Q3 2024)
|
| 60 |
+
- **APP-90**: Mobile app release (Completed Q2 2024)
|
| 61 |
+
- **SEC-400**: SOC 2 compliance (Completed Q1 2024)
|
tests/data/team.md
ADDED
|
@@ -0,0 +1,49 @@
|
|
|
| 1 |
+
# Team Information
|
| 2 |
+
|
| 3 |
+
## Operation Agentic Me
|
| 4 |
+
|
| 5 |
+
### Engineers
|
| 6 |
+
|
| 7 |
+
- **Ben Young** - Lead Software Engineer
|
| 8 |
+
- Specializes in backend systems
|
| 9 |
+
- Working on microservices architecture
|
| 10 |
+
- Programming languages: Python, Go, TypeScript, Rust, SQL
|
| 11 |
+
- Expertise in: async programming, distributed systems, cloud infrastructure
|
| 12 |
+
- Contact: ben@example.com
|
| 13 |
+
|
| 14 |
+
- **Bob Smith** - Frontend Developer
|
| 15 |
+
- React and TypeScript expert
|
| 16 |
+
- UI/UX design background
|
| 17 |
+
- Contact: bob@example.com
|
| 18 |
+
|
| 19 |
+
### Product Managers
|
| 20 |
+
|
| 21 |
+
- **Carol Williams** - Product Owner
|
| 22 |
+
- Write requirements
|
| 23 |
+
- Educate team on user profiles
|
| 24 |
+
- Contact: carol@example.com
|
| 25 |
+
|
| 26 |
+
### Recent Projects
|
| 27 |
+
|
| 28 |
+
1. **Project Phoenix** - Cloud migration initiative
|
| 29 |
+
2. **Project Titan** - New customer portal
|
| 30 |
+
3. **IT-245** - ReaR disaster recovery implementation
|
| 31 |
+
|
| 32 |
+
## Department Structure
|
| 33 |
+
|
| 34 |
+
- Engineering Director: David Chen
|
| 35 |
+
- Product Manager: Emma Davis
|
| 36 |
+
- QA Lead: Frank Miller
|
| 37 |
+
|
| 38 |
+
## Office Locations
|
| 39 |
+
|
| 40 |
+
- San Francisco HQ
|
| 41 |
+
- Austin Remote Office
|
| 42 |
+
- London European Hub
|
| 43 |
+
|
| 44 |
+
## Team Events
|
| 45 |
+
|
| 46 |
+
- Weekly stand-ups: Monday 9 AM PST
|
| 47 |
+
- Sprint planning: Every other Wednesday
|
| 48 |
+
- Team retrospectives: Last Friday of the month
|
| 49 |
+
- Quarterly all-hands: First week of Q1, Q2, Q3, Q4
|
tests/integration/spec-001.py
ADDED
|
@@ -0,0 +1,507 @@
|
|
|
| 1 |
+
"""
|
| 2 |
+
Integration tests for ai-me agent.
|
| 3 |
+
Tests the complete setup including vectorstore, agent configuration, and agent responses.
|
| 4 |
+
"""
|
| 5 |
+
import pytest
|
| 6 |
+
import pytest_asyncio
|
| 7 |
+
import re
|
| 8 |
+
import sys
|
| 9 |
+
import os
|
| 10 |
+
import logging
|
| 11 |
+
from datetime import datetime
|
| 12 |
+
from unittest.mock import AsyncMock, patch
|
| 13 |
+
|
| 14 |
+
# Something about these tests makes me feel yucky. Big, brittle, and slow. BBS?
|
| 15 |
+
# In the future we should run inference locally with docker-compose models.
|
| 16 |
+
|
| 17 |
+
# Set temperature and seed for deterministic test results
|
| 18 |
+
os.environ["TEMPERATURE"] = "0"
|
| 19 |
+
os.environ["SEED"] = "42"
|
| 20 |
+
|
| 21 |
+
# Point our RAG to the tests/data directory
|
| 22 |
+
project_root = os.path.abspath(os.path.join(os.path.dirname(__file__), "../.."))
|
| 23 |
+
test_data_dir = os.path.join(project_root, "tests", "data")
|
| 24 |
+
os.environ["DOC_ROOT"] = test_data_dir
|
| 25 |
+
os.environ["LOCAL_DOCS"] = "**/*.md"
|
| 26 |
+
|
| 27 |
+
from config import setup_logger, Config
|
| 28 |
+
from agent import AIMeAgent
|
| 29 |
+
from data import DataManager, DataManagerConfig
|
| 30 |
+
|
| 31 |
+
logger = setup_logger(__name__)
|
| 32 |
+
|
| 33 |
+
# ============================================================================
|
| 34 |
+
# SHARED CACHING - Initialize on first use, then reuse
|
| 35 |
+
# ============================================================================
|
| 36 |
+
|
| 37 |
+
_config = None
|
| 38 |
+
_vectorstore = None
|
| 39 |
+
_data_manager = None
|
| 40 |
+
|
| 41 |
+
|
| 42 |
+
def _get_shared_config():
|
| 43 |
+
"""Lazy initialization of shared config."""
|
| 44 |
+
global _config
|
| 45 |
+
if _config is None:
|
| 46 |
+
_config = Config() # type: ignore
|
| 47 |
+
logger.info(f"Initialized shared config: {_config.bot_full_name}")
|
| 48 |
+
return _config
|
| 49 |
+
|
| 50 |
+
|
| 51 |
+
def _get_shared_vectorstore():
|
| 52 |
+
"""Lazy initialization of shared vectorstore."""
|
| 53 |
+
global _vectorstore, _data_manager
|
| 54 |
+
if _vectorstore is None:
|
| 55 |
+
logger.info("Initializing shared vectorstore (first test)...")
|
| 56 |
+
test_data_dir = os.path.join(project_root, "tests", "data")
|
| 57 |
+
_data_config = DataManagerConfig(
|
| 58 |
+
doc_root=test_data_dir
|
| 59 |
+
)
|
| 60 |
+
_data_manager = DataManager(config=_data_config)
|
| 61 |
+
_vectorstore = _data_manager.setup_vectorstore()
|
| 62 |
+
logger.info(f"Shared vectorstore ready: {_vectorstore._collection.count()} documents")
|
| 63 |
+
return _vectorstore
|
| 64 |
+
|
| 65 |
+
|
| 66 |
+
@pytest_asyncio.fixture(scope="function")
|
| 67 |
+
async def ai_me_agent():
|
| 68 |
+
"""
|
| 69 |
+
Setup fixture for ai-me agent with vectorstore and MCP servers.
|
| 70 |
+
|
| 71 |
+
CRITICAL: Function-scoped fixture prevents hanging/blocking issues.
|
| 72 |
+
Each test gets its own agent instance with proper cleanup.
|
| 73 |
+
|
| 74 |
+
Reuses shared config and vectorstore (lazy-initialized on first use).
|
| 75 |
+
|
| 76 |
+
This fixture:
|
| 77 |
+
- Reuses shared config and vectorstore
|
| 78 |
+
- Creates agent WITH real subprocess MCP servers (GitHub, Time, Memory)
|
| 79 |
+
- Yields agent for test
|
| 80 |
+
- Cleans up MCP servers after test completes
|
| 81 |
+
"""
|
| 82 |
+
config = _get_shared_config()
|
| 83 |
+
vectorstore = _get_shared_vectorstore()
|
| 84 |
+
|
| 85 |
+
# Initialize agent config with shared vectorstore
|
| 86 |
+
aime_agent = AIMeAgent(
|
| 87 |
+
bot_full_name=config.bot_full_name,
|
| 88 |
+
model=config.model,
|
| 89 |
+
vectorstore=vectorstore,
|
| 90 |
+
github_token=config.github_token,
|
| 91 |
+
session_id="test-session"
|
| 92 |
+
)
|
| 93 |
+
|
| 94 |
+
# Create the agent WITH MCP servers enabled
|
| 95 |
+
logger.info("Creating ai-me agent with MCP servers...")
|
| 96 |
+
assert aime_agent.session_id is not None, "session_id should be set"
|
| 97 |
+
await aime_agent.create_ai_me_agent(
|
| 98 |
+
mcp_params=[
|
| 99 |
+
aime_agent.mcp_github_params,
|
| 100 |
+
aime_agent.mcp_time_params,
|
| 101 |
+
aime_agent.get_mcp_memory_params(aime_agent.session_id),
|
| 102 |
+
]
|
| 103 |
+
)
|
| 104 |
+
logger.info("Agent created successfully with MCP servers")
|
| 105 |
+
logger.info(f"Temperature set to {config.temperature}")
|
| 106 |
+
logger.info(f"Seed set to {config.seed}")
|
| 107 |
+
|
| 108 |
+
# Yield the agent for the test
|
| 109 |
+
yield aime_agent
|
| 110 |
+
|
| 111 |
+
# CRITICAL: Cleanup after test completes to prevent hanging
|
| 112 |
+
logger.info("Cleaning up MCP servers after test...")
|
| 113 |
+
await aime_agent.cleanup()
|
| 114 |
+
logger.info("Cleanup complete")
|
| 115 |
+
|
| 116 |
+
|
| 117 |
+
@pytest.mark.asyncio
|
| 118 |
+
async def test_github_documents_load():
|
| 119 |
+
"""Tests FR-002: GitHub document loading with source metadata."""
|
| 120 |
+
config = Config() # type: ignore
|
| 121 |
+
|
| 122 |
+
# Load GitHub documents directly
|
| 123 |
+
github_config = DataManagerConfig(
|
| 124 |
+
doc_load_local=[]
|
| 125 |
+
)
|
| 126 |
+
dm = DataManager(config=github_config)
|
| 127 |
+
vs = dm.setup_vectorstore(github_repos=["byoung/ai-me"])
|
| 128 |
+
|
| 129 |
+
agent = AIMeAgent(
|
| 130 |
+
bot_full_name=config.bot_full_name,
|
| 131 |
+
model=config.model,
|
| 132 |
+
vectorstore=vs,
|
| 133 |
+
github_token=config.github_token,
|
| 134 |
+
session_id="test-session"
|
| 135 |
+
)
|
| 136 |
+
await agent.create_ai_me_agent()
|
| 137 |
+
|
| 138 |
+
response = await agent.run("Do you have python experience?")
|
| 139 |
+
|
| 140 |
+
assert "yes" in response.lower(), (
|
| 141 |
+
f"yes' in response but got: {response}"
|
| 142 |
+
)
|
| 143 |
+
|
| 144 |
+
|
| 145 |
+
@pytest.mark.asyncio
|
| 146 |
+
async def test_rear_knowledge_contains_it245(ai_me_agent):
|
| 147 |
+
"""Tests REQ-001: Knowledge base retrieval of personal documentation."""
|
| 148 |
+
response = await ai_me_agent.run("What is IT-245?")
|
| 149 |
+
|
| 150 |
+
assert "IT-245" in response or "It-245" in response or "it-245" in response
|
| 151 |
+
logger.info("✓ IT-245 found in response")
|
| 152 |
+
|
| 153 |
+
|
| 154 |
+
@pytest.mark.asyncio
|
| 155 |
+
async def test_github_commits_contains_shas(ai_me_agent):
|
| 156 |
+
"""Tests REQ-002: MCP GitHub integration - retrieve commit history."""
|
| 157 |
+
response = await ai_me_agent.run("What are some recent commits I've made?")
|
| 158 |
+
|
| 159 |
+
assert response, "Response is empty"
|
| 160 |
+
assert len(response) > 10, "Response is too short"
|
| 161 |
+
logger.info("✓ Response contains commit information")
|
| 162 |
+
|
| 163 |
+
@pytest.mark.asyncio
|
| 164 |
+
async def test_unknown_person_contains_negative_response(ai_me_agent):
|
| 165 |
+
"""Tests REQ-003: Graceful handling of out-of-scope requests."""
|
| 166 |
+
response = await ai_me_agent.run(
|
| 167 |
+
"Do you know Slartibartfast?" # Presumed unknown person
|
| 168 |
+
)
|
| 169 |
+
|
| 170 |
+
assert response, "Response is empty"
|
| 171 |
+
assert (
|
| 172 |
+
"don't know" in response.lower()
|
| 173 |
+
or "not familiar" in response.lower()
|
| 174 |
+
or "no information" in response.lower()
|
| 175 |
+
or "don't have any information" in response.lower()
|
| 176 |
+
), f"Response doesn't indicate lack of knowledge: {response}"
|
| 177 |
+
logger.info(f"✓ Test passed - correctly handled out-of-scope query")
|
| 178 |
+
|
| 179 |
+
|
| 180 |
+
@pytest.mark.asyncio
|
| 181 |
+
async def test_carol_knowledge_contains_product(ai_me_agent):
|
| 182 |
+
"""Tests FR-002, FR-003: Verify asking about Carol returns 'product'."""
|
| 183 |
+
response_raw = await ai_me_agent.run("Do you know Carol?")
|
| 184 |
+
response = response_raw.lower() # Convert to lowercase for matching
|
| 185 |
+
|
| 186 |
+
# Assert that 'product' appears in the response (Carol is Product Owner)
|
| 187 |
+
assert "product" in response, (
|
| 188 |
+
f"Expected 'product' in response but got: {response}"
|
| 189 |
+
)
|
| 190 |
+
logger.info("✓ Test passed: Response contains 'product'")
|
| 191 |
+
|
| 192 |
+
|
| 193 |
+
@pytest.mark.asyncio
|
| 194 |
+
async def test_mcp_time_server_returns_current_date(ai_me_agent):
|
| 195 |
+
"""Tests FR-009, NFR-001: Verify that the MCP time server returns the current date."""
|
| 196 |
+
response = await ai_me_agent.run("What is today's date?")
|
| 197 |
+
|
| 198 |
+
# Check for current date in various formats (ISO or natural language)
|
| 199 |
+
now = datetime.now()
|
| 200 |
+
expected_date, current_year, current_month, current_day = (
|
| 201 |
+
now.strftime("%Y-%m-%d"),
|
| 202 |
+
str(now.year),
|
| 203 |
+
now.strftime("%B"),
|
| 204 |
+
str(now.day),
|
| 205 |
+
)
|
| 206 |
+
|
| 207 |
+
# Accept either ISO format or natural language date
|
| 208 |
+
has_date = (
|
| 209 |
+
expected_date in response
|
| 210 |
+
or (
|
| 211 |
+
current_year in response
|
| 212 |
+
and current_month in response
|
| 213 |
+
and current_day in response
|
| 214 |
+
)
|
| 215 |
+
)
|
| 216 |
+
|
| 217 |
+
assert has_date, (
|
| 218 |
+
f"Expected response to contain current date "
|
| 219 |
+
f"({expected_date} or {current_month} {current_day}, {current_year}) "
|
| 220 |
+
f"but got: {response}"
|
| 221 |
+
)
|
| 222 |
+
logger.info(f"✓ Test passed: Response contains current date")
|
| 223 |
+
|
| 224 |
+
|
| 225 |
+
@pytest.mark.asyncio
|
| 226 |
+
async def test_mcp_memory_server_remembers_favorite_color(ai_me_agent):
|
| 227 |
+
"""Tests FR-013, NFR-002:
|
| 228 |
+
Verify that the MCP memory server persists information across interactions.
|
| 229 |
+
"""
|
| 230 |
+
await ai_me_agent.run("My favorite color is chartreuse.")
|
| 231 |
+
response2 = await ai_me_agent.run("What's my favorite color?")
|
| 232 |
+
|
| 233 |
+
# Check that the agent remembers the color
|
| 234 |
+
assert "chartreuse" in response2.lower(), (
|
| 235 |
+
f"Expected agent to remember favorite color 'chartreuse' "
|
| 236 |
+
f"but got: {response2}"
|
| 237 |
+
)
|
| 238 |
+
msg = (
|
| 239 |
+
"✓ Test passed: Agent remembered favorite color 'chartreuse' "
|
| 240 |
+
"across interactions"
|
| 241 |
+
)
|
| 242 |
+
logger.info(msg)
|
| 243 |
+
|
| 244 |
+
|
| 245 |
+
@pytest.mark.asyncio
|
| 246 |
+
async def test_github_relative_links_converted_to_absolute_urls():
|
| 247 |
+
"""Tests FR-004: Document processing converts relative GitHub links to absolute URLs.
|
| 248 |
+
|
| 249 |
+
Validates that when documents are loaded from GitHub with relative links
|
| 250 |
+
(e.g., /resume.md), they are rewritten to full GitHub URLs
|
| 251 |
+
(e.g., https://github.com/owner/repo/blob/main/resume.md).
|
| 252 |
+
|
| 253 |
+
This is a unit-level test of the DataManager.process_documents() method.
|
| 254 |
+
"""
|
| 255 |
+
from langchain_core.documents import Document
|
| 256 |
+
|
| 257 |
+
sample_doc = Document(
|
| 258 |
+
page_content=(
|
| 259 |
+
"Check out [my resume](/resume.md) and "
|
| 260 |
+
"[projects](/projects.md) for more info."
|
| 261 |
+
),
|
| 262 |
+
metadata={
|
| 263 |
+
"source": "github://byoung/ai-me/docs/about.md",
|
| 264 |
+
"github_repo": "byoung/ai-me"
|
| 265 |
+
}
|
| 266 |
+
)
|
| 267 |
+
|
| 268 |
+
# Verify metadata is set correctly before processing
|
| 269 |
+
assert sample_doc.metadata["github_repo"] == "byoung/ai-me", (
|
| 270 |
+
"Sample doc metadata should have github_repo"
|
| 271 |
+
)
|
| 272 |
+
|
| 273 |
+
data_config = DataManagerConfig()
|
| 274 |
+
data_manager = DataManager(config=data_config)
|
| 275 |
+
processed_docs = data_manager.process_documents([sample_doc])
|
| 276 |
+
|
| 277 |
+
assert len(processed_docs) == 1, "Expected 1 processed document"
|
| 278 |
+
processed_content = processed_docs[0].page_content
|
| 279 |
+
|
| 280 |
+
# Check that relative links have been converted to absolute GitHub URLs
|
| 281 |
+
assert "https://github.com/byoung/ai-me/blob/main/resume.md" in processed_content, (
|
| 282 |
+
f"Expected absolute GitHub URL for /resume.md in processed content, "
|
| 283 |
+
f"but got: {processed_content}"
|
| 284 |
+
)
|
| 285 |
+
assert "https://github.com/byoung/ai-me/blob/main/projects.md" in processed_content, (
|
| 286 |
+
f"Expected absolute GitHub URL for /projects.md in processed content, "
|
| 287 |
+
f"but got: {processed_content}"
|
| 288 |
+
)
|
| 289 |
+
|
| 290 |
+
logger.info("✓ Test passed: Relative GitHub links converted to absolute URLs")
|
| 291 |
+
logger.info(f" Original: [my resume](/resume.md)")
|
| 292 |
+
logger.info(f" Converted: [my resume](https://github.com/byoung/ai-me/blob/main/resume.md)")
|
| 293 |
+
|
| 294 |
+
|
| 295 |
+
@pytest.mark.asyncio
|
| 296 |
+
async def test_agent_responses_cite_sources(ai_me_agent):
|
| 297 |
+
"""Tests FR-004, FR-011: Agent responses include source citations.
|
| 298 |
+
|
| 299 |
+
Validates that agent responses include proper source attribution,
|
| 300 |
+
which could be GitHub URLs, local paths, or explicit source references.
|
| 301 |
+
"""
|
| 302 |
+
questions = [
|
| 303 |
+
"What do you know about ReaR?",
|
| 304 |
+
"Tell me about your experience in technology",
|
| 305 |
+
]
|
| 306 |
+
|
| 307 |
+
for question in questions:
|
| 308 |
+
logger.info(f"\n{'='*60}\nSource citation test: {question}\n{'='*60}")
|
| 309 |
+
|
| 310 |
+
response = await ai_me_agent.run(question)
|
| 311 |
+
|
| 312 |
+
# Check that response includes some form of source attribution
|
| 313 |
+
# Could be: GitHub URL, local path, "Sources" section, etc.
|
| 314 |
+
has_source = (
|
| 315 |
+
"https://github.com/" in response or
|
| 316 |
+
".md" in response or # Local markdown file reference
|
| 317 |
+
"source" in response.lower() or
|
| 318 |
+
"documentation" in response.lower()
|
| 319 |
+
)
|
| 320 |
+
assert has_source, (
|
| 321 |
+
f"Expected source attribution in response to '{question}' "
|
| 322 |
+
f"but found none. Response: {response}"
|
| 323 |
+
)
|
| 324 |
+
|
| 325 |
+
# Verify response is substantive (not just metadata)
|
| 326 |
+
min_length = 50
|
| 327 |
+
assert len(response) > min_length, (
|
| 328 |
+
f"Response to '{question}' was too short: {response}"
|
| 329 |
+
)
|
| 330 |
+
|
| 331 |
+
logger.info(f"✓ Source citation found for: {question[:40]}...")
|
| 332 |
+
|
| 333 |
+
logger.info("\n✓ Test passed: Agent responses cite sources (FR-004, FR-011)")
|
| 334 |
+
|
| 335 |
+
|
| 336 |
+
@pytest.mark.asyncio
|
| 337 |
+
async def test_user_story_2_multi_topic_consistency(ai_me_agent):
|
| 338 |
+
"""
|
| 339 |
+
Tests FR-001, FR-003, FR-005, NFR-002: User Story 2 - Multi-Topic Consistency
|
| 340 |
+
|
| 341 |
+
Verify that the agent maintains consistent first-person perspective
|
| 342 |
+
across multiple conversation topics.
|
| 343 |
+
|
| 344 |
+
This tests that the agent:
|
| 345 |
+
- Uses first-person perspective (I, my, me) consistently
|
| 346 |
+
- Maintains professional tone across different topic switches
|
| 347 |
+
- Shows context awareness of different topics
|
| 348 |
+
- Remains in-character as the personified individual
|
| 349 |
+
"""
|
| 350 |
+
# Ask questions about different topics
|
| 351 |
+
topics = [
|
| 352 |
+
("What is your background in technology?", "background|experience|technology"),
|
| 353 |
+
("What programming languages are you skilled in?", "programming|language|skilled"),
|
| 354 |
+
]
|
| 355 |
+
|
| 356 |
+
first_person_patterns = [
|
| 357 |
+
r"\bi\b", r"\bme\b", r"\bmy\b", r"\bmyself\b",
|
| 358 |
+
r"\bI['m]", r"\bI['ve]", r"\bI['ll]"
|
| 359 |
+
]
|
| 360 |
+
|
| 361 |
+
for question, topic_keywords in topics:
|
| 362 |
+
logger.info(f"\n{'='*60}\nMulti-topic test question: {question}\n{'='*60}")
|
| 363 |
+
|
| 364 |
+
response = await ai_me_agent.run(question)
|
| 365 |
+
response_lower = response.lower()
|
| 366 |
+
|
| 367 |
+
# Check for first-person usage
|
| 368 |
+
first_person_found = any(
|
| 369 |
+
re.search(pattern, response, re.IGNORECASE)
|
| 370 |
+
for pattern in first_person_patterns
|
| 371 |
+
)
|
| 372 |
+
assert first_person_found, (
|
| 373 |
+
f"Expected first-person perspective in response to '{question}' "
|
| 374 |
+
f"but got: {response}"
|
| 375 |
+
)
|
| 376 |
+
|
| 377 |
+
# Verify response is substantive (not just "I don't know")
|
| 378 |
+
min_length = 50 # Substantive responses should be > 50 chars
|
| 379 |
+
assert len(response) > min_length, (
|
| 380 |
+
f"Response to '{question}' was too short (likely not substantive): {response}"
|
| 381 |
+
)
|
| 382 |
+
|
| 383 |
+
logger.info(f"✓ First-person perspective maintained for: {question[:40]}...")
|
| 384 |
+
logger.info(f" Response preview: {response[:100]}...")
|
| 385 |
+
|
| 386 |
+
logger.info("\n✓ Test passed: Consistent first-person perspective across 3+ topics")
|
| 387 |
+
|
| 388 |
+
|
| 389 |
+
@pytest.mark.asyncio
|
| 390 |
+
async def test_tool_failure_error_messages_are_friendly(caplog, ai_me_agent):
|
| 391 |
+
"""
|
| 392 |
+
Tests FR-012, NFR-003: Error Message Quality (FR-012)
|
| 393 |
+
|
| 394 |
+
Verify that tool failures return user-friendly messages without Python tracebacks.
|
| 395 |
+
|
| 396 |
+
This tests that the agent:
|
| 397 |
+
- Returns human-readable error messages
|
| 398 |
+
- logs an error that can be reviewed in our dashboard/logs
|
| 399 |
+
|
| 400 |
+
Uses mocking to simulate tool failures without adding test-specific code to agent.py
|
| 401 |
+
"""
|
| 402 |
+
logger.info(f"\n{'='*60}\nError Handling Test\n{'='*60}")
|
| 403 |
+
|
| 404 |
+
# Mock the Runner.run method to simulate a tool failure
|
| 405 |
+
# This tests the catch-all exception handler without adding test code to production
|
| 406 |
+
test_scenarios = [
|
| 407 |
+
RuntimeError("Simulated tool timeout"),
|
| 408 |
+
ValueError("Invalid tool parameters"),
|
| 409 |
+
]
|
| 410 |
+
|
| 411 |
+
for error in test_scenarios:
|
| 412 |
+
logger.info(f"\nTesting error scenario: {error.__class__.__name__}: {error}")
|
| 413 |
+
|
| 414 |
+
# Clear previous log records for this iteration
|
| 415 |
+
caplog.clear()
|
| 416 |
+
|
| 417 |
+
# Mock Runner.run to raise an exception
|
| 418 |
+
with patch('agent.Runner.run', new_callable=AsyncMock) as mock_run:
|
| 419 |
+
mock_run.side_effect = error
|
| 420 |
+
|
| 421 |
+
response = await ai_me_agent.run("Any user question")
|
| 422 |
+
|
| 423 |
+
logger.info(f"Response: {response[:100]}...")
|
| 424 |
+
|
| 425 |
+
# PRIMARY CHECK: Verify "I encountered an unexpected error" is in response
|
| 426 |
+
assert "I encountered an unexpected error" in response, (
|
| 427 |
+
f"Response must contain 'I encountered an unexpected error'. Got: {response}"
|
| 428 |
+
)
|
| 429 |
+
|
| 430 |
+
# SECONDARY CHECK: Verify error was logged by agent.py
|
| 431 |
+
error_logs = [record for record in caplog.records if record.levelname == "ERROR"]
|
| 432 |
+
assert len(error_logs) > 0, "Expected at least one ERROR log record from agent.py"
|
| 433 |
+
|
| 434 |
+
# Find the agent.py error log (contains "Unexpected error:")
|
| 435 |
+
agent_error_logged = any(
|
| 436 |
+
"Unexpected error:" in record.message for record in error_logs
|
| 437 |
+
)
|
| 438 |
+
assert agent_error_logged, (
|
| 439 |
+
f"Expected ERROR log with 'Unexpected error:' from agent.py. "
|
| 440 |
+
f"Got: {[r.message for r in error_logs]}"
|
| 441 |
+
)
|
| 442 |
+
error_messages = [
|
| 443 |
+
r.message for r in error_logs
|
| 444 |
+
if "Unexpected error:" in r.message
|
| 445 |
+
]
|
| 446 |
+
logger.info(
|
| 447 |
+
f"✓ Error properly logged to logger: {error_messages}"
|
| 448 |
+
)
|
| 449 |
+
|
| 450 |
+
logger.info("\n✓ Test passed: Error messages are friendly (FR-012) + properly logged")
|
| 451 |
+
|
| 452 |
+
|
| 453 |
+
@pytest.mark.asyncio
|
| 454 |
+
async def test_logger_setup_format(caplog):
|
| 455 |
+
"""Tests NFR-003 (Structured Logging): Verify setup_logger creates structured logging.
|
| 456 |
+
|
| 457 |
+
Tests that setup_logger() configures syslog-style format with JSON support for
|
| 458 |
+
structured logging of user/agent interactions.
|
| 459 |
+
|
| 460 |
+
This validates the logger configuration that our production app relies on
|
| 461 |
+
for analytics and debugging.
|
| 462 |
+
"""
|
| 463 |
+
# Force logger setup to run by clearing handlers so setup_logger reconfigures
|
| 464 |
+
root_logger = logging.getLogger()
|
| 465 |
+
original_handlers = root_logger.handlers[:]
|
| 466 |
+
for handler in root_logger.handlers[:]:
|
| 467 |
+
root_logger.removeHandler(handler)
|
| 468 |
+
|
| 469 |
+
try:
|
| 470 |
+
# Now call setup_logger with no handlers - should trigger full setup
|
| 471 |
+
test_logger = setup_logger("test.structured_logging")
|
| 472 |
+
|
| 473 |
+
# Verify logger was created
|
| 474 |
+
assert test_logger.name == "test.structured_logging"
|
| 475 |
+
|
| 476 |
+
# Verify root logger now has handlers (setup_logger should have added them)
|
| 477 |
+
assert len(root_logger.handlers) > 0, (
|
| 478 |
+
"Root logger should have handlers after setup_logger"
|
| 479 |
+
)
|
| 480 |
+
|
| 481 |
+
# Verify we have a StreamHandler (console output)
|
| 482 |
+
has_stream_handler = any(
|
| 483 |
+
isinstance(handler, logging.StreamHandler)
|
| 484 |
+
for handler in root_logger.handlers
|
| 485 |
+
)
|
| 486 |
+
assert has_stream_handler, "Should have StreamHandler for console output"
|
| 487 |
+
|
| 488 |
+
# Test that logging works with structured JSON format
|
| 489 |
+
# The formatters should support JSON logging for analytics
|
| 490 |
+
test_logger.info(
|
| 491 |
+
'{"session_id": "test-session", "user_input": "test message"}'
|
| 492 |
+
)
|
| 493 |
+
|
| 494 |
+
logger.info(
|
| 495 |
+
"✓ Test passed: Logger setup configures structured logging (NFR-003)"
|
| 496 |
+
)
|
| 497 |
+
finally:
|
| 498 |
+
# Restore original handlers
|
| 499 |
+
for handler in root_logger.handlers[:]:
|
| 500 |
+
root_logger.removeHandler(handler)
|
| 501 |
+
for handler in original_handlers:
|
| 502 |
+
root_logger.addHandler(handler)
|
| 503 |
+
|
| 504 |
+
|
| 505 |
+
if __name__ == "__main__":
|
| 506 |
+
# Allow running tests directly with python test.py
|
| 507 |
+
pytest.main([__file__, "-v", "-s"])
|
tests/unit/test_config.py
ADDED
|
@@ -0,0 +1,36 @@
|
| 1 |
+
"""
|
| 2 |
+
Unit tests for config.py Config and DataManagerConfig classes.
|
| 3 |
+
|
| 4 |
+
Tests configuration validation, Pydantic models, and environment variable parsing
|
| 5 |
+
in isolation without requiring full application setup.
|
| 6 |
+
"""
|
| 7 |
+
import logging
|
| 8 |
+
|
| 9 |
+
from config import Config
|
| 10 |
+
|
| 11 |
+
|
| 12 |
+
def test_config_github_repos_parsing():
|
| 13 |
+
"""Tests NFR-002 (Type-Safe Configuration): Config.parse_github_repos validator.
|
| 14 |
+
|
| 15 |
+
Validates that the field validator correctly parses comma-separated repository
|
| 16 |
+
strings from environment variables, including edge cases like empty strings and
|
| 17 |
+
pre-parsed lists. Ensures configuration is validated via Pydantic with strict
|
| 18 |
+
typing and no silent failures.
|
| 19 |
+
"""
|
| 20 |
+
# Test empty string
|
| 21 |
+
result = Config.parse_github_repos("")
|
| 22 |
+
assert result == [], "Empty string should parse to empty list"
|
| 23 |
+
|
| 24 |
+
# Test single repo
|
| 25 |
+
result = Config.parse_github_repos("owner/repo")
|
| 26 |
+
assert result == ["owner/repo"], "Single repo should parse correctly"
|
| 27 |
+
|
| 28 |
+
# Test multiple repos with spaces
|
| 29 |
+
result = Config.parse_github_repos("owner1/repo1, owner2/repo2 , owner3/repo3")
|
| 30 |
+
assert result == ["owner1/repo1", "owner2/repo2", "owner3/repo3"], (
|
| 31 |
+
"Multiple repos with spaces should parse and strip correctly"
|
| 32 |
+
)
|
| 33 |
+
|
| 34 |
+
# Test already a list
|
| 35 |
+
result = Config.parse_github_repos(["owner/repo"])
|
| 36 |
+
assert result == ["owner/repo"], "Already a list should pass through"
|
tests/unit/test_data.py
ADDED
|
@@ -0,0 +1,175 @@
|
| 1 |
+
"""
|
| 2 |
+
Unit tests for data.py DataManager class.
|
| 3 |
+
|
| 4 |
+
Tests individual methods of the DataManager and DataManagerConfig in isolation,
|
| 5 |
+
without requiring external APIs or full integration setup.
|
| 6 |
+
"""
|
| 7 |
+
import pytest
|
| 8 |
+
import os
|
| 9 |
+
from pathlib import Path
|
| 10 |
+
from unittest.mock import patch, MagicMock
|
| 11 |
+
from langchain_core.documents import Document
|
| 12 |
+
|
| 13 |
+
from data import DataManager, DataManagerConfig
|
| 14 |
+
|
| 15 |
+
|
| 16 |
+
class TestLoadLocalDocuments:
|
| 17 |
+
"""Tests for DataManager.load_local_documents() method.
|
| 18 |
+
|
| 19 |
+
Implements FR-002 (Knowledge Retrieval): Document loading from local filesystem.
|
| 20 |
+
"""
|
| 21 |
+
|
| 22 |
+
def test_load_local_documents_missing_directory(self):
|
| 23 |
+
"""Tests FR-002: Handle missing doc_root gracefully.
|
| 24 |
+
|
| 25 |
+
When doc_root directory doesn't exist, load_local_documents should
|
| 26 |
+
return empty list and log warning instead of raising exception.
|
| 27 |
+
"""
|
| 28 |
+
# Create config pointing to non-existent directory
|
| 29 |
+
config = DataManagerConfig(doc_root="/nonexistent/path/xyz")
|
| 30 |
+
dm = DataManager(config=config)
|
| 31 |
+
|
| 32 |
+
# Should return empty list, not raise exception
|
| 33 |
+
docs = dm.load_local_documents()
|
| 34 |
+
|
| 35 |
+
assert docs == [], "Expected empty list for missing directory"
|
| 36 |
+
assert isinstance(docs, list), "Expected list return type"
|
| 37 |
+
|
| 38 |
+
def test_load_local_documents_valid_directory(self):
|
| 39 |
+
"""Tests FR-002: Load documents from existing directory.
|
| 40 |
+
|
| 41 |
+
When doc_root exists, load_local_documents should return loaded documents.
|
| 42 |
+
Uses tests/data directory which contains sample markdown files.
|
| 43 |
+
"""
|
| 44 |
+
# Use test data directory
|
| 45 |
+
test_data_dir = str(Path(__file__).parent.parent / "data")
|
| 46 |
+
config = DataManagerConfig(doc_root=test_data_dir)
|
| 47 |
+
dm = DataManager(config=config)
|
| 48 |
+
|
| 49 |
+
# Should load documents
|
| 50 |
+
docs = dm.load_local_documents()
|
| 51 |
+
|
| 52 |
+
assert isinstance(docs, list), "Expected list return type"
|
| 53 |
+
assert len(docs) > 0, "Expected to find documents in tests/data"
|
| 54 |
+
|
| 55 |
+
# Verify documents have required metadata
|
| 56 |
+
for doc in docs:
|
| 57 |
+
assert "source" in doc.metadata, "Document should have source metadata"
|
| 58 |
+
assert doc.page_content, "Document should have content"
|
| 59 |
+
|
| 60 |
+
def test_load_local_documents_multiple_glob_patterns(self):
|
| 61 |
+
"""Tests FR-002: Load documents using multiple glob patterns (lines 81-83).
|
| 62 |
+
|
| 63 |
+
Tests the for loop iteration over multiple glob patterns in load_local_documents.
|
| 64 |
+
This covers lines 81-83 where patterns are iterated and loaded.
|
| 65 |
+
"""
|
| 66 |
+
# Use test data directory
|
| 67 |
+
test_data_dir = str(Path(__file__).parent.parent / "data")
|
| 68 |
+
|
| 69 |
+
# Create config with multiple glob patterns
|
| 70 |
+
config = DataManagerConfig(
|
| 71 |
+
doc_root=test_data_dir,
|
| 72 |
+
doc_load_local=["*.md", "**/*.md"] # Multiple patterns
|
| 73 |
+
)
|
| 74 |
+
dm = DataManager(config=config)
|
| 75 |
+
|
| 76 |
+
# Should load documents from all patterns
|
| 77 |
+
docs = dm.load_local_documents()
|
| 78 |
+
|
| 79 |
+
assert isinstance(docs, list), "Expected list return type"
|
| 80 |
+
assert len(docs) > 0, "Expected to find documents with multiple patterns"
|
| 81 |
+
|
| 82 |
+
# Verify all patterns were processed (should have more docs due to overlap)
|
| 83 |
+
assert len(docs) >= 3, "Expected at least 3 docs from test data"
|
| 84 |
+
|
| 85 |
+
|
| 86 |
+
class TestProcessDocuments:
|
| 87 |
+
"""Tests for DataManager.process_documents() method.
|
| 88 |
+
|
| 89 |
+
Implements FR-004 (Source Attribution): Converting relative GitHub links
|
| 90 |
+
to absolute URLs in markdown documents.
|
| 91 |
+
"""
|
| 92 |
+
|
| 93 |
+
def test_process_documents_converts_relative_links_to_absolute(self):
|
| 94 |
+
"""Tests FR-004: Relative GitHub links converted to absolute URLs.
|
| 95 |
+
|
| 96 |
+
Verifies that process_documents rewrites relative links like /path/file.md
|
| 97 |
+
to absolute GitHub URLs like https://github.com/owner/repo/blob/main/path/file.md
|
| 98 |
+
"""
|
| 99 |
+
# Create a sample document with relative GitHub links
|
| 100 |
+
sample_doc = Document(
|
| 101 |
+
page_content=(
|
| 102 |
+
"Check out [my resume](/resume.md) and "
|
| 103 |
+
"[projects](/projects.md) for more info."
|
| 104 |
+
),
|
| 105 |
+
metadata={
|
| 106 |
+
"source": "github://byoung/ai-me/docs/about.md",
|
| 107 |
+
"github_repo": "byoung/ai-me"
|
| 108 |
+
}
|
| 109 |
+
)
|
| 110 |
+
|
| 111 |
+
config = DataManagerConfig()
|
| 112 |
+
dm = DataManager(config=config)
|
| 113 |
+
|
| 114 |
+
# Process the document
|
| 115 |
+
processed_docs = dm.process_documents([sample_doc])
|
| 116 |
+
|
| 117 |
+
assert len(processed_docs) == 1, "Expected 1 processed document"
|
| 118 |
+
processed_content = processed_docs[0].page_content
|
| 119 |
+
|
| 120 |
+
# Verify relative links were converted to absolute GitHub URLs
|
| 121 |
+
assert "https://github.com/byoung/ai-me/blob/main/resume.md" in processed_content, (
|
| 122 |
+
f"Expected absolute URL for /resume.md in: {processed_content}"
|
| 123 |
+
)
|
| 124 |
+
assert "https://github.com/byoung/ai-me/blob/main/projects.md" in processed_content, (
|
| 125 |
+
f"Expected absolute URL for /projects.md in: {processed_content}"
|
| 126 |
+
)
|
| 127 |
+
|
| 128 |
+
def test_process_documents_preserves_non_github_docs(self):
|
| 129 |
+
"""Tests FR-004: Non-GitHub documents are preserved unchanged.
|
| 130 |
+
|
| 131 |
+
Documents without github_repo metadata should pass through unchanged.
|
| 132 |
+
"""
|
| 133 |
+
# Create a document without github_repo metadata
|
| 134 |
+
sample_doc = Document(
|
| 135 |
+
page_content="[my resume](/resume.md)",
|
| 136 |
+
metadata={
|
| 137 |
+
"source": "local://docs/about.md"
|
| 138 |
+
}
|
| 139 |
+
)
|
| 140 |
+
|
| 141 |
+
config = DataManagerConfig()
|
| 142 |
+
dm = DataManager(config=config)
|
| 143 |
+
|
| 144 |
+
# Process the document
|
| 145 |
+
processed_docs = dm.process_documents([sample_doc])
|
| 146 |
+
|
| 147 |
+
assert len(processed_docs) == 1, "Expected 1 processed document"
|
| 148 |
+
# Content should be unchanged (no github_repo in metadata)
|
| 149 |
+
assert processed_docs[0].page_content == "[my resume](/resume.md)", (
|
| 150 |
+
"Non-GitHub document should not be modified"
|
| 151 |
+
)
|
| 152 |
+
|
| 153 |
+
def test_process_documents_handles_markdown_with_anchors(self):
|
| 154 |
+
"""Tests FR-004: Markdown links with anchor fragments are preserved.
|
| 155 |
+
|
| 156 |
+
Links like [text](/file.md#section) should preserve the anchor in the URL.
|
| 157 |
+
"""
|
| 158 |
+
sample_doc = Document(
|
| 159 |
+
page_content="See [section](/docs/guide.md#installation) for details.",
|
| 160 |
+
metadata={
|
| 161 |
+
"source": "github://user/repo/README.md",
|
| 162 |
+
"github_repo": "user/repo"
|
| 163 |
+
}
|
| 164 |
+
)
|
| 165 |
+
|
| 166 |
+
config = DataManagerConfig()
|
| 167 |
+
dm = DataManager(config=config)
|
| 168 |
+
|
| 169 |
+
processed_docs = dm.process_documents([sample_doc])
|
| 170 |
+
processed_content = processed_docs[0].page_content
|
| 171 |
+
|
| 172 |
+
# Verify anchor is preserved in the URL
|
| 173 |
+
assert "https://github.com/user/repo/blob/main/docs/guide.md#installation" in processed_content, (
|
| 174 |
+
f"Expected anchor preserved in URL in: {processed_content}"
|
| 175 |
+
)
|
uv.lock
CHANGED
|
@@ -5,7 +5,7 @@ requires-python = "==3.12.*"
|
|
| 5 |
[[package]]
|
| 6 |
name = "ai-me"
|
| 7 |
version = "0.1.0"
|
| 8 |
-
source = {
|
| 9 |
dependencies = [
|
| 10 |
{ name = "chromadb" },
|
| 11 |
{ name = "fastmcp" },
|
|
@@ -36,6 +36,7 @@ dev = [
|
|
| 36 |
{ name = "ipywidgets" },
|
| 37 |
{ name = "pytest" },
|
| 38 |
{ name = "pytest-asyncio" },
|
|
|
|
| 39 |
]
|
| 40 |
|
| 41 |
[package.metadata]
|
|
@@ -69,6 +70,7 @@ dev = [
|
|
| 69 |
{ name = "ipywidgets", specifier = "~=8.1" },
|
| 70 |
{ name = "pytest", specifier = "~=8.0" },
|
| 71 |
{ name = "pytest-asyncio", specifier = "~=0.24" },
|
|
|
|
| 72 |
]
|
| 73 |
|
| 74 |
[[package]]
|
|
@@ -418,6 +420,28 @@ wheels = [
|
|
| 418 |
{ url = "https://files.pythonhosted.org/packages/60/97/891a0971e1e4a8c5d2b20bbe0e524dc04548d2307fee33cdeba148fd4fc7/comm-0.2.3-py3-none-any.whl", hash = "sha256:c615d91d75f7f04f095b30d1c1711babd43bdc6419c1be9886a85f2f4e489417", size = 7294, upload-time = "2025-07-25T14:02:02.896Z" },
|
| 419 |
]
|
| 420 |
|
| 421 |
[[package]]
|
| 422 |
name = "cryptography"
|
| 423 |
version = "46.0.2"
|
|
@@ -2331,6 +2355,20 @@ wheels = [
|
|
| 2331 |
{ url = "https://files.pythonhosted.org/packages/20/7f/338843f449ace853647ace35870874f69a764d251872ed1b4de9f234822c/pytest_asyncio-0.26.0-py3-none-any.whl", hash = "sha256:7b51ed894f4fbea1340262bdae5135797ebbe21d8638978e35d31c6d19f72fb0", size = 19694, upload-time = "2025-03-25T06:22:27.807Z" },
|
| 2332 |
]
|
| 2333 |
|
| 2334 |
[[package]]
|
| 2335 |
name = "python-dateutil"
|
| 2336 |
version = "2.9.0.post0"
|
|
|
|
| 5 |
[[package]]
|
| 6 |
name = "ai-me"
|
| 7 |
version = "0.1.0"
|
| 8 |
+
source = { editable = "." }
|
| 9 |
dependencies = [
|
| 10 |
{ name = "chromadb" },
|
| 11 |
{ name = "fastmcp" },
|
|
|
|
| 36 |
{ name = "ipywidgets" },
|
| 37 |
{ name = "pytest" },
|
| 38 |
{ name = "pytest-asyncio" },
|
| 39 |
+
{ name = "pytest-cov" },
|
| 40 |
]
|
| 41 |
|
| 42 |
[package.metadata]
|
|
|
|
| 70 |
{ name = "ipywidgets", specifier = "~=8.1" },
|
| 71 |
{ name = "pytest", specifier = "~=8.0" },
|
| 72 |
{ name = "pytest-asyncio", specifier = "~=0.24" },
|
| 73 |
+
{ name = "pytest-cov", specifier = "~=6.0" },
|
| 74 |
]
|
| 75 |
|
| 76 |
[[package]]
|
|
|
|
| 420 |
{ url = "https://files.pythonhosted.org/packages/60/97/891a0971e1e4a8c5d2b20bbe0e524dc04548d2307fee33cdeba148fd4fc7/comm-0.2.3-py3-none-any.whl", hash = "sha256:c615d91d75f7f04f095b30d1c1711babd43bdc6419c1be9886a85f2f4e489417", size = 7294, upload-time = "2025-07-25T14:02:02.896Z" },
|
| 421 |
]
|
| 422 |
|
| 423 |
+
[[package]]
|
| 424 |
+
name = "coverage"
|
| 425 |
+
version = "7.11.0"
|
| 426 |
+
source = { registry = "https://pypi.org/simple" }
|
| 427 |
+
sdist = { url = "https://files.pythonhosted.org/packages/1c/38/ee22495420457259d2f3390309505ea98f98a5eed40901cf62196abad006/coverage-7.11.0.tar.gz", hash = "sha256:167bd504ac1ca2af7ff3b81d245dfea0292c5032ebef9d66cc08a7d28c1b8050", size = 811905, upload-time = "2025-10-15T15:15:08.542Z" }
|
| 428 |
+
wheels = [
|
| 429 |
+
{ url = "https://files.pythonhosted.org/packages/c4/db/86f6906a7c7edc1a52b2c6682d6dd9be775d73c0dfe2b84f8923dfea5784/coverage-7.11.0-cp312-cp312-macosx_10_13_x86_64.whl", hash = "sha256:9c49e77811cf9d024b95faf86c3f059b11c0c9be0b0d61bc598f453703bd6fd1", size = 216098, upload-time = "2025-10-15T15:13:02.916Z" },
|
| 430 |
+
{ url = "https://files.pythonhosted.org/packages/21/54/e7b26157048c7ba555596aad8569ff903d6cd67867d41b75287323678ede/coverage-7.11.0-cp312-cp312-macosx_11_0_arm64.whl", hash = "sha256:a61e37a403a778e2cda2a6a39abcc895f1d984071942a41074b5c7ee31642007", size = 216331, upload-time = "2025-10-15T15:13:04.403Z" },
|
| 431 |
+
{ url = "https://files.pythonhosted.org/packages/b9/19/1ce6bf444f858b83a733171306134a0544eaddf1ca8851ede6540a55b2ad/coverage-7.11.0-cp312-cp312-manylinux1_i686.manylinux_2_28_i686.manylinux_2_5_i686.whl", hash = "sha256:c79cae102bb3b1801e2ef1511fb50e91ec83a1ce466b2c7c25010d884336de46", size = 247825, upload-time = "2025-10-15T15:13:05.92Z" },
|
| 432 |
+
{ url = "https://files.pythonhosted.org/packages/71/0b/d3bcbbc259fcced5fb67c5d78f6e7ee965f49760c14afd931e9e663a83b2/coverage-7.11.0-cp312-cp312-manylinux1_x86_64.manylinux_2_28_x86_64.manylinux_2_5_x86_64.whl", hash = "sha256:16ce17ceb5d211f320b62df002fa7016b7442ea0fd260c11cec8ce7730954893", size = 250573, upload-time = "2025-10-15T15:13:07.471Z" },
|
| 433 |
+
{ url = "https://files.pythonhosted.org/packages/58/8d/b0ff3641a320abb047258d36ed1c21d16be33beed4152628331a1baf3365/coverage-7.11.0-cp312-cp312-manylinux2014_aarch64.manylinux_2_17_aarch64.manylinux_2_28_aarch64.whl", hash = "sha256:80027673e9d0bd6aef86134b0771845e2da85755cf686e7c7c59566cf5a89115", size = 251706, upload-time = "2025-10-15T15:13:09.4Z" },
|
| 434 |
+
{ url = "https://files.pythonhosted.org/packages/59/c8/5a586fe8c7b0458053d9c687f5cff515a74b66c85931f7fe17a1c958b4ac/coverage-7.11.0-cp312-cp312-manylinux_2_31_riscv64.manylinux_2_39_riscv64.whl", hash = "sha256:4d3ffa07a08657306cd2215b0da53761c4d73cb54d9143b9303a6481ec0cd415", size = 248221, upload-time = "2025-10-15T15:13:10.964Z" },
|
| 435 |
+
{ url = "https://files.pythonhosted.org/packages/d0/ff/3a25e3132804ba44cfa9a778cdf2b73dbbe63ef4b0945e39602fc896ba52/coverage-7.11.0-cp312-cp312-musllinux_1_2_aarch64.whl", hash = "sha256:a3b6a5f8b2524fd6c1066bc85bfd97e78709bb5e37b5b94911a6506b65f47186", size = 249624, upload-time = "2025-10-15T15:13:12.5Z" },
|
| 436 |
+
{ url = "https://files.pythonhosted.org/packages/c5/12/ff10c8ce3895e1b17a73485ea79ebc1896a9e466a9d0f4aef63e0d17b718/coverage-7.11.0-cp312-cp312-musllinux_1_2_i686.whl", hash = "sha256:fcc0a4aa589de34bc56e1a80a740ee0f8c47611bdfb28cd1849de60660f3799d", size = 247744, upload-time = "2025-10-15T15:13:14.554Z" },
|
| 437 |
+
{ url = "https://files.pythonhosted.org/packages/16/02/d500b91f5471b2975947e0629b8980e5e90786fe316b6d7299852c1d793d/coverage-7.11.0-cp312-cp312-musllinux_1_2_riscv64.whl", hash = "sha256:dba82204769d78c3fd31b35c3d5f46e06511936c5019c39f98320e05b08f794d", size = 247325, upload-time = "2025-10-15T15:13:16.438Z" },
|
| 438 |
+
{ url = "https://files.pythonhosted.org/packages/77/11/dee0284fbbd9cd64cfce806b827452c6df3f100d9e66188e82dfe771d4af/coverage-7.11.0-cp312-cp312-musllinux_1_2_x86_64.whl", hash = "sha256:81b335f03ba67309a95210caf3eb43bd6fe75a4e22ba653ef97b4696c56c7ec2", size = 249180, upload-time = "2025-10-15T15:13:17.959Z" },
|
| 439 |
+
{ url = "https://files.pythonhosted.org/packages/59/1b/cdf1def928f0a150a057cab03286774e73e29c2395f0d30ce3d9e9f8e697/coverage-7.11.0-cp312-cp312-win32.whl", hash = "sha256:037b2d064c2f8cc8716fe4d39cb705779af3fbf1ba318dc96a1af858888c7bb5", size = 218479, upload-time = "2025-10-15T15:13:19.608Z" },
|
| 440 |
+
{ url = "https://files.pythonhosted.org/packages/ff/55/e5884d55e031da9c15b94b90a23beccc9d6beee65e9835cd6da0a79e4f3a/coverage-7.11.0-cp312-cp312-win_amd64.whl", hash = "sha256:d66c0104aec3b75e5fd897e7940188ea1892ca1d0235316bf89286d6a22568c0", size = 219290, upload-time = "2025-10-15T15:13:21.593Z" },
|
| 441 |
+
{ url = "https://files.pythonhosted.org/packages/23/a8/faa930cfc71c1d16bc78f9a19bb73700464f9c331d9e547bfbc1dbd3a108/coverage-7.11.0-cp312-cp312-win_arm64.whl", hash = "sha256:d91ebeac603812a09cf6a886ba6e464f3bbb367411904ae3790dfe28311b15ad", size = 217924, upload-time = "2025-10-15T15:13:23.39Z" },
|
| 442 |
+
{ url = "https://files.pythonhosted.org/packages/5f/04/642c1d8a448ae5ea1369eac8495740a79eb4e581a9fb0cbdce56bbf56da1/coverage-7.11.0-py3-none-any.whl", hash = "sha256:4b7589765348d78fb4e5fb6ea35d07564e387da2fc5efff62e0222971f155f68", size = 207761, upload-time = "2025-10-15T15:15:06.439Z" },
|
| 443 |
+
]
|
| 444 |
+
|
| 445 |
[[package]]
|
| 446 |
name = "cryptography"
|
| 447 |
version = "46.0.2"
|
|
|
|
| 2355 |
{ url = "https://files.pythonhosted.org/packages/20/7f/338843f449ace853647ace35870874f69a764d251872ed1b4de9f234822c/pytest_asyncio-0.26.0-py3-none-any.whl", hash = "sha256:7b51ed894f4fbea1340262bdae5135797ebbe21d8638978e35d31c6d19f72fb0", size = 19694, upload-time = "2025-03-25T06:22:27.807Z" },
|
| 2356 |
]
|
| 2357 |
|
| 2358 |
+
[[package]]
|
| 2359 |
+
name = "pytest-cov"
|
| 2360 |
+
version = "6.3.0"
|
| 2361 |
+
source = { registry = "https://pypi.org/simple" }
|
| 2362 |
+
dependencies = [
|
| 2363 |
+
{ name = "coverage" },
|
| 2364 |
+
{ name = "pluggy" },
|
| 2365 |
+
{ name = "pytest" },
|
| 2366 |
+
]
|
| 2367 |
+
sdist = { url = "https://files.pythonhosted.org/packages/30/4c/f883ab8f0daad69f47efdf95f55a66b51a8b939c430dadce0611508d9e99/pytest_cov-6.3.0.tar.gz", hash = "sha256:35c580e7800f87ce892e687461166e1ac2bcb8fb9e13aea79032518d6e503ff2", size = 70398, upload-time = "2025-09-06T15:40:14.361Z" }
|
| 2368 |
+
wheels = [
|
| 2369 |
+
{ url = "https://files.pythonhosted.org/packages/80/b4/bb7263e12aade3842b938bc5c6958cae79c5ee18992f9b9349019579da0f/pytest_cov-6.3.0-py3-none-any.whl", hash = "sha256:440db28156d2468cafc0415b4f8e50856a0d11faefa38f30906048fe490f1749", size = 25115, upload-time = "2025-09-06T15:40:12.44Z" },
|
| 2370 |
+
]
|
| 2371 |
+
|
| 2372 |
[[package]]
|
| 2373 |
name = "python-dateutil"
|
| 2374 |
version = "2.9.0.post0"
|