diff --git a/.gitignore b/.gitignore
index 385994fa..aa4442be 100644
--- a/.gitignore
+++ b/.gitignore
@@ -73,3 +73,7 @@ databricks-tools-core/tests/integration/pdf/generated_pdf/
 # Python cache
 __pycache__/
 windows_info.txt
+
+# Databricks local token files
+.DATABRICKS_TOKEN
+.databricks_cfg
diff --git a/databricks-codex/CODEX_INTEGRATION_PLAN.md b/databricks-codex/CODEX_INTEGRATION_PLAN.md
new file mode 100644
index 00000000..5cfbf823
--- /dev/null
+++ b/databricks-codex/CODEX_INTEGRATION_PLAN.md
@@ -0,0 +1,987 @@
+# Databricks Codex Integration Plugin
+
+## Complete Implementation Plan for ai-dev-kit
+
+---
+
+## Table of Contents
+
+1. [Executive Summary](#1-executive-summary)
+2. [Product Requirements Document (PRD)](#2-product-requirements-document-prd)
+3. [Technical Design](#3-technical-design)
+4. [Implementation Plan](#4-implementation-plan)
+5. [Non-Functional Requirements](#5-non-functional-requirements)
+6. [Gap Analysis](#6-gap-analysis)
+7. [Testing Strategy](#7-testing-strategy)
+8. [Appendix: Reference Code Patterns](#8-appendix-reference-code-patterns)
+
+---
+
+## 1. Executive Summary
+
+### Purpose
+
+Build a comprehensive `databricks-codex` module that integrates OpenAI Codex CLI with the Databricks ecosystem, providing:
+
+- **Python SDK** for programmatic Codex interaction
+- **MCP Client** for bidirectional communication with Codex-as-MCP-server
+- **Configuration Management** for unified Codex + Databricks setup
+- **Session Management** for conversation persistence and forking
+- **Complete Test Suite** with unit and integration tests
+
+### What is Codex CLI?
+
+OpenAI Codex CLI is a terminal-based coding agent that can read, modify, and execute code on your machine. Key characteristics:
+
+- Built in **Rust** (95.8%) for performance
+- Supports **MCP (Model Context Protocol)** for third-party tools
+- Multiple **sandbox modes** for security (read-only, workspace-write, full-access)
+- **Session persistence** with resume and fork capabilities
+- Available on macOS, Linux, and Windows (experimental)
+
+### Current State in ai-dev-kit
+
+The `install.sh` and `install.ps1` scripts already configure Codex with the Databricks MCP server:
+
+```toml
+# ~/.codex/config.toml
+[mcp_servers.databricks]
+command = "/path/to/.venv/bin/python"
+args = ["/path/to/databricks-mcp-server/run_server.py"]
+```
+
+This plan extends that integration with a full Python SDK.
+
+---
+
+## 2. Product Requirements Document (PRD)
+
+### 2.1 Problem Statement
+
+Developers using Codex CLI with Databricks need:
+
+1. **Programmatic access** to Codex capabilities from Python workflows
+2. **Unified authentication** bridging Databricks and Codex credentials
+3. **Robust error handling** for long-running operations
+4. **Session management** for complex multi-step tasks
+5. **Testing infrastructure** to validate integrations
+
+### 2.2 User Personas
+
+| Persona | Description | Primary Use Case |
+|---------|-------------|------------------|
+| **Data Engineer** | Builds ETL pipelines on Databricks | Use Codex to generate Spark SQL, validate with Databricks tools |
+| **ML Engineer** | Develops models with MLflow | Use Codex for code review, generate documentation |
+| **Platform Engineer** | Maintains Databricks infrastructure | Automate Codex workflows in CI/CD pipelines |
+| **Developer Advocate** | Creates demos and tutorials | Build interactive Databricks + AI experiences |
+
+### 2.3 Functional Requirements
+
+#### FR-1: Configuration Management
+
+| ID | Requirement | Priority |
+|----|-------------|----------|
+| FR-1.1 | Read/write `~/.codex/config.toml` programmatically | P0 |
+| FR-1.2 | Support project-local `.codex/config.toml` | P1 |
+| FR-1.3 | Configure Databricks MCP server with profile selection | P0 |
+| FR-1.4 | Validate TOML syntax before writing | P1 |
+| FR-1.5 | Atomic writes to prevent corruption | P0 |
+
+#### FR-2: Authentication
+
+| ID | Requirement | Priority |
+|----|-------------|----------|
+| FR-2.1 | Check Codex authentication status | P0 |
+| FR-2.2 | Support ChatGPT OAuth login | P1 |
+| FR-2.3 | Support device code flow for headless environments | P1 |
+| FR-2.4 | Support API key authentication | P1 |
+| FR-2.5 | Inject Databricks credentials into Codex environment | P0 |
+
+#### FR-3: Execution
+
+| ID | Requirement | Priority |
+|----|-------------|----------|
+| FR-3.1 | Execute `codex exec` synchronously with output capture | P0 |
+| FR-3.2 | Execute `codex exec` asynchronously with timeout handling | P0 |
+| FR-3.3 | Support all sandbox modes (read-only, workspace-write, full-access) | P0 |
+| FR-3.4 | Pass environment variables to Codex subprocess | P0 |
+| FR-3.5 | Track async operations with unique IDs | P1 |
+
+#### FR-4: MCP Client
+
+| ID | Requirement | Priority |
+|----|-------------|----------|
+| FR-4.1 | Connect to Codex running as MCP server (stdio transport) | P0 |
+| FR-4.2 | Connect via HTTP transport | P1 |
+| FR-4.3 | List available tools from Codex MCP server | P0 |
+| FR-4.4 | Call tools with argument passing | P0 |
+| FR-4.5 | Handle JSON-RPC errors gracefully | P0 |
+
+#### FR-5: Session Management
+
+| ID | Requirement | Priority |
+|----|-------------|----------|
+| FR-5.1 | List recent Codex sessions | P1 |
+| FR-5.2 | Resume a previous session by ID | P1 |
+| FR-5.3 | Fork a session into a new conversation | P2 |
+
+### 2.4 Success Metrics
+
+| Metric | Target | Measurement |
+|--------|--------|-------------|
+| Unit test coverage | > 80% | pytest-cov |
+| Integration test pass rate | 100% | CI/CD pipeline |
+| Codex exec latency overhead | < 500ms | Benchmark tests |
+| Documentation completeness | 100% of public APIs | Manual review |
+
+---
+
+## 3. Technical Design
+
+### 3.1 Architecture Overview
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│                           databricks-codex Module                            │
+├─────────────────────────────────────────────────────────────────────────────┤
+│                                                                             │
+│  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐  ┌─────────────────────┐│
+│  │   config    │  │    auth     │  │  executor   │  │     mcp_client      ││
+│  │             │  │             │  │             │  │                     ││
+│  │ - read()    │  │ - check()   │  │ - exec_sync │  │ - connect()         ││
+│  │ - write()   │  │ - login()   │  │ - exec_async│  │ - list_tools()      ││
+│  │ - configure │  │ - logout()  │  │ - get_op()  │  │ - call_tool()       ││
+│  └──────┬──────┘  └──────┬──────┘  └──────┬──────┘  └──────────┬──────────┘│
+│         │                │                │                     │          │
+│         └────────────────┴────────────────┴─────────────────────┘          │
+│                                    │                                        │
+│                           ┌────────▼────────┐                              │
+│                           │     models      │                              │
+│                           │                 │                              │
+│                           │ - Enums         │                              │
+│                           │ - TypedDicts    │                              │
+│                           │ - Pydantic      │                              │
+│                           └─────────────────┘                              │
+│                                                                             │
+└─────────────────────────────────────────────────────────────────────────────┘
+                                     │
+                                     │ subprocess / MCP
+                                     ▼
+                           ┌─────────────────┐
+                           │   Codex CLI     │
+                           │                 │
+                           │ codex exec      │
+                           │ codex mcp-server│
+                           └─────────────────┘
+                                     │
+                                     │ MCP Protocol
+                                     ▼
+                    ┌─────────────────────────────────┐
+                    │     databricks-mcp-server       │
+                    │                                 │
+                    │ 50+ Databricks tools            │
+                    │ - SQL execution                 │
+                    │ - Unity Catalog                 │
+                    │ - Jobs/Pipelines               │
+                    │ - Genie Spaces                  │
+                    └─────────────────────────────────┘
+```
+
+### 3.2 Directory Structure
+
+```
+databricks-codex/
+├── databricks_codex/
+│   ├── __init__.py                 # Package exports
+│   ├── auth.py                     # Codex authentication utilities
+│   ├── config.py                   # TOML configuration management
+│   ├── executor.py                 # codex exec wrapper (sync/async)
+│   ├── mcp_client.py               # Client for Codex-as-MCP-server
+│   ├── session.py                  # Session management
+│   ├── middleware.py               # Timeout/retry handling
+│   └── models.py                   # Data models
+├── tests/
+│   ├── __init__.py
+│   ├── conftest.py                 # Shared fixtures
+│   ├── test_auth.py
+│   ├── test_config.py
+│   ├── test_executor.py
+│   ├── test_mcp_client.py
+│   ├── test_session.py
+│   └── integration/
+│       ├── __init__.py
+│       ├── conftest.py
+│       ├── test_codex_exec.py
+│       └── test_mcp_server.py
+├── docs/
+│   └── API.md                      # API documentation
+├── pyproject.toml
+├── README.md
+└── LICENSE
+```
+
+### 3.3 Component Specifications
+
+#### 3.3.1 Configuration Manager (`config.py`)
+
+```python
+class MCPServerConfig(BaseModel):
+    """MCP server entry in config.toml."""
+    command: str
+    args: List[str] = []
+    env: Dict[str, str] = {}
+
+class CodexConfig(BaseModel):
+    """Full Codex configuration."""
+    mcp_servers: Dict[str, MCPServerConfig] = {}
+
+class CodexConfigManager:
+    """Manage ~/.codex/config.toml and .codex/config.toml."""
+
+    def __init__(self, config_path: Optional[Path] = None, scope: str = "global"):
+        """Initialize with global or project scope."""
+
+    def read(self) -> CodexConfig:
+        """Read and parse TOML configuration."""
+
+    def write(self, config: CodexConfig) -> None:
+        """Write configuration atomically (temp file + rename)."""
+
+    def configure_databricks_mcp(
+        self,
+        profile: str = "DEFAULT",
+        python_path: Optional[str] = None,
+        mcp_entry: Optional[str] = None,
+    ) -> None:
+        """Add/update Databricks MCP server configuration."""
+```
+
+**Key Pattern:** Atomic writes using temp file + `os.replace()` (from `manifest.py`).
+
+#### 3.3.2 Authentication (`auth.py`)
+
+```python
+class CodexAuthMethod(Enum):
+    CHATGPT_OAUTH = "chatgpt"
+    DEVICE_CODE = "device"
+    API_KEY = "api_key"
+    NONE = "none"
+
+@dataclass
+class CodexAuthStatus:
+    method: CodexAuthMethod
+    is_authenticated: bool
+    username: Optional[str] = None
+    error: Optional[str] = None
+
+def check_codex_auth() -> CodexAuthStatus:
+    """Check if Codex CLI is authenticated."""
+
+def login_codex(
+    method: CodexAuthMethod = CodexAuthMethod.CHATGPT_OAUTH,
+    api_key: Optional[str] = None,
+) -> CodexAuthStatus:
+    """Authenticate with Codex CLI."""
+
+def logout_codex() -> bool:
+    """Log out from Codex CLI."""
+
+def get_combined_auth_context() -> Tuple[Optional[str], Optional[str]]:
+    """Get Databricks host/token for injection into Codex environment."""
+```
+
+**Key Pattern:** Uses `databricks_tools_core.auth.get_workspace_client()` for Databricks credentials.
+
+#### 3.3.3 Executor (`executor.py`)
+
+```python
+class SandboxMode(Enum):
+    READ_ONLY = "read-only"
+    WORKSPACE_WRITE = "workspace-write"
+    FULL_ACCESS = "danger-full-access"
+
+class ExecutionStatus(Enum):
+    PENDING = "pending"
+    RUNNING = "running"
+    COMPLETED = "completed"
+    FAILED = "failed"
+    TIMEOUT = "timeout"
+
+@dataclass
+class ExecutionResult:
+    status: ExecutionStatus
+    stdout: str = ""
+    stderr: str = ""
+    exit_code: Optional[int] = None
+    elapsed_seconds: float = 0.0
+    operation_id: Optional[str] = None
+
+class CodexExecOptions(BaseModel):
+    prompt: str
+    working_dir: Optional[Path] = None
+    sandbox_mode: SandboxMode = SandboxMode.READ_ONLY
+    model: Optional[str] = None
+    timeout: int = 300
+    env_vars: Dict[str, str] = {}
+    databricks_profile: str = "DEFAULT"
+    inject_databricks_env: bool = True
+
+class CodexExecutor:
+    """Execute Codex commands with Databricks context."""
+
+    def exec_sync(self, options: CodexExecOptions) -> ExecutionResult:
+        """Execute synchronously."""
+
+    async def exec_async(
+        self,
+        options: CodexExecOptions,
+        callback: Optional[Callable] = None,
+    ) -> ExecutionResult:
+        """Execute asynchronously with timeout handoff at 30s."""
+
+    def get_operation(self, operation_id: str) -> Optional[ExecutionResult]:
+        """Check status of async operation."""
+```
+
+**Key Pattern:** Async handoff at 30s threshold (from `databricks_tools.py`), prevents connection timeouts.
+
+#### 3.3.4 MCP Client (`mcp_client.py`)
+
+```python
+class MCPClientConfig(BaseModel):
+    # Stdio transport
+    command: Optional[str] = None
+    args: List[str] = []
+    env: Dict[str, str] = {}
+    # HTTP transport
+    url: Optional[str] = None
+    bearer_token_env_var: Optional[str] = None
+    timeout: int = 120
+
+@dataclass
+class MCPToolInfo:
+    name: str
+    description: str
+    input_schema: Dict[str, Any] = field(default_factory=dict)
+
+class CodexMCPClient:
+    """Client for Codex running as MCP server."""
+
+    async def connect(self) -> None:
+        """Establish connection (stdio or HTTP)."""
+
+    async def disconnect(self) -> None:
+        """Close connection."""
+
+    async def list_tools(self) -> List[MCPToolInfo]:
+        """List available tools."""
+
+    async def call_tool(
+        self,
+        name: str,
+        arguments: Dict[str, Any],
+        timeout: Optional[int] = None,
+    ) -> Dict[str, Any]:
+        """Call an MCP tool."""
+
+    # Context manager support
+    async def __aenter__(self): ...
+    async def __aexit__(self, *args): ...
+```
+
+**Usage Example:**
+
+```python
+async with CodexMCPClient() as client:
+    tools = await client.list_tools()
+    result = await client.call_tool("generate_code", {"prompt": "Create a Python function"})
+```
+
+#### 3.3.5 Session Manager (`session.py`)
+
+```python
+@dataclass
+class CodexSession:
+    session_id: str
+    created_at: datetime
+    last_activity: Optional[datetime] = None
+    project_dir: Optional[Path] = None
+    metadata: Dict[str, Any] = field(default_factory=dict)
+
+class SessionManager:
+    """Manage Codex CLI sessions."""
+
+    def list_sessions(self, limit: int = 10) -> List[CodexSession]:
+        """List recent sessions."""
+
+    def resume_session(self, session_id: str) -> Optional[str]:
+        """Resume a previous session."""
+
+    def fork_session(
+        self,
+        session_id: str,
+        new_prompt: Optional[str] = None,
+    ) -> Optional[str]:
+        """Fork an existing session."""
+```
+
+### 3.4 Authentication Flow
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│                          Authentication Flow                                 │
+├─────────────────────────────────────────────────────────────────────────────┤
+│                                                                             │
+│  ┌─────────────┐                                                           │
+│  │   User      │                                                           │
+│  └──────┬──────┘                                                           │
+│         │                                                                   │
+│         ▼                                                                   │
+│  ┌─────────────────────────────────────────────────────────────────────┐   │
+│  │                        Codex Authentication                          │   │
+│  │                                                                      │   │
+│  │  Option 1: ChatGPT OAuth (Browser)                                   │   │
+│  │  $ codex login                                                       │   │
+│  │  → Opens browser for ChatGPT sign-in                                 │   │
+│  │                                                                      │   │
+│  │  Option 2: Device Code (Headless)                                    │   │
+│  │  $ codex login --device-auth                                         │   │
+│  │  → Displays code to enter at https://...                             │   │
+│  │                                                                      │   │
+│  │  Option 3: API Key                                                   │   │
+│  │  $ printenv OPENAI_API_KEY | codex login --with-api-key              │   │
+│  │                                                                      │   │
+│  └─────────────────────────────────────────────────────────────────────┘   │
+│         │                                                                   │
+│         ▼                                                                   │
+│  ┌─────────────────┐                                                       │
+│  │ ~/.codex/       │  (Codex credentials stored here)                      │
+│  │ credentials     │                                                       │
+│  └─────────────────┘                                                       │
+│                                                                             │
+│  ┌─────────────────────────────────────────────────────────────────────┐   │
+│  │                      Databricks Integration                          │   │
+│  │                                                                      │   │
+│  │  Priority Order:                                                     │   │
+│  │  1. DATABRICKS_HOST + DATABRICKS_TOKEN (environment)                 │   │
+│  │  2. ~/.databrickscfg [PROFILE] (config file)                         │   │
+│  │  3. OAuth M2M (Databricks Apps)                                      │   │
+│  │                                                                      │   │
+│  │  ┌─────────────────────┐        ┌─────────────────────────────────┐ │   │
+│  │  │ get_workspace_      │  ───▶  │ CodexExecutor._build_env()      │ │   │
+│  │  │ client()            │        │                                 │ │   │
+│  │  │                     │        │ Injects:                        │ │   │
+│  │  │ Returns: host,token │        │ - DATABRICKS_HOST               │ │   │
+│  │  └─────────────────────┘        │ - DATABRICKS_TOKEN              │ │   │
+│  │                                 │ - DATABRICKS_CONFIG_PROFILE     │ │   │
+│  │                                 └─────────────────────────────────┘ │   │
+│  │                                                                      │   │
+│  └─────────────────────────────────────────────────────────────────────┘   │
+│                                                                             │
+└─────────────────────────────────────────────────────────────────────────────┘
+```
+
+### 3.5 Data Flow
+
+```
+┌─────────────────────────────────────────────────────────────────────────────┐
+│                            Data Flow: codex exec                             │
+├─────────────────────────────────────────────────────────────────────────────┤
+│                                                                             │
+│  Python Application                                                         │
+│  ┌─────────────────────────────────────────────────────────────────────┐   │
+│  │                                                                      │   │
+│  │  options = CodexExecOptions(                                         │   │
+│  │      prompt="Create a Delta table from JSON files",                  │   │
+│  │      sandbox_mode=SandboxMode.WORKSPACE_WRITE,                       │   │
+│  │      inject_databricks_env=True,                                     │   │
+│  │  )                                                                   │   │
+│  │                                                                      │   │
+│  │  result = executor.exec_sync(options)                                │   │
+│  │                                                                      │   │
+│  └─────────────────────────────────────────────────────────────────────┘   │
+│         │                                                                   │
+│         │ 1. Build command                                                  │
+│         │ 2. Build environment (inject Databricks creds)                    │
+│         │ 3. subprocess.run()                                               │
+│         ▼                                                                   │
+│  ┌─────────────────────────────────────────────────────────────────────┐   │
+│  │  codex exec                                                          │   │
+│  │      --sandbox workspace-write                                       │   │
+│  │      "Create a Delta table from JSON files"                          │   │
+│  │                                                                      │   │
+│  │  Environment:                                                        │   │
+│  │      DATABRICKS_HOST=https://xxx.cloud.databricks.com                │   │
+│  │      DATABRICKS_TOKEN=dapi...                                        │   │
+│  │      DATABRICKS_CONFIG_PROFILE=DEFAULT                               │   │
+│  │                                                                      │   │
+│  └─────────────────────────────────────────────────────────────────────┘   │
+│         │                                                                   │
+│         │ 4. Codex processes prompt                                         │
+│         │ 5. Uses MCP tools if configured                                   │
+│         ▼                                                                   │
+│  ┌─────────────────────────────────────────────────────────────────────┐   │
+│  │  databricks-mcp-server (via MCP protocol)                            │   │
+│  │                                                                      │   │
+│  │  Tools called:                                                       │   │
+│  │  - execute_sql("CREATE TABLE...")                                    │   │
+│  │  - list_volumes("/Volumes/...")                                      │   │
+│  │                                                                      │   │
+│  └─────────────────────────────────────────────────────────────────────┘   │
+│         │                                                                   │
+│         │ 6. Return stdout/stderr                                           │
+│         ▼                                                                   │
+│  ┌─────────────────────────────────────────────────────────────────────┐   │
+│  │  ExecutionResult(                                                    │   │
+│  │      status=ExecutionStatus.COMPLETED,                               │   │
+│  │      stdout="Created table `catalog.schema.my_table`...",            │   │
+│  │      stderr="",                                                      │   │
+│  │      exit_code=0,                                                    │   │
+│  │      elapsed_seconds=12.5,                                           │   │
+│  │  )                                                                   │   │
+│  │                                                                      │   │
+│  └─────────────────────────────────────────────────────────────────────┘   │
+│                                                                             │
+└─────────────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## 4. Implementation Plan
+
+### Phase 1: Foundation (Week 1)
+
+| Task | Description | Output |
+|------|-------------|--------|
+| 1.1 | Create package structure | `databricks-codex/` directory |
+| 1.2 | Implement `models.py` | Enums, TypedDicts, Pydantic models |
+| 1.3 | Implement `config.py` | TOML read/write with atomic writes |
+| 1.4 | Write unit tests for config | `test_config.py` |
+
+### Phase 2: Authentication (Week 2)
+
+| Task | Description | Output |
+|------|-------------|--------|
+| 2.1 | Implement `auth.py` | Auth status, login, logout |
+| 2.2 | Integrate with databricks-tools-core | `get_combined_auth_context()` |
+| 2.3 | Write unit tests for auth | `test_auth.py` |
+
+### Phase 3: Executor (Week 2-3)
+
+| Task | Description | Output |
+|------|-------------|--------|
+| 3.1 | Implement `executor.py` sync | `exec_sync()` |
+| 3.2 | Implement `executor.py` async | `exec_async()` with timeout handoff |
+| 3.3 | Add operation tracking | `get_operation()` |
+| 3.4 | Write unit tests | `test_executor.py` |
+
+### Phase 4: MCP Client (Week 3-4)
+
+| Task | Description | Output |
+|------|-------------|--------|
+| 4.1 | Implement stdio transport | `_send_stdio()` |
+| 4.2 | Implement HTTP transport | `_send_http()` |
+| 4.3 | Implement tool listing/calling | `list_tools()`, `call_tool()` |
+| 4.4 | Write unit tests | `test_mcp_client.py` |
+
+### Phase 5: Session Management (Week 4)
+
+| Task | Description | Output |
+|------|-------------|--------|
+| 5.1 | Implement `session.py` | List, resume, fork |
+| 5.2 | Write unit tests | `test_session.py` |
+
+### Phase 6: Integration Testing (Week 5)
+
+| Task | Description | Output |
+|------|-------------|--------|
+| 6.1 | Set up integration test fixtures | `integration/conftest.py` |
+| 6.2 | Write Codex exec integration tests | `test_codex_exec.py` |
+| 6.3 | Write MCP server integration tests | `test_mcp_server.py` |
+
+### Phase 7: Documentation & Polish (Week 6)
+
+| Task | Description | Output |
+|------|-------------|--------|
+| 7.1 | Write README.md | Installation, usage examples |
+| 7.2 | Write API documentation | `docs/API.md` |
+| 7.3 | Update `install.sh` | Add databricks-codex to installation |
+| 7.4 | Create pyproject.toml | Package metadata and dependencies |
+
+---
+
+## 5. Non-Functional Requirements
+
+### 5.1 Performance
+
+| Metric | Target | Rationale |
+|--------|--------|-----------|
+| Codex exec startup overhead | < 500ms | Minimal wrapper overhead |
+| MCP tool call latency | < 100ms | Excluding Codex processing time |
+| Config read/write | < 50ms | Atomic writes via temp file |
+| Memory footprint | < 100MB | Per executor instance |
+| Async handoff threshold | 30s | Prevent API connection timeouts (50s limit) |
+
+### 5.2 Security
+
+| Requirement | Implementation |
+|-------------|----------------|
+| **No credential storage in config** | Databricks tokens via environment or databrickscfg profile only |
+| **Default sandbox mode** | `read-only` by default, require explicit opt-in for write access |
+| **Token injection safety** | Inject to subprocess only, never log credentials |
+| **TOML validation** | Validate before writing to prevent injection attacks |
+| **Subprocess isolation** | Use `subprocess.run()` with explicit environment, no shell=True |
+
+### 5.3 Reliability
+
+| Requirement | Implementation |
+|-------------|----------------|
+| **Atomic config writes** | Use temp file + `os.replace()` pattern from `manifest.py` |
+| **Timeout handling** | Match `TimeoutHandlingMiddleware` pattern for graceful degradation |
+| **Retry logic** | Exponential backoff for transient failures (network, rate limits) |
+| **Cleanup on shutdown** | Close subprocess handles and MCP connections |
+| **Graceful degradation** | Return structured errors, don't crash on Codex CLI issues |
+
+### 5.4 Observability
+
+| Requirement | Implementation |
+|-------------|----------------|
+| **Structured logging** | Use `logging.getLogger(__name__)` throughout |
+| **Operation IDs** | UUID-based IDs for async operation tracking |
+| **Metrics** | Execution times, success/failure rates |
+| **Error context** | Include command, arguments, and environment in error messages |
+
+### 5.5 Compatibility
+
+| Requirement | Implementation |
+|-------------|----------------|
+| **Python versions** | 3.9+ (match databricks-tools-core) |
+| **Codex CLI versions** | Version detection, warn on incompatible |
+| **Platform support** | macOS, Linux; Windows experimental (WSL) |
+| **MCP protocol** | Version negotiation in handshake |
+
+---
+
+## 6. Gap Analysis
+
+### 6.1 Technical Gaps
+
+| Gap | Impact | Severity | Mitigation |
+|-----|--------|----------|------------|
+| **No Python SDK for Codex** | Must shell out via subprocess | High | Design clean subprocess wrapper with proper error handling |
+| **No MCP client library** | Must implement JSON-RPC from scratch | Medium | Use httpx for HTTP, asyncio.subprocess for stdio |
+| **Undocumented output format** | Parsing may break on updates | Medium | Version detection, graceful degradation, structured output flags |
+| **Session persistence unclear** | May lose session state | Low | Implement local session tracking as backup |
+| **Codex not bundled** | External dependency | Medium | Clear installation docs, version checking, helpful error messages |
+
+### 6.2 API Gaps
+
+| Gap | Description | Workaround |
+|-----|-------------|------------|
+| **No `codex status` command** | Can't programmatically check auth | Use `codex --version` + try simple exec |
+| **No streaming output** | Output only available after completion | Poll for long operations, use `--json` flag |
+| **No cancel operation** | Can't stop running exec | Kill subprocess, implement timeout |
+
+### 6.3 Documentation Gaps
+
+| Gap | Description | Action Required |
+|-----|-------------|-----------------|
+| **Session format** | Session storage format not documented | Reverse engineer from ~/.codex directory |
+| **MCP server output** | Response format for `codex mcp-server` | Test and document |
+| **Error codes** | Exit code meanings not documented | Catalog through testing |
+
+### 6.4 Dependencies and Risks
+
+| Dependency | Risk Level | Mitigation |
+|------------|------------|------------|
+| **Codex CLI binary** | Medium | Must be installed separately; provide clear docs |
+| **OpenAI API availability** | Low | Timeout handling, retry logic |
+| **Databricks SDK** | Low | Pin version, integration tests |
+| **MCP protocol stability** | Medium | Version negotiation, graceful degradation |
+
+### 6.5 Future Enhancements
+
+| Enhancement | Description | Priority |
+|-------------|-------------|----------|
+| **Databricks Apps integration** | Run Codex as a Databricks App with OAuth M2M | P2 |
+| **Vector Search context** | Use Databricks Vector Search for prompt context | P2 |
+| **Unity Catalog discovery** | Auto-discover schemas/tables for context | P2 |
+| **MLflow tracing** | Trace Codex operations through MLflow | P3 |
+| **Genie enhancement** | Use Codex to enhance Genie queries | P3 |
+
+---
+
+## 7. Testing Strategy
+
+### 7.1 Unit Testing
+
+**Framework:** pytest with pytest-asyncio
+
+**Mocking Strategy:**
+
+```python
+# Mock subprocess.run for CLI calls
+@pytest.fixture
+def mock_subprocess_run():
+    with patch("subprocess.run") as mock_run:
+        mock_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Success",
+            stderr="",
+        )
+        yield mock_run
+
+# Mock async calls
+@pytest.fixture
+def mock_async_exec():
+    return AsyncMock(return_value=ExecutionResult(
+        status=ExecutionStatus.COMPLETED,
+        stdout="Done",
+    ))
+```
+
+**Test Categories:**
+
+| Category | Tests | Example |
+|----------|-------|---------|
+| Config | Read/write TOML, atomic writes | `test_write_and_read_roundtrip` |
+| Auth | Status check, login flows | `test_check_auth_not_installed` |
+| Executor | Sync/async exec, timeout | `test_exec_sync_success` |
+| MCP | Connect, list, call tools | `test_list_tools` |
+| Session | List, resume, fork | `test_list_sessions` |
+
+### 7.2 Integration Testing
+
+**Prerequisites:**
+- Codex CLI installed and authenticated
+- Databricks connection configured (profile or environment)
+
+**Fixtures:**
+
+```python
+@pytest.fixture(scope="session")
+def codex_authenticated():
+    """Skip if Codex not authenticated."""
+    status = check_codex_auth()
+    if not status.is_authenticated:
+        pytest.skip(f"Codex not authenticated: {status.error}")
+    return status
+
+@pytest.fixture(scope="session")
+def databricks_connected():
+    """Skip if Databricks not configured."""
+    try:
+        client = get_workspace_client()
+        client.current_user.me()
+        return client
+    except Exception as e:
+        pytest.skip(f"Databricks not configured: {e}")
+```
+
+**Test Categories:**
+
+| Category | Tests | Example |
+|----------|-------|---------|
+| Codex Exec | Simple prompt, Databricks context | `test_exec_with_databricks_context` |
+| Async Exec | Short operations, long with handoff | `test_exec_async_long_running` |
+| MCP Server | Connect, list tools, call | `test_connect_to_codex_mcp` |
+
+### 7.3 Test Coverage Targets
+
+| Module | Target Coverage |
+|--------|-----------------|
+| `config.py` | 90% |
+| `auth.py` | 85% |
+| `executor.py` | 85% |
+| `mcp_client.py` | 80% |
+| `session.py` | 80% |
+| **Overall** | **> 80%** |
+
+---
+
+## 8. Appendix: Reference Code Patterns
+
+### 8.1 Atomic Write Pattern (from manifest.py)
+
+```python
+import tempfile
+import os
+
+def atomic_write(path: Path, data: dict) -> None:
+    """Write file atomically to prevent corruption."""
+    path.parent.mkdir(parents=True, exist_ok=True)
+
+    fd, tmp_path = tempfile.mkstemp(
+        dir=path.parent,
+        prefix=".tmp-",
+        suffix=path.suffix,
+    )
+    try:
+        with os.fdopen(fd, "w") as f:
+            json.dump(data, f, indent=2)
+        os.replace(tmp_path, path)
+    except Exception:
+        try:
+            os.unlink(tmp_path)
+        except OSError:
+            pass
+        raise
+```
+
+### 8.2 Timeout Middleware Pattern (from middleware.py)
+
+```python
+class TimeoutHandlingMiddleware:
+    """Convert TimeoutError to structured result instead of exception."""
+
+    async def on_call_tool(self, context, call_next):
+        try:
+            return await call_next(context)
+        except TimeoutError as e:
+            return ToolResult(
+                content=[TextContent(
+                    type="text",
+                    text=json.dumps({
+                        "timed_out": True,
+                        "tool": context.tool_name,
+                        "message": str(e),
+                        "action_required": "Operation may still be in progress. Do NOT retry."
+                    })
+                )]
+            )
+```
+
+### 8.3 Async Handoff Pattern (from databricks_tools.py)
+
+```python
+SAFE_EXECUTION_THRESHOLD = 30  # seconds
+
+async def execute_with_handoff(func, *args, **kwargs):
+    """Execute with async handoff for long operations."""
+    start_time = time.time()
+    ctx = copy_context()
+
+    def run_in_context():
+        return ctx.run(func, *args, **kwargs)
+
+    loop = asyncio.get_event_loop()
+    future = loop.run_in_executor(executor, run_in_context)
+
+    HEARTBEAT_INTERVAL = 10
+
+    while True:
+        try:
+            return await asyncio.wait_for(
+                asyncio.shield(future),
+                timeout=HEARTBEAT_INTERVAL,
+            )
+        except asyncio.TimeoutError:
+            elapsed = time.time() - start_time
+            if elapsed > SAFE_EXECUTION_THRESHOLD:
+                # Hand off to background
+                op_id = str(uuid.uuid4())[:8]
+                threading.Thread(
+                    target=lambda: track_completion(op_id, future),
+                    daemon=True,
+                ).start()
+                return {"status": "running", "operation_id": op_id}
+```
+
+### 8.4 Manager Pattern (from agent_bricks/manager.py)
+
+```python
+class AgentBricksManager:
+    """Unified manager for resource lifecycle."""
+
+    def __init__(
+        self,
+        client: Optional[WorkspaceClient] = None,
+        default_timeout_s: int = 600,
+        default_poll_s: float = 2.0,
+    ):
+        self.w = client or get_workspace_client()
+        self.default_timeout_s = default_timeout_s
+        self.default_poll_s = default_poll_s
+
+    def _get(self, path: str, params: Optional[Dict] = None) -> Dict:
+        """GET with auth headers."""
+        headers = self.w.config.authenticate()
+        response = requests.get(
+            f"{self.w.config.host}{path}",
+            headers=headers,
+            params=params or {},
+            timeout=20,
+        )
+        if response.status_code >= 400:
+            self._handle_response_error(response, "GET", path)
+        return response.json()
+
+    def wait_until_ready(self, resource_id: str, timeout_s: int) -> Dict:
+        """Poll until resource is ready."""
+        deadline = time.time() + timeout_s
+        last_status = None
+
+        while True:
+            resource = self._get(f"/api/resource/{resource_id}")
+            status = resource.get("status")
+
+            if status != last_status:
+                logger.info(f"Status: {last_status} -> {status}")
+                last_status = status
+
+            if status == "READY":
+                return resource
+
+            if time.time() >= deadline:
+                raise TimeoutError(f"Not ready within {timeout_s}s")
+
+            time.sleep(self.default_poll_s)
+```
+
+### 8.5 Unit Test with Mocks (from test_middleware.py)
+
+```python
+from unittest.mock import AsyncMock, MagicMock
+import pytest
+
+@pytest.fixture
+def middleware():
+    return TimeoutHandlingMiddleware()
+
+def _make_context(tool_name="test_tool", arguments=None):
+    ctx = MagicMock()
+    ctx.message.name = tool_name
+    ctx.message.arguments = arguments or {}
+    return ctx
+
+@pytest.mark.asyncio
+async def test_timeout_returns_structured_result(middleware):
+    """TimeoutError is caught and converted to structured JSON."""
+    call_next = AsyncMock(side_effect=TimeoutError("Timed out"))
+    ctx = _make_context(tool_name="slow_operation")
+
+    result = await middleware.on_call_tool(ctx, call_next)
+
+    assert result is not None
+    payload = json.loads(result.content[0].text)
+    assert payload["timed_out"] is True
+    assert payload["tool"] == "slow_operation"
+```
+
+---
+
+## Summary
+
+This document provides a complete blueprint for implementing the `databricks-codex` integration plugin:
+
+1. **PRD** defines what we're building and why
+2. **Technical Design** specifies how each component works
+3. **Implementation Plan** breaks work into weekly phases
+4. **NFRs** establish quality standards
+5. **Gap Analysis** identifies risks and mitigations
+6. **Testing Strategy** ensures quality
+
+**Next Steps:**
+1. Review and approve this plan
+2. Create the `databricks-codex/` directory structure
+3. Begin Phase 1: Foundation implementation
diff --git a/databricks-codex/README.md b/databricks-codex/README.md
new file mode 100644
index 00000000..243a758f
--- /dev/null
+++ b/databricks-codex/README.md
@@ -0,0 +1,265 @@
+# Databricks Codex Integration
+
+Python SDK for integrating OpenAI Codex CLI with Databricks.
+
+## Features
+
+- **Configuration Management** - Programmatic control of `~/.codex/config.toml`
+- **Authentication** - Bridge Databricks credentials into Codex environment
+- **Executor** - Sync/async execution of `codex exec` with timeout handling
+- **MCP Client** - Connect to Codex running as MCP server
+- **Session Management** - Resume and fork Codex sessions
+
+## Installation
+
+```bash
+# Basic installation
+pip install databricks-codex
+
+# With Databricks SDK support
+pip install databricks-codex[databricks]
+
+# With HTTP transport for MCP
+pip install databricks-codex[http]
+
+# All optional dependencies
+pip install databricks-codex[all]
+
+# Development dependencies
+pip install databricks-codex[dev]
+```
+
+## Prerequisites
+
+- Python 3.9+
+- Codex CLI installed: `npm i -g @openai/codex`
+- Codex authenticated: `codex login`
+- (Optional) Databricks CLI configured for credential injection
+
+## Quick Start
+
+### Install Skills + MCP (Codex Session)
+
+```bash
+# From ai-dev-kit repo root
+bash databricks-codex/scripts/install_codex_skills_and_mcp.sh
+# or override profile explicitly
+bash databricks-codex/scripts/install_codex_skills_and_mcp.sh --profile my-profile
+```
+
+This installs Databricks skills into `~/.codex/skills` and writes a project MCP entry to `.codex/config.toml` for `databricks-mcp-server`.
+By default it uses the first profile from `databricks auth profiles`.
+Restart Codex after running it.
+
+### Run It (End-to-End)
+
+```bash
+# from ai-dev-kit repo root
+source ./.databricks_cfg  # or export DATABRICKS_HOST / DATABRICKS_TOKEN
+
+# install skills + MCP wiring
+bash databricks-codex/scripts/install_codex_skills_and_mcp.sh
+
+# run the installer + dashboard-prompt integration tests
+cd databricks-codex
+uv run pytest tests/integration/test_install_and_dashboard_prompt.py -vv
+```
+
+What this validates:
+- installer copies skills into `~/.codex/skills`
+- project MCP config is written to `.codex/config.toml`
+- Codex can execute a prompt that generates dashboard SQL
+
+### Configuration
+
+```python
+from databricks_codex import CodexConfigManager
+
+# Configure Databricks MCP server
+manager = CodexConfigManager()
+manager.configure_databricks_mcp(profile="DEFAULT")
+
+# Check configuration
+if manager.has_databricks_mcp():
+    config = manager.get_databricks_mcp_config()
+    print(f"MCP server: {config.command}")
+```
+
+### Authentication
+
+```python
+from databricks_codex import check_codex_auth, login_codex, CodexAuthMethod
+
+# Check auth status
+status = check_codex_auth()
+if status.is_authenticated:
+    print(f"Logged in via {status.method.value}")
+else:
+    # Login with device code (for headless environments)
+    login_codex(method=CodexAuthMethod.DEVICE_CODE)
+```
+
+### Executor
+
+```python
+from databricks_codex import CodexExecutor, CodexExecOptions, SandboxMode
+
+# Create executor
+executor = CodexExecutor()
+
+# Synchronous execution
+options = CodexExecOptions(
+    prompt="Create a Python function to calculate factorial",
+    sandbox_mode=SandboxMode.READ_ONLY,
+    timeout=60,
+)
+result = executor.exec_sync(options)
+print(result.stdout)
+
+# Async execution with Databricks context
+import asyncio
+
+async def main():
+    options = CodexExecOptions(
+        prompt="Query the customers table",
+        inject_databricks_env=True,
+        databricks_profile="PROD",
+    )
+    result = await executor.exec_async(options)
+
+    if result.operation_id:
+        # Long-running operation handed off
+        print(f"Operation {result.operation_id} running in background")
+    else:
+        print(result.stdout)
+
+asyncio.run(main())
+```
+
+### MCP Client
+
+```python
+from databricks_codex import CodexMCPClient
+import asyncio
+
+async def main():
+    async with CodexMCPClient() as client:
+        # List available tools
+        tools = await client.list_tools()
+        for tool in tools:
+            print(f"{tool.name}: {tool.description}")
+
+        # Call a tool
+        result = await client.call_tool(
+            "generate_code",
+            {"prompt": "Create a hello world function"}
+        )
+        print(result)
+
+asyncio.run(main())
+```
+
+### Session Management
+
+```python
+from databricks_codex import SessionManager
+
+manager = SessionManager()
+
+# List recent sessions
+sessions = manager.list_sessions(limit=5)
+for session in sessions:
+    print(f"{session.session_id}: {session.created_at}")
+
+# Resume last session
+manager.resume_session(last=True)
+
+# Fork a session
+new_id = manager.fork_session(
+    "original-session-id",
+    new_prompt="Continue but focus on error handling"
+)
+```
+
+## API Reference
+
+### Models
+
+| Class | Description |
+|-------|-------------|
+| `SandboxMode` | Enum: READ_ONLY, WORKSPACE_WRITE, FULL_ACCESS |
+| `ExecutionStatus` | Enum: PENDING, RUNNING, COMPLETED, FAILED, TIMEOUT |
+| `CodexAuthMethod` | Enum: CHATGPT_OAUTH, DEVICE_CODE, API_KEY, NONE |
+| `CodexExecOptions` | Pydantic model for executor options |
+| `ExecutionResult` | Dataclass for execution results |
+| `MCPToolInfo` | Dataclass for MCP tool metadata |
+
+### Configuration
+
+| Class/Function | Description |
+|----------------|-------------|
+| `CodexConfigManager` | Manage `~/.codex/config.toml` |
+| `MCPServerConfig` | MCP server configuration model |
+| `CodexConfig` | Full configuration model |
+
+### Authentication
+
+| Function | Description |
+|----------|-------------|
+| `check_codex_auth()` | Check Codex authentication status |
+| `login_codex()` | Authenticate with Codex CLI |
+| `logout_codex()` | Log out from Codex CLI |
+| `get_combined_auth_context()` | Get Databricks credentials |
+| `get_databricks_env()` | Get env vars for Databricks |
+
+### Executor
+
+| Class/Method | Description |
+|--------------|-------------|
+| `CodexExecutor` | Execute Codex commands |
+| `exec_sync()` | Synchronous execution |
+| `exec_async()` | Async execution with timeout handling |
+| `get_operation()` | Check async operation status |
+
+### MCP Client
+
+| Class/Method | Description |
+|--------------|-------------|
+| `CodexMCPClient` | MCP protocol client |
+| `connect()` | Establish connection |
+| `list_tools()` | List available tools |
+| `call_tool()` | Execute a tool |
+
+### Session Management
+
+| Class/Method | Description |
+|--------------|-------------|
+| `SessionManager` | Manage Codex sessions |
+| `list_sessions()` | List recent sessions |
+| `resume_session()` | Resume a session |
+| `fork_session()` | Fork a session |
+
+## Testing
+
+```bash
+# Export Databricks env vars for integration tests
+export DATABRICKS_HOST="https://<your-workspace-host>"
+export DATABRICKS_TOKEN="<your-pat>"
+# or: source ./.databricks_cfg
+
+# Run unit tests
+pytest tests/ -v --ignore=tests/integration
+
+# Run integration tests (requires Codex + Databricks)
+pytest tests/integration/ -v -m integration
+
+# Run installer + dashboard prompt tests only
+pytest tests/integration/test_install_and_dashboard_prompt.py -vv
+
+# Run with coverage
+pytest tests/ --cov=databricks_codex --cov-report=html
+```
+
+## License
+
+Apache-2.0
diff --git a/databricks-codex/databricks_codex/__init__.py b/databricks-codex/databricks_codex/__init__.py
new file mode 100644
index 00000000..69398add
--- /dev/null
+++ b/databricks-codex/databricks_codex/__init__.py
@@ -0,0 +1,65 @@
+"""Databricks Codex Integration - Python SDK for OpenAI Codex CLI with Databricks.
+
+This package provides programmatic access to Codex CLI capabilities with
+Databricks authentication and tool integration.
+"""
+
+from databricks_codex.models import (
+    CodexToolCategory,
+    TransportType,
+    SandboxMode,
+    ExecutionStatus,
+    CodexAuthMethod,
+    ExecutionResult,
+    CodexExecOptions,
+    CodexResponse,
+    MCPToolInfo,
+)
+from databricks_codex.config import (
+    MCPServerConfig,
+    CodexConfig,
+    CodexConfigManager,
+)
+from databricks_codex.auth import (
+    CodexAuthStatus,
+    check_codex_auth,
+    login_codex,
+    logout_codex,
+    get_combined_auth_context,
+)
+from databricks_codex.executor import CodexExecutor
+from databricks_codex.mcp_client import CodexMCPClient, MCPClientConfig
+from databricks_codex.session import CodexSession, SessionManager
+
+__version__ = "0.1.0"
+
+__all__ = [
+    # Models
+    "CodexToolCategory",
+    "TransportType",
+    "SandboxMode",
+    "ExecutionStatus",
+    "CodexAuthMethod",
+    "ExecutionResult",
+    "CodexExecOptions",
+    "CodexResponse",
+    "MCPToolInfo",
+    # Config
+    "MCPServerConfig",
+    "CodexConfig",
+    "CodexConfigManager",
+    # Auth
+    "CodexAuthStatus",
+    "check_codex_auth",
+    "login_codex",
+    "logout_codex",
+    "get_combined_auth_context",
+    # Executor
+    "CodexExecutor",
+    # MCP Client
+    "CodexMCPClient",
+    "MCPClientConfig",
+    # Session
+    "CodexSession",
+    "SessionManager",
+]
diff --git a/databricks-codex/databricks_codex/auth.py b/databricks-codex/databricks_codex/auth.py
new file mode 100644
index 00000000..66134f6c
--- /dev/null
+++ b/databricks-codex/databricks_codex/auth.py
@@ -0,0 +1,267 @@
+"""Authentication utilities for Codex CLI integration.
+
+Provides utilities to check Codex authentication status, manage login/logout,
+and bridge Databricks credentials into Codex environment.
+"""
+
+import logging
+import os
+import subprocess
+from dataclasses import dataclass
+from typing import Optional, Tuple
+
+from databricks_codex.models import CodexAuthMethod
+
+logger = logging.getLogger(__name__)
+
+
+@dataclass
+class CodexAuthStatus:
+    """Current Codex authentication status."""
+
+    method: CodexAuthMethod
+    is_authenticated: bool
+    username: Optional[str] = None
+    error: Optional[str] = None
+
+
+def check_codex_auth() -> CodexAuthStatus:
+    """Check current Codex CLI authentication status.
+
+    Returns:
+        CodexAuthStatus with current authentication state.
+
+    Example:
+        >>> status = check_codex_auth()
+        >>> if status.is_authenticated:
+        ...     print(f"Logged in as {status.username}")
+    """
+    try:
+        # Check if codex CLI exists and is functional
+        result = subprocess.run(
+            ["codex", "--version"],
+            capture_output=True,
+            text=True,
+            timeout=10,
+        )
+        if result.returncode != 0:
+            return CodexAuthStatus(
+                method=CodexAuthMethod.NONE,
+                is_authenticated=False,
+                error=f"Codex CLI error: {result.stderr.strip()}",
+            )
+
+        # Try a simple operation to verify auth
+        # Note: This is a heuristic since Codex doesn't have a status command
+        # We assume if --version works, the CLI is installed
+        # Full auth check would require attempting an actual operation
+
+        return CodexAuthStatus(
+            method=CodexAuthMethod.CHATGPT_OAUTH,  # Assume OAuth if working
+            is_authenticated=True,
+        )
+
+    except FileNotFoundError:
+        return CodexAuthStatus(
+            method=CodexAuthMethod.NONE,
+            is_authenticated=False,
+            error="Codex CLI not found. Install from: npm i -g @openai/codex",
+        )
+    except subprocess.TimeoutExpired:
+        return CodexAuthStatus(
+            method=CodexAuthMethod.NONE,
+            is_authenticated=False,
+            error="Codex CLI timed out",
+        )
+    except Exception as e:
+        return CodexAuthStatus(
+            method=CodexAuthMethod.NONE,
+            is_authenticated=False,
+            error=str(e),
+        )
+
+
+def login_codex(
+    method: CodexAuthMethod = CodexAuthMethod.CHATGPT_OAUTH,
+    api_key: Optional[str] = None,
+) -> CodexAuthStatus:
+    """Authenticate with Codex CLI.
+
+    Args:
+        method: Authentication method to use
+        api_key: API key if using API_KEY method
+
+    Returns:
+        Updated authentication status
+
+    Example:
+        >>> status = login_codex(method=CodexAuthMethod.DEVICE_CODE)
+        >>> print(f"Auth status: {status.is_authenticated}")
+    """
+    cmd = ["codex", "login"]
+
+    if method == CodexAuthMethod.DEVICE_CODE:
+        cmd.append("--device-auth")
+
+    try:
+        if method == CodexAuthMethod.API_KEY:
+            if not api_key:
+                return CodexAuthStatus(
+                    method=CodexAuthMethod.NONE,
+                    is_authenticated=False,
+                    error="API key required for API_KEY auth method",
+                )
+            # Pipe API key to stdin
+            result = subprocess.run(
+                cmd + ["--with-api-key"],
+                input=api_key,
+                capture_output=True,
+                text=True,
+                timeout=60,
+            )
+        else:
+            # Interactive auth (opens browser for OAuth)
+            logger.info(f"Starting Codex login with method: {method.value}")
+            result = subprocess.run(
+                cmd,
+                capture_output=True,
+                text=True,
+                timeout=180,  # Longer timeout for OAuth flow
+            )
+
+        if result.returncode == 0:
+            logger.info("Codex login successful")
+            return check_codex_auth()
+        else:
+            error_msg = result.stderr.strip() or result.stdout.strip() or "Login failed"
+            logger.error(f"Codex login failed: {error_msg}")
+            return CodexAuthStatus(
+                method=CodexAuthMethod.NONE,
+                is_authenticated=False,
+                error=error_msg,
+            )
+
+    except subprocess.TimeoutExpired:
+        return CodexAuthStatus(
+            method=CodexAuthMethod.NONE,
+            is_authenticated=False,
+            error="Login timed out - user may need to complete browser auth",
+        )
+    except FileNotFoundError:
+        return CodexAuthStatus(
+            method=CodexAuthMethod.NONE,
+            is_authenticated=False,
+            error="Codex CLI not found",
+        )
+    except Exception as e:
+        logger.exception("Codex login error")
+        return CodexAuthStatus(
+            method=CodexAuthMethod.NONE,
+            is_authenticated=False,
+            error=str(e),
+        )
+
+
+def logout_codex() -> bool:
+    """Log out from Codex CLI.
+
+    Returns:
+        True if logout succeeded
+
+    Example:
+        >>> if logout_codex():
+        ...     print("Logged out successfully")
+    """
+    try:
+        result = subprocess.run(
+            ["codex", "logout"],
+            capture_output=True,
+            text=True,
+            timeout=30,
+        )
+        if result.returncode == 0:
+            logger.info("Codex logout successful")
+            return True
+        else:
+            logger.warning(f"Codex logout returned: {result.stderr}")
+            return False
+    except Exception as e:
+        logger.error(f"Codex logout error: {e}")
+        return False
+
+
+def get_combined_auth_context() -> Tuple[Optional[str], Optional[str]]:
+    """Get Databricks auth from environment or profile.
+
+    Used to pass Databricks credentials to Codex exec sessions.
+
+    Priority:
+    1. DATABRICKS_HOST + DATABRICKS_TOKEN environment variables
+    2. databricks-tools-core get_workspace_client()
+    3. (None, None) if not configured
+
+    Returns:
+        Tuple of (host, token) or (None, None) if not configured
+
+    Example:
+        >>> host, token = get_combined_auth_context()
+        >>> if host:
+        ...     print(f"Connected to {host}")
+    """
+    # First try environment variables
+    host = os.environ.get("DATABRICKS_HOST")
+    token = os.environ.get("DATABRICKS_TOKEN")
+
+    if host and token:
+        return (host, token)
+
+    # Try to get from databricks-tools-core
+    try:
+        from databricks_tools_core.auth import get_workspace_client
+
+        client = get_workspace_client()
+        return (client.config.host, client.config.token)
+    except ImportError:
+        logger.debug("databricks-tools-core not available")
+    except Exception as e:
+        logger.debug(f"Could not get Databricks credentials: {e}")
+
+    # Try databricks-sdk directly
+    try:
+        from databricks.sdk import WorkspaceClient
+
+        client = WorkspaceClient()
+        return (client.config.host, client.config.token)
+    except ImportError:
+        logger.debug("databricks-sdk not available")
+    except Exception as e:
+        logger.debug(f"Could not get Databricks credentials from SDK: {e}")
+
+    return (None, None)
+
+
+def get_databricks_env(profile: str = "DEFAULT") -> dict:
+    """Get environment variables for Databricks integration.
+
+    Args:
+        profile: Databricks config profile to use
+
+    Returns:
+        Dictionary of environment variables to inject
+
+    Example:
+        >>> env = get_databricks_env(profile="PROD")
+        >>> subprocess.run(["codex", "exec", "..."], env={**os.environ, **env})
+    """
+    env: dict = {}
+
+    host, token = get_combined_auth_context()
+    if host:
+        env["DATABRICKS_HOST"] = host
+    if token:
+        env["DATABRICKS_TOKEN"] = token
+
+    if profile != "DEFAULT":
+        env["DATABRICKS_CONFIG_PROFILE"] = profile
+
+    return env
diff --git a/databricks-codex/databricks_codex/config.py b/databricks-codex/databricks_codex/config.py
new file mode 100644
index 00000000..7f5219e0
--- /dev/null
+++ b/databricks-codex/databricks_codex/config.py
@@ -0,0 +1,241 @@
+"""Configuration management for Codex integration.
+
+Handles reading/writing ~/.codex/config.toml with Databricks MCP server configuration.
+Supports profile management and environment variable handling.
+"""
+
+import os
+import sys
+import tempfile
+from pathlib import Path
+from typing import Any, Dict, List, Optional
+
+from pydantic import BaseModel, Field
+
+# Python 3.11+ has tomllib built-in
+if sys.version_info >= (3, 11):
+    import tomllib
+else:
+    try:
+        import tomli as tomllib
+    except ImportError:
+        tomllib = None  # type: ignore
+
+try:
+    import tomli_w
+except ImportError:
+    tomli_w = None  # type: ignore
+
+
+class MCPServerConfig(BaseModel):
+    """MCP server entry in config.toml."""
+
+    command: str = Field(..., description="Command to run the MCP server")
+    args: List[str] = Field(default_factory=list, description="Command arguments")
+    env: Dict[str, str] = Field(default_factory=dict, description="Environment variables")
+
+
+class CodexConfig(BaseModel):
+    """Full Codex configuration model."""
+
+    mcp_servers: Dict[str, MCPServerConfig] = Field(
+        default_factory=dict, description="MCP server configurations"
+    )
+
+
+class CodexConfigManager:
+    """Manage Codex configuration files.
+
+    Supports both global (~/.codex/config.toml) and project-local
+    (.codex/config.toml) configurations.
+
+    Example:
+        >>> manager = CodexConfigManager()
+        >>> config = manager.read()
+        >>> manager.configure_databricks_mcp(profile="PROD")
+    """
+
+    DEFAULT_GLOBAL_PATH = Path.home() / ".codex" / "config.toml"
+    DEFAULT_LOCAL_PATH = Path(".codex") / "config.toml"
+
+    def __init__(self, config_path: Optional[Path] = None, scope: str = "global"):
+        """Initialize configuration manager.
+
+        Args:
+            config_path: Override path to config.toml
+            scope: "global" or "project" (default: global)
+        """
+        if config_path:
+            self.config_path = Path(config_path)
+        elif scope == "project":
+            self.config_path = self.DEFAULT_LOCAL_PATH
+        else:
+            self.config_path = self.DEFAULT_GLOBAL_PATH
+
+    def read(self) -> CodexConfig:
+        """Read and parse the configuration file.
+
+        Returns:
+            CodexConfig with parsed configuration
+        """
+        if tomllib is None:
+            raise ImportError("tomli package required for Python < 3.11: pip install tomli")
+
+        if not self.config_path.exists():
+            return CodexConfig()
+
+        with open(self.config_path, "rb") as f:
+            data = tomllib.load(f)
+
+        # Transform nested TOML tables into our model
+        # TOML format: [mcp_servers.databricks] becomes key "mcp_servers.databricks"
+        # or nested dict mcp_servers: {databricks: {...}}
+        mcp_servers: Dict[str, MCPServerConfig] = {}
+
+        # Handle both flat keys (mcp_servers.name) and nested dicts
+        for key, value in data.items():
+            if key.startswith("mcp_servers."):
+                server_name = key.split(".", 1)[1]
+                if isinstance(value, dict):
+                    mcp_servers[server_name] = MCPServerConfig(**value)
+            elif key == "mcp_servers" and isinstance(value, dict):
+                for server_name, server_config in value.items():
+                    if isinstance(server_config, dict):
+                        mcp_servers[server_name] = MCPServerConfig(**server_config)
+
+        return CodexConfig(mcp_servers=mcp_servers)
+
+    def write(self, config: CodexConfig) -> None:
+        """Write configuration to file atomically.
+
+        Uses temp file + rename pattern to prevent corruption.
+
+        Args:
+            config: Configuration to write
+        """
+        if tomli_w is None:
+            raise ImportError("tomli-w package required: pip install tomli-w")
+
+        self.config_path.parent.mkdir(parents=True, exist_ok=True)
+
+        # Convert to TOML-compatible dict with nested structure
+        data: Dict[str, Any] = {}
+
+        if config.mcp_servers:
+            data["mcp_servers"] = {}
+            for name, server in config.mcp_servers.items():
+                server_dict = server.model_dump(exclude_none=True)
+                # Remove empty collections
+                if not server_dict.get("args"):
+                    server_dict.pop("args", None)
+                if not server_dict.get("env"):
+                    server_dict.pop("env", None)
+                data["mcp_servers"][name] = server_dict
+
+        # Atomic write: temp file + rename
+        fd, tmp_path = tempfile.mkstemp(
+            dir=self.config_path.parent,
+            prefix=".config-tmp-",
+            suffix=".toml",
+        )
+        try:
+            with os.fdopen(fd, "wb") as f:
+                tomli_w.dump(data, f)
+            os.replace(tmp_path, self.config_path)
+        except Exception:
+            try:
+                os.unlink(tmp_path)
+            except OSError:
+                pass
+            raise
+
+    def configure_databricks_mcp(
+        self,
+        profile: str = "DEFAULT",
+        python_path: Optional[str] = None,
+        mcp_entry: Optional[str] = None,
+    ) -> None:
+        """Configure Databricks MCP server in Codex config.
+
+        Args:
+            profile: Databricks config profile to use
+            python_path: Path to Python executable (auto-detected if None)
+            mcp_entry: Path to MCP entry script (auto-detected if None)
+        """
+        config = self.read()
+
+        # Auto-detect paths if not provided
+        if python_path is None:
+            python_path = self._detect_venv_python()
+        if mcp_entry is None:
+            mcp_entry = self._detect_mcp_entry()
+
+        env: Dict[str, str] = {}
+        if profile != "DEFAULT":
+            env["DATABRICKS_CONFIG_PROFILE"] = profile
+
+        config.mcp_servers["databricks"] = MCPServerConfig(
+            command=python_path,
+            args=[mcp_entry] if mcp_entry else [],
+            env=env,
+        )
+
+        self.write(config)
+
+    def remove_databricks_mcp(self) -> bool:
+        """Remove Databricks MCP server from configuration.
+
+        Returns:
+            True if removed, False if not found
+        """
+        config = self.read()
+        if "databricks" in config.mcp_servers:
+            del config.mcp_servers["databricks"]
+            self.write(config)
+            return True
+        return False
+
+    def _detect_venv_python(self) -> str:
+        """Detect the ai-dev-kit venv Python path."""
+        default_paths = [
+            Path.home() / ".ai-dev-kit" / ".venv" / "bin" / "python",
+            Path.home() / ".ai-dev-kit" / "venv" / "bin" / "python",
+        ]
+        for path in default_paths:
+            if path.exists():
+                return str(path)
+        return sys.executable
+
+    def _detect_mcp_entry(self) -> str:
+        """Detect the MCP server entry point."""
+        default_paths = [
+            Path.home() / ".ai-dev-kit" / "repo" / "databricks-mcp-server" / "run_server.py",
+            Path.home()
+            / ".ai-dev-kit"
+            / "databricks-mcp-server"
+            / "databricks_mcp_server"
+            / "__main__.py",
+        ]
+        for path in default_paths:
+            if path.exists():
+                return str(path)
+        # Fallback to module invocation
+        return "-m databricks_mcp_server"
+
+    def has_databricks_mcp(self) -> bool:
+        """Check if Databricks MCP server is configured.
+
+        Returns:
+            True if configured
+        """
+        config = self.read()
+        return "databricks" in config.mcp_servers
+
+    def get_databricks_mcp_config(self) -> Optional[MCPServerConfig]:
+        """Get the Databricks MCP server configuration.
+
+        Returns:
+            MCPServerConfig if configured, None otherwise
+        """
+        config = self.read()
+        return config.mcp_servers.get("databricks")
diff --git a/databricks-codex/databricks_codex/executor.py b/databricks-codex/databricks_codex/executor.py
new file mode 100644
index 00000000..a14a2cd2
--- /dev/null
+++ b/databricks-codex/databricks_codex/executor.py
@@ -0,0 +1,313 @@
+"""Codex exec wrapper with Databricks context.
+
+Provides Python SDK for running codex exec commands with proper
+Databricks authentication and environment setup.
+"""
+
+import asyncio
+import logging
+import os
+import subprocess
+import threading
+import time
+import uuid
+from concurrent.futures import ThreadPoolExecutor
+from contextvars import copy_context
+from typing import Callable, Dict, List, Optional
+
+from databricks_codex.auth import get_databricks_env
+from databricks_codex.models import CodexExecOptions, ExecutionResult, ExecutionStatus, SandboxMode
+
+logger = logging.getLogger(__name__)
+
+# Threshold for switching to async mode (prevents connection timeouts)
+SAFE_EXECUTION_THRESHOLD = 30
+
+
+class CodexExecutor:
+    """Execute Codex commands with Databricks context.
+
+    Provides both synchronous and asynchronous execution patterns with
+    timeout handling for long-running operations.
+
+    Example:
+        >>> executor = CodexExecutor()
+        >>> options = CodexExecOptions(
+        ...     prompt="List all files in current directory",
+        ...     sandbox_mode=SandboxMode.READ_ONLY,
+        ... )
+        >>> result = executor.exec_sync(options)
+        >>> print(result.stdout)
+    """
+
+    def __init__(
+        self,
+        default_timeout: int = 300,
+        max_workers: int = 4,
+    ):
+        """Initialize executor.
+
+        Args:
+            default_timeout: Default timeout in seconds
+            max_workers: Maximum worker threads for async operations
+        """
+        self.default_timeout = default_timeout
+        self._executor = ThreadPoolExecutor(max_workers=max_workers)
+        self._operations: Dict[str, ExecutionResult] = {}
+        self._lock = threading.Lock()
+
+    def exec_sync(self, options: CodexExecOptions) -> ExecutionResult:
+        """Execute codex exec synchronously.
+
+        Args:
+            options: Execution options
+
+        Returns:
+            ExecutionResult with output and status
+
+        Example:
+            >>> result = executor.exec_sync(CodexExecOptions(prompt="echo hello"))
+            >>> assert result.status == ExecutionStatus.COMPLETED
+        """
+        start_time = time.time()
+
+        cmd = self._build_command(options)
+        env = self._build_env(options)
+
+        logger.info(f"Running codex exec: {' '.join(cmd[:5])}...")
+        logger.debug(f"Full command: {cmd}")
+
+        try:
+            result = subprocess.run(
+                cmd,
+                capture_output=True,
+                text=True,
+                timeout=options.timeout or self.default_timeout,
+                cwd=options.working_dir,
+                env=env,
+            )
+
+            elapsed = time.time() - start_time
+
+            status = ExecutionStatus.COMPLETED if result.returncode == 0 else ExecutionStatus.FAILED
+
+            logger.info(f"Codex exec completed in {elapsed:.1f}s with status {status.value}")
+
+            return ExecutionResult(
+                status=status,
+                stdout=result.stdout,
+                stderr=result.stderr,
+                exit_code=result.returncode,
+                elapsed_seconds=elapsed,
+            )
+
+        except subprocess.TimeoutExpired as e:
+            elapsed = time.time() - start_time
+            logger.warning(f"Codex exec timed out after {elapsed:.1f}s")
+            return ExecutionResult(
+                status=ExecutionStatus.TIMEOUT,
+                stdout=e.stdout or "" if hasattr(e, "stdout") else "",
+                stderr=e.stderr or "" if hasattr(e, "stderr") else "",
+                elapsed_seconds=elapsed,
+            )
+        except FileNotFoundError:
+            elapsed = time.time() - start_time
+            return ExecutionResult(
+                status=ExecutionStatus.FAILED,
+                stderr="Codex CLI not found. Install from: npm i -g @openai/codex",
+                elapsed_seconds=elapsed,
+            )
+        except Exception as e:
+            elapsed = time.time() - start_time
+            logger.exception("Codex exec error")
+            return ExecutionResult(
+                status=ExecutionStatus.FAILED,
+                stderr=str(e),
+                elapsed_seconds=elapsed,
+            )
+
+    async def exec_async(
+        self,
+        options: CodexExecOptions,
+        callback: Optional[Callable[[ExecutionResult], None]] = None,
+    ) -> ExecutionResult:
+        """Execute codex exec asynchronously with timeout handling.
+
+        Uses async handoff pattern for long-running operations to prevent
+        connection timeouts.
+
+        Args:
+            options: Execution options
+            callback: Optional callback when execution completes
+
+        Returns:
+            ExecutionResult (may contain operation_id for async tracking)
+
+        Example:
+            >>> async def main():
+            ...     result = await executor.exec_async(options)
+            ...     if result.status == ExecutionStatus.RUNNING:
+            ...         # Poll for completion
+            ...         final = executor.get_operation(result.operation_id)
+        """
+        start_time = time.time()
+
+        # Copy context for thread execution
+        ctx = copy_context()
+
+        def run_in_context():
+            return ctx.run(self.exec_sync, options)
+
+        # Run in executor with heartbeat pattern
+        loop = asyncio.get_event_loop()
+        future = loop.run_in_executor(self._executor, run_in_context)
+
+        HEARTBEAT_INTERVAL = 10
+
+        while True:
+            try:
+                result = await asyncio.wait_for(
+                    asyncio.shield(future),
+                    timeout=HEARTBEAT_INTERVAL,
+                )
+                if callback:
+                    callback(result)
+                return result
+
+            except asyncio.TimeoutError:
+                elapsed = time.time() - start_time
+                logger.debug(f"Codex exec still running... ({elapsed:.0f}s elapsed)")
+
+                # Switch to async mode if exceeding threshold
+                if elapsed > SAFE_EXECUTION_THRESHOLD:
+                    op_id = str(uuid.uuid4())[:8]
+
+                    # Track operation
+                    with self._lock:
+                        self._operations[op_id] = ExecutionResult(
+                            status=ExecutionStatus.RUNNING,
+                            operation_id=op_id,
+                            elapsed_seconds=elapsed,
+                        )
+
+                    # Background completion handler
+                    def complete_background():
+                        try:
+                            result = future.result()
+                            result.operation_id = op_id
+                            with self._lock:
+                                self._operations[op_id] = result
+                            if callback:
+                                callback(result)
+                            logger.info(f"Operation {op_id} completed: {result.status.value}")
+                        except Exception as e:
+                            with self._lock:
+                                self._operations[op_id] = ExecutionResult(
+                                    status=ExecutionStatus.FAILED,
+                                    stderr=str(e),
+                                    operation_id=op_id,
+                                )
+                            logger.error(f"Operation {op_id} failed: {e}")
+
+                    threading.Thread(
+                        target=complete_background,
+                        daemon=True,
+                    ).start()
+
+                    logger.info(f"Operation handed off to background: {op_id}")
+
+                    return ExecutionResult(
+                        status=ExecutionStatus.RUNNING,
+                        operation_id=op_id,
+                        elapsed_seconds=elapsed,
+                    )
+
+    def get_operation(self, operation_id: str) -> Optional[ExecutionResult]:
+        """Get status of an async operation.
+
+        Args:
+            operation_id: Operation ID from exec_async
+
+        Returns:
+            ExecutionResult if found, None otherwise
+
+        Example:
+            >>> result = executor.get_operation("abc123")
+            >>> if result and result.status == ExecutionStatus.COMPLETED:
+            ...     print(result.stdout)
+        """
+        with self._lock:
+            return self._operations.get(operation_id)
+
+    def list_operations(self) -> Dict[str, ExecutionResult]:
+        """List all tracked operations.
+
+        Returns:
+            Dictionary of operation_id -> ExecutionResult
+        """
+        with self._lock:
+            return dict(self._operations)
+
+    def clear_operation(self, operation_id: str) -> bool:
+        """Clear a completed operation from tracking.
+
+        Args:
+            operation_id: Operation ID to clear
+
+        Returns:
+            True if cleared, False if not found
+        """
+        with self._lock:
+            if operation_id in self._operations:
+                del self._operations[operation_id]
+                return True
+            return False
+
+    def _build_command(self, options: CodexExecOptions) -> List[str]:
+        """Build the codex exec command."""
+        cmd = ["codex", "exec"]
+
+        # Sandbox mode
+        if options.sandbox_mode != SandboxMode.READ_ONLY:
+            cmd.extend(["--sandbox", options.sandbox_mode.value])
+
+        # Model selection
+        if options.model:
+            cmd.extend(["--model", options.model])
+
+        # Working directory
+        if options.working_dir:
+            cmd.extend(["--cd", str(options.working_dir)])
+
+        # The prompt is the final positional argument
+        cmd.append(options.prompt)
+
+        return cmd
+
+    def _build_env(self, options: CodexExecOptions) -> Dict[str, str]:
+        """Build environment for execution."""
+        env = os.environ.copy()
+
+        # Inject Databricks credentials
+        if options.inject_databricks_env:
+            databricks_env = get_databricks_env(options.databricks_profile)
+            env.update(databricks_env)
+
+        # Add custom environment variables
+        env.update(options.env_vars)
+
+        return env
+
+    def shutdown(self, wait: bool = True) -> None:
+        """Shutdown the executor.
+
+        Args:
+            wait: Whether to wait for pending operations
+        """
+        self._executor.shutdown(wait=wait)
+
+    def __enter__(self):
+        return self
+
+    def __exit__(self, *args):
+        self.shutdown()
diff --git a/databricks-codex/databricks_codex/mcp_client.py b/databricks-codex/databricks_codex/mcp_client.py
new file mode 100644
index 00000000..336d8e2d
--- /dev/null
+++ b/databricks-codex/databricks_codex/mcp_client.py
@@ -0,0 +1,368 @@
+"""Client for Codex running as MCP server.
+
+Enables Databricks workflows to use Codex AI capabilities through
+the MCP protocol.
+"""
+
+import asyncio
+import json
+import logging
+from dataclasses import dataclass, field
+from typing import Any, Dict, List, Optional
+
+from pydantic import BaseModel, Field
+
+from databricks_codex.models import MCPToolInfo
+
+logger = logging.getLogger(__name__)
+
+
+class MCPClientConfig(BaseModel):
+    """Configuration for MCP client connection."""
+
+    # Stdio transport (default for Codex)
+    command: Optional[str] = Field(None, description="Command to run MCP server")
+    args: List[str] = Field(default_factory=list, description="Command arguments")
+    env: Dict[str, str] = Field(default_factory=dict, description="Environment variables")
+
+    # HTTP transport (alternative)
+    url: Optional[str] = Field(None, description="HTTP URL for MCP server")
+    bearer_token_env_var: Optional[str] = Field(
+        None, description="Environment variable containing bearer token"
+    )
+
+    # Common settings
+    timeout: int = Field(default=120, description="Request timeout in seconds")
+
+
+@dataclass
+class MCPError(Exception):
+    """MCP protocol error."""
+
+    code: int
+    message: str
+    data: Optional[Any] = None
+
+
+class CodexMCPClient:
+    """Client to interact with Codex running as MCP server.
+
+    Supports both stdio and HTTP transports as documented in
+    Codex CLI reference.
+
+    Example:
+        >>> async with CodexMCPClient() as client:
+        ...     tools = await client.list_tools()
+        ...     result = await client.call_tool("generate_code", {"prompt": "..."})
+    """
+
+    def __init__(self, config: Optional[MCPClientConfig] = None):
+        """Initialize the MCP client.
+
+        Args:
+            config: Connection configuration (defaults to Codex mcp-server via stdio)
+        """
+        self.config = config or MCPClientConfig(
+            command="codex",
+            args=["mcp-server"],
+        )
+        self._process: Optional[asyncio.subprocess.Process] = None
+        self._http_client: Optional[Any] = None  # httpx.AsyncClient if available
+        self._request_id = 0
+        self._connected = False
+        self._pending_requests: Dict[int, asyncio.Future] = {}
+        self._reader_task: Optional[asyncio.Task] = None
+
+    async def connect(self) -> None:
+        """Establish connection to Codex MCP server.
+
+        Raises:
+            ConnectionError: If connection fails
+        """
+        if self.config.url:
+            await self._connect_http()
+        else:
+            await self._connect_stdio()
+        self._connected = True
+        logger.info("Connected to Codex MCP server")
+
+    async def _connect_stdio(self) -> None:
+        """Connect via stdio transport."""
+        if not self.config.command:
+            raise ValueError("command required for stdio transport")
+
+        cmd = [self.config.command] + self.config.args
+
+        logger.info(f"Starting Codex MCP server: {' '.join(cmd)}")
+
+        import os
+
+        env = os.environ.copy()
+        env.update(self.config.env)
+
+        self._process = await asyncio.create_subprocess_exec(
+            *cmd,
+            stdin=asyncio.subprocess.PIPE,
+            stdout=asyncio.subprocess.PIPE,
+            stderr=asyncio.subprocess.PIPE,
+            env=env,
+        )
+
+        # Start background reader for responses
+        self._reader_task = asyncio.create_task(self._read_responses())
+
+        # Send initialize request
+        response = await self._send_request(
+            "initialize",
+            {
+                "protocolVersion": "2024-11-05",
+                "capabilities": {},
+                "clientInfo": {
+                    "name": "databricks-codex",
+                    "version": "0.1.0",
+                },
+            },
+        )
+
+        logger.debug(f"MCP initialize response: {response}")
+
+    async def _connect_http(self) -> None:
+        """Connect via HTTP transport."""
+        try:
+            import httpx
+        except ImportError:
+            raise ImportError("httpx required for HTTP transport: pip install httpx")
+
+        headers: Dict[str, str] = {"Content-Type": "application/json"}
+
+        if self.config.bearer_token_env_var:
+            import os
+
+            token = os.environ.get(self.config.bearer_token_env_var)
+            if token:
+                headers["Authorization"] = f"Bearer {token}"
+
+        self._http_client = httpx.AsyncClient(
+            base_url=self.config.url,
+            headers=headers,
+            timeout=self.config.timeout,
+        )
+
+    async def disconnect(self) -> None:
+        """Close the connection."""
+        self._connected = False
+
+        if self._reader_task:
+            self._reader_task.cancel()
+            try:
+                await self._reader_task
+            except asyncio.CancelledError:
+                pass
+            self._reader_task = None
+
+        if self._process:
+            self._process.terminate()
+            try:
+                await asyncio.wait_for(self._process.wait(), timeout=5)
+            except asyncio.TimeoutError:
+                self._process.kill()
+            self._process = None
+
+        if self._http_client:
+            await self._http_client.aclose()
+            self._http_client = None
+
+        logger.info("Disconnected from Codex MCP server")
+
+    async def list_tools(self) -> List[MCPToolInfo]:
+        """List available tools from the MCP server.
+
+        Returns:
+            List of available tools with their schemas
+
+        Example:
+            >>> tools = await client.list_tools()
+            >>> for tool in tools:
+            ...     print(f"{tool.name}: {tool.description}")
+        """
+        result = await self._send_request("tools/list", {})
+
+        tools = []
+        for tool_data in result.get("tools", []):
+            tools.append(
+                MCPToolInfo(
+                    name=tool_data["name"],
+                    description=tool_data.get("description", ""),
+                    input_schema=tool_data.get("inputSchema", {}),
+                )
+            )
+
+        return tools
+
+    async def call_tool(
+        self,
+        name: str,
+        arguments: Optional[Dict[str, Any]] = None,
+        timeout: Optional[int] = None,
+    ) -> Dict[str, Any]:
+        """Call an MCP tool.
+
+        Args:
+            name: Tool name
+            arguments: Tool arguments
+            timeout: Optional timeout override
+
+        Returns:
+            Tool result as dictionary
+
+        Example:
+            >>> result = await client.call_tool(
+            ...     "generate_code",
+            ...     {"prompt": "Create a hello world function"}
+            ... )
+        """
+        result = await self._send_request(
+            "tools/call",
+            {"name": name, "arguments": arguments or {}},
+            timeout=timeout,
+        )
+
+        # Parse content from result
+        content = result.get("content", [])
+        if content and isinstance(content, list) and len(content) > 0:
+            first_content = content[0]
+            if isinstance(first_content, dict) and first_content.get("type") == "text":
+                text = first_content.get("text", "")
+                try:
+                    return json.loads(text)
+                except json.JSONDecodeError:
+                    return {"text": text}
+
+        return result
+
+    async def _send_request(
+        self,
+        method: str,
+        params: Dict[str, Any],
+        timeout: Optional[int] = None,
+    ) -> Dict[str, Any]:
+        """Send a JSON-RPC request to the MCP server."""
+        self._request_id += 1
+        request = {
+            "jsonrpc": "2.0",
+            "id": self._request_id,
+            "method": method,
+            "params": params,
+        }
+
+        if self._http_client:
+            return await self._send_http(request, timeout)
+        else:
+            return await self._send_stdio(request, timeout)
+
+    async def _send_stdio(
+        self,
+        request: Dict[str, Any],
+        timeout: Optional[int] = None,
+    ) -> Dict[str, Any]:
+        """Send request via stdio."""
+        if not self._process or not self._process.stdin:
+            raise RuntimeError("Not connected")
+
+        request_id = request["id"]
+
+        # Create future for response
+        future: asyncio.Future = asyncio.get_event_loop().create_future()
+        self._pending_requests[request_id] = future
+
+        # Send request
+        request_bytes = (json.dumps(request) + "\n").encode()
+        self._process.stdin.write(request_bytes)
+        await self._process.stdin.drain()
+
+        # Wait for response
+        try:
+            response = await asyncio.wait_for(
+                future,
+                timeout=timeout or self.config.timeout,
+            )
+            return response
+        finally:
+            self._pending_requests.pop(request_id, None)
+
+    async def _read_responses(self) -> None:
+        """Background task to read responses from stdout."""
+        if not self._process or not self._process.stdout:
+            return
+
+        try:
+            while self._connected:
+                line = await self._process.stdout.readline()
+                if not line:
+                    break
+
+                try:
+                    response = json.loads(line.decode())
+                    request_id = response.get("id")
+
+                    if request_id and request_id in self._pending_requests:
+                        future = self._pending_requests[request_id]
+                        if "error" in response:
+                            error = response["error"]
+                            future.set_exception(
+                                MCPError(
+                                    code=error.get("code", -1),
+                                    message=error.get("message", "Unknown error"),
+                                    data=error.get("data"),
+                                )
+                            )
+                        else:
+                            future.set_result(response.get("result", {}))
+                except json.JSONDecodeError:
+                    logger.warning(f"Invalid JSON response: {line}")
+                except Exception as e:
+                    logger.exception(f"Error processing response: {e}")
+        except asyncio.CancelledError:
+            pass
+        except Exception as e:
+            logger.exception(f"Reader task error: {e}")
+
+    async def _send_http(
+        self,
+        request: Dict[str, Any],
+        timeout: Optional[int] = None,
+    ) -> Dict[str, Any]:
+        """Send request via HTTP."""
+        if not self._http_client:
+            raise RuntimeError("Not connected")
+
+        response = await self._http_client.post(
+            "/",
+            json=request,
+            timeout=timeout or self.config.timeout,
+        )
+        response.raise_for_status()
+
+        data = response.json()
+
+        if "error" in data:
+            error = data["error"]
+            raise MCPError(
+                code=error.get("code", -1),
+                message=error.get("message", "Unknown error"),
+                data=error.get("data"),
+            )
+
+        return data.get("result", {})
+
+    @property
+    def is_connected(self) -> bool:
+        """Check if client is connected."""
+        return self._connected
+
+    async def __aenter__(self):
+        await self.connect()
+        return self
+
+    async def __aexit__(self, *args):
+        await self.disconnect()
diff --git a/databricks-codex/databricks_codex/models.py b/databricks-codex/databricks_codex/models.py
new file mode 100644
index 00000000..38162035
--- /dev/null
+++ b/databricks-codex/databricks_codex/models.py
@@ -0,0 +1,187 @@
+"""Data models for Databricks Codex integration.
+
+Provides Enums, TypedDicts, and Pydantic models for type safety.
+"""
+
+from dataclasses import dataclass, field
+from datetime import datetime
+from enum import Enum
+from typing import Any, Dict, List, Optional, TypedDict
+
+from pydantic import BaseModel, Field
+
+
+# ============================================================================
+# Enums
+# ============================================================================
+
+
+class CodexToolCategory(Enum):
+    """Categories of Codex tools."""
+
+    GENERATION = "generation"
+    ANALYSIS = "analysis"
+    REFACTORING = "refactoring"
+    TESTING = "testing"
+    DOCUMENTATION = "documentation"
+
+
+class TransportType(Enum):
+    """MCP transport types."""
+
+    STDIO = "stdio"
+    HTTP = "http"
+
+
+class SandboxMode(Enum):
+    """Codex sandbox modes for security."""
+
+    READ_ONLY = "read-only"
+    WORKSPACE_WRITE = "workspace-write"
+    FULL_ACCESS = "danger-full-access"
+
+
+class ExecutionStatus(Enum):
+    """Status of a Codex execution."""
+
+    PENDING = "pending"
+    RUNNING = "running"
+    COMPLETED = "completed"
+    FAILED = "failed"
+    TIMEOUT = "timeout"
+
+
+class CodexAuthMethod(Enum):
+    """Codex authentication methods."""
+
+    CHATGPT_OAUTH = "chatgpt"
+    DEVICE_CODE = "device"
+    API_KEY = "api_key"
+    NONE = "none"
+
+
+# ============================================================================
+# TypedDicts (for loose typing)
+# ============================================================================
+
+
+class ToolResultDict(TypedDict, total=False):
+    """Tool call result."""
+
+    content: List[Dict[str, Any]]
+    is_error: bool
+    metadata: Dict[str, Any]
+
+
+class SessionInfoDict(TypedDict, total=False):
+    """Session information."""
+
+    session_id: str
+    created_at: str
+    last_activity: str
+    project_dir: str
+
+
+class MCPServerDict(TypedDict, total=False):
+    """MCP server configuration."""
+
+    command: str
+    args: List[str]
+    env: Dict[str, str]
+    url: str
+    bearer_token_env_var: str
+
+
+# ============================================================================
+# Dataclasses
+# ============================================================================
+
+
+@dataclass
+class ExecutionResult:
+    """Result from codex exec."""
+
+    status: ExecutionStatus
+    stdout: str = ""
+    stderr: str = ""
+    exit_code: Optional[int] = None
+    elapsed_seconds: float = 0.0
+    operation_id: Optional[str] = None
+
+
+@dataclass
+class MCPToolInfo:
+    """Information about an available MCP tool."""
+
+    name: str
+    description: str
+    input_schema: Dict[str, Any] = field(default_factory=dict)
+
+
+# ============================================================================
+# Pydantic Models (for strict typing and validation)
+# ============================================================================
+
+
+class CodexExecOptions(BaseModel):
+    """Options for codex exec command."""
+
+    prompt: str = Field(..., description="The prompt to send to Codex")
+    working_dir: Optional[str] = Field(None, description="Working directory for execution")
+    sandbox_mode: SandboxMode = Field(
+        default=SandboxMode.READ_ONLY, description="Sandbox security mode"
+    )
+    model: Optional[str] = Field(None, description="Model to use (e.g., gpt-4)")
+    timeout: int = Field(default=300, description="Timeout in seconds")
+    env_vars: Dict[str, str] = Field(default_factory=dict, description="Environment variables")
+    databricks_profile: str = Field(default="DEFAULT", description="Databricks config profile")
+    inject_databricks_env: bool = Field(
+        default=True, description="Inject Databricks credentials into environment"
+    )
+
+    model_config = {"use_enum_values": False}
+
+
+class CodexToolCall(BaseModel):
+    """A tool call made through Codex."""
+
+    id: str = Field(..., description="Unique tool call ID")
+    name: str = Field(..., description="Tool name")
+    arguments: Dict[str, Any] = Field(default_factory=dict, description="Tool arguments")
+    timestamp: datetime = Field(default_factory=datetime.now, description="Call timestamp")
+
+
+class CodexResponse(BaseModel):
+    """Response from a Codex operation."""
+
+    success: bool = Field(..., description="Whether the operation succeeded")
+    content: str = Field(default="", description="Response content")
+    tool_calls: List[CodexToolCall] = Field(
+        default_factory=list, description="Tool calls made during execution"
+    )
+    error: Optional[str] = Field(None, description="Error message if failed")
+    elapsed_ms: int = Field(default=0, description="Execution time in milliseconds")
+
+
+class DatabricksContext(BaseModel):
+    """Databricks context for Codex operations."""
+
+    host: Optional[str] = Field(None, description="Databricks workspace host")
+    profile: str = Field(default="DEFAULT", description="Config profile name")
+    catalog: Optional[str] = Field(None, description="Default catalog")
+    schema_name: Optional[str] = Field(None, description="Default schema")
+    warehouse_id: Optional[str] = Field(None, description="SQL warehouse ID")
+
+
+class CodexIntegrationConfig(BaseModel):
+    """Full integration configuration."""
+
+    codex_path: str = Field(default="codex", description="Path to Codex CLI binary")
+    databricks_context: DatabricksContext = Field(
+        default_factory=DatabricksContext, description="Databricks context"
+    )
+    mcp_servers: Dict[str, MCPServerDict] = Field(
+        default_factory=dict, description="MCP server configurations"
+    )
+    default_timeout: int = Field(default=300, description="Default timeout in seconds")
+    sandbox_mode: str = Field(default="read-only", description="Default sandbox mode")
diff --git a/databricks-codex/databricks_codex/session.py b/databricks-codex/databricks_codex/session.py
new file mode 100644
index 00000000..b7b5f340
--- /dev/null
+++ b/databricks-codex/databricks_codex/session.py
@@ -0,0 +1,286 @@
+"""Session management for Codex CLI.
+
+Provides utilities for session persistence, resume, and fork
+operations as supported by Codex CLI.
+"""
+
+import logging
+import re
+import subprocess
+from dataclasses import dataclass, field
+from datetime import datetime
+from pathlib import Path
+from typing import Any, Dict, List, Optional
+
+logger = logging.getLogger(__name__)
+
+
+@dataclass
+class CodexSession:
+    """Represents a Codex CLI session."""
+
+    session_id: str
+    created_at: datetime
+    last_activity: Optional[datetime] = None
+    project_dir: Optional[Path] = None
+    metadata: Dict[str, Any] = field(default_factory=dict)
+
+
+class SessionManager:
+    """Manage Codex CLI sessions.
+
+    Provides utilities for:
+    - Listing recent sessions
+    - Resuming previous sessions
+    - Forking sessions into new conversations
+
+    Example:
+        >>> manager = SessionManager()
+        >>> sessions = manager.list_sessions()
+        >>> if sessions:
+        ...     manager.resume_session(sessions[0].session_id)
+    """
+
+    def __init__(self, session_dir: Optional[Path] = None):
+        """Initialize session manager.
+
+        Args:
+            session_dir: Directory for session data (default: ~/.codex/sessions)
+        """
+        self.session_dir = session_dir or (Path.home() / ".codex" / "sessions")
+
+    def list_sessions(self, limit: int = 10) -> List[CodexSession]:
+        """List recent Codex sessions.
+
+        Args:
+            limit: Maximum sessions to return
+
+        Returns:
+            List of sessions, most recent first
+
+        Example:
+            >>> sessions = manager.list_sessions(limit=5)
+            >>> for s in sessions:
+            ...     print(f"{s.session_id}: {s.created_at}")
+        """
+        try:
+            # Codex doesn't have a direct list command, try resume --list or similar
+            result = subprocess.run(
+                ["codex", "resume", "--last"],
+                capture_output=True,
+                text=True,
+                timeout=30,
+            )
+
+            # Parse output to extract session info
+            # Note: This is heuristic as Codex output format may vary
+            sessions = self._parse_session_output(result.stdout)
+            return sessions[:limit]
+
+        except FileNotFoundError:
+            logger.warning("Codex CLI not found")
+            return []
+        except subprocess.TimeoutExpired:
+            logger.warning("Codex session list timed out")
+            return []
+        except Exception as e:
+            logger.warning(f"Failed to list sessions: {e}")
+            return []
+
+    def get_last_session(self) -> Optional[CodexSession]:
+        """Get the most recent session.
+
+        Returns:
+            Most recent CodexSession or None
+
+        Example:
+            >>> session = manager.get_last_session()
+            >>> if session:
+            ...     print(f"Last session: {session.session_id}")
+        """
+        sessions = self.list_sessions(limit=1)
+        return sessions[0] if sessions else None
+
+    def resume_session(
+        self,
+        session_id: Optional[str] = None,
+        last: bool = False,
+    ) -> Optional[str]:
+        """Resume a previous session.
+
+        Args:
+            session_id: Session ID to resume (optional if last=True)
+            last: Resume the most recent session
+
+        Returns:
+            Session ID if successful, None otherwise
+
+        Example:
+            >>> new_id = manager.resume_session(last=True)
+            >>> # Or with specific ID
+            >>> new_id = manager.resume_session("abc123")
+        """
+        try:
+            cmd = ["codex", "resume"]
+
+            if last:
+                cmd.append("--last")
+            elif session_id:
+                cmd.append(session_id)
+            else:
+                logger.error("Must provide session_id or set last=True")
+                return None
+
+            result = subprocess.run(
+                cmd,
+                capture_output=True,
+                text=True,
+                timeout=30,
+            )
+
+            if result.returncode == 0:
+                # Extract session ID from output
+                resumed_id = self._extract_session_id(result.stdout) or session_id
+                logger.info(f"Resumed session: {resumed_id}")
+                return resumed_id
+
+            logger.warning(f"Failed to resume session: {result.stderr}")
+            return None
+
+        except FileNotFoundError:
+            logger.error("Codex CLI not found")
+            return None
+        except subprocess.TimeoutExpired:
+            logger.warning("Resume session timed out")
+            return None
+        except Exception as e:
+            logger.error(f"Failed to resume session: {e}")
+            return None
+
+    def fork_session(
+        self,
+        session_id: str,
+        new_prompt: Optional[str] = None,
+    ) -> Optional[str]:
+        """Fork an existing session into a new conversation.
+
+        Args:
+            session_id: Session to fork from
+            new_prompt: Optional initial prompt for forked session
+
+        Returns:
+            New session ID if successful
+
+        Example:
+            >>> new_id = manager.fork_session(
+            ...     "abc123",
+            ...     new_prompt="Continue but focus on error handling"
+            ... )
+        """
+        try:
+            cmd = ["codex", "fork", session_id]
+
+            if new_prompt:
+                cmd.extend(["--prompt", new_prompt])
+
+            result = subprocess.run(
+                cmd,
+                capture_output=True,
+                text=True,
+                timeout=30,
+            )
+
+            if result.returncode == 0:
+                new_id = self._extract_session_id(result.stdout)
+                logger.info(f"Forked session {session_id} -> {new_id}")
+                return new_id
+
+            logger.warning(f"Failed to fork session: {result.stderr}")
+            return None
+
+        except FileNotFoundError:
+            logger.error("Codex CLI not found")
+            return None
+        except subprocess.TimeoutExpired:
+            logger.warning("Fork session timed out")
+            return None
+        except Exception as e:
+            logger.error(f"Failed to fork session {session_id}: {e}")
+            return None
+
+    def session_exists(self, session_id: str) -> bool:
+        """Check if a session exists.
+
+        Args:
+            session_id: Session ID to check
+
+        Returns:
+            True if session exists
+        """
+        # Try to get info about the session
+        # This is a heuristic approach
+        sessions = self.list_sessions(limit=100)
+        return any(s.session_id == session_id for s in sessions)
+
+    def _parse_session_output(self, output: str) -> List[CodexSession]:
+        """Parse session information from Codex output.
+
+        This is heuristic as Codex output format may vary.
+        """
+        sessions = []
+
+        # Look for session ID patterns (typically UUID-like)
+        # Format may be: "Session: abc123..." or "abc123-def456..."
+        id_pattern = re.compile(r"([a-f0-9]{8}(?:-[a-f0-9]{4}){3}-[a-f0-9]{12}|[a-f0-9]{8,})")
+
+        for line in output.strip().split("\n"):
+            line = line.strip()
+            if not line:
+                continue
+
+            match = id_pattern.search(line)
+            if match:
+                session_id = match.group(1)
+                sessions.append(
+                    CodexSession(
+                        session_id=session_id,
+                        created_at=datetime.now(),  # Placeholder
+                    )
+                )
+
+        return sessions
+
+    def _extract_session_id(self, output: str) -> Optional[str]:
+        """Extract session ID from Codex output."""
+        # Look for UUID-like patterns
+        id_pattern = re.compile(r"([a-f0-9]{8}(?:-[a-f0-9]{4}){3}-[a-f0-9]{12}|[a-f0-9]{8,})")
+
+        match = id_pattern.search(output)
+        if match:
+            return match.group(1)
+
+        # Fallback: try to get last word/line
+        lines = output.strip().split("\n")
+        if lines:
+            last_line = lines[-1].strip()
+            words = last_line.split()
+            if words:
+                return words[-1]
+
+        return None
+
+    def clear_sessions(self, keep_last: int = 5) -> int:
+        """Clear old sessions, keeping the most recent.
+
+        Args:
+            keep_last: Number of recent sessions to keep
+
+        Returns:
+            Number of sessions cleared
+
+        Note: This may not be supported by Codex CLI directly.
+        """
+        # Codex may not have a clear command
+        # This would need to clean up ~/.codex/sessions directory
+        logger.warning("Session clearing not implemented - manual cleanup may be required")
+        return 0
diff --git a/databricks-codex/pyproject.toml b/databricks-codex/pyproject.toml
new file mode 100644
index 00000000..cb5023f3
--- /dev/null
+++ b/databricks-codex/pyproject.toml
@@ -0,0 +1,84 @@
+[build-system]
+requires = ["hatchling"]
+build-backend = "hatchling.build"
+
+[project]
+name = "databricks-codex"
+version = "0.1.0"
+description = "Databricks integration for OpenAI Codex CLI"
+readme = "README.md"
+license = "Apache-2.0"
+requires-python = ">=3.9"
+authors = [
+    { name = "Databricks", email = "solutions@databricks.com" }
+]
+classifiers = [
+    "Development Status :: 3 - Alpha",
+    "Intended Audience :: Developers",
+    "License :: OSI Approved :: Apache Software License",
+    "Programming Language :: Python :: 3",
+    "Programming Language :: Python :: 3.9",
+    "Programming Language :: Python :: 3.10",
+    "Programming Language :: Python :: 3.11",
+    "Programming Language :: Python :: 3.12",
+]
+keywords = ["databricks", "codex", "openai", "mcp", "ai"]
+
+dependencies = [
+    "pydantic>=2.0.0",
+    "tomli>=2.0.0;python_version<'3.11'",
+    "tomli-w>=1.0.0",
+]
+
+[project.optional-dependencies]
+databricks = [
+    "databricks-sdk>=0.12.0",
+]
+http = [
+    "httpx>=0.25.0",
+]
+all = [
+    "databricks-sdk>=0.12.0",
+    "httpx>=0.25.0",
+]
+dev = [
+    "pytest>=8.0",
+    "pytest-asyncio>=0.23",
+    "pytest-timeout>=2.0.0",
+    "pytest-cov>=4.0.0",
+    "ruff>=0.1.0",
+    "mypy>=1.0.0",
+]
+
+[project.urls]
+Homepage = "https://github.com/databricks-solutions/ai-dev-kit"
+Documentation = "https://github.com/databricks-solutions/ai-dev-kit/tree/main/databricks-codex"
+Repository = "https://github.com/databricks-solutions/ai-dev-kit"
+
+[tool.hatch.build.targets.wheel]
+packages = ["databricks_codex"]
+
+[tool.pytest.ini_options]
+testpaths = ["tests"]
+pythonpath = ["."]
+asyncio_mode = "auto"
+markers = [
+    "integration: marks tests as integration tests (require Codex + Databricks)",
+]
+filterwarnings = [
+    "ignore::DeprecationWarning",
+]
+
+[tool.ruff]
+line-length = 100
+target-version = "py39"
+
+[tool.ruff.lint]
+select = ["E", "F", "I", "W"]
+ignore = ["E501"]
+
+[tool.mypy]
+python_version = "3.9"
+warn_return_any = true
+warn_unused_ignores = true
+disallow_untyped_defs = true
diff --git a/databricks-codex/scripts/install_codex_skills_and_mcp.sh b/databricks-codex/scripts/install_codex_skills_and_mcp.sh
new file mode 100755
index 00000000..415cbb25
--- /dev/null
+++ b/databricks-codex/scripts/install_codex_skills_and_mcp.sh
@@ -0,0 +1,191 @@
+#!/usr/bin/env bash
+set -euo pipefail
+
+SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+REPO_ROOT="$(cd "${SCRIPT_DIR}/../.." && pwd)"
+SKILLS_SRC_DIR="${REPO_ROOT}/databricks-skills"
+CODEX_HOME="${CODEX_HOME:-${HOME}/.codex}"
+CODEX_SKILLS_DIR="${CODEX_HOME}/skills"
+PROJECT_CODEX_CONFIG="${REPO_ROOT}/.codex/config.toml"
+PROFILE=""
+INSTALL_ALL=true
+SELECTED_SKILLS=()
+
+usage() {
+  cat <<EOF
+Install Databricks skills for Codex and configure Databricks MCP for this project.
+
+Usage:
+  $(basename "$0") [options] [skill1 skill2 ...]
+
+Options:
+  --profile <name>   Databricks profile for MCP server env (default: first databricks-cli profile)
+  --list             List installable skills and exit
+  -h, --help         Show help
+
+Examples:
+  $(basename "$0")
+  $(basename "$0") --profile my-profile
+  $(basename "$0") databricks-dbsql databricks-genie
+EOF
+}
+
+list_skills() {
+  find "${SKILLS_SRC_DIR}" -mindepth 1 -maxdepth 1 -type d \
+    ! -name "TEMPLATE" \
+    -exec test -f "{}/SKILL.md" ';' -print \
+    | xargs -n1 basename \
+    | sort
+}
+
+detect_first_databricks_profile() {
+  if ! command -v databricks >/dev/null 2>&1; then
+    return 1
+  fi
+
+  local first_profile
+  first_profile="$(
+    databricks auth profiles --output json 2>/dev/null | python3 -c '
+import json, sys
+try:
+    data = json.load(sys.stdin)
+    profiles = data.get("profiles") or []
+    if profiles and isinstance(profiles[0], dict):
+        name = profiles[0].get("name")
+        if name:
+            print(name)
+except Exception:
+    pass
+'
+  )"
+
+  if [[ -n "${first_profile}" ]]; then
+    printf "%s" "${first_profile}"
+    return 0
+  fi
+  return 1
+}
+
+toml_escape() {
+  local s="$1"
+  s="${s//\\/\\\\}"
+  s="${s//\"/\\\"}"
+  printf "%s" "${s}"
+}
+
+while [[ $# -gt 0 ]]; do
+  case "$1" in
+    --profile)
+      PROFILE="$2"
+      shift 2
+      ;;
+    --list)
+      list_skills
+      exit 0
+      ;;
+    -h|--help)
+      usage
+      exit 0
+      ;;
+    *)
+      INSTALL_ALL=false
+      SELECTED_SKILLS+=("$1")
+      shift
+      ;;
+  esac
+done
+
+if [[ ! -d "${SKILLS_SRC_DIR}" ]]; then
+  echo "Skills source not found: ${SKILLS_SRC_DIR}" >&2
+  exit 1
+fi
+
+mkdir -p "${CODEX_SKILLS_DIR}"
+
+if [[ "${INSTALL_ALL}" == "true" ]]; then
+  while IFS= read -r skill; do
+    SELECTED_SKILLS+=("$skill")
+  done < <(list_skills)
+fi
+
+if [[ ${#SELECTED_SKILLS[@]} -eq 0 ]]; then
+  echo "No skills selected." >&2
+  exit 1
+fi
+
+if [[ -z "${PROFILE}" ]]; then
+  if PROFILE="$(detect_first_databricks_profile)"; then
+    echo "Using first Databricks CLI profile: ${PROFILE}"
+  fi
+fi
+
+echo "Installing ${#SELECTED_SKILLS[@]} skill(s) into ${CODEX_SKILLS_DIR}"
+for skill in "${SELECTED_SKILLS[@]}"; do
+  src="${SKILLS_SRC_DIR}/${skill}"
+  dst="${CODEX_SKILLS_DIR}/${skill}"
+  if [[ ! -f "${src}/SKILL.md" ]]; then
+    echo "Skipping '${skill}' (SKILL.md not found in ${src})" >&2
+    continue
+  fi
+  rm -rf "${dst}"
+  cp -R "${src}" "${dst}"
+  echo "  - installed ${skill}"
+done
+
+mkdir -p "$(dirname "${PROJECT_CODEX_CONFIG}")"
+touch "${PROJECT_CODEX_CONFIG}"
+
+tmp_cfg="$(mktemp)"
+awk '
+BEGIN { skip = 0 }
+{
+  if ($0 ~ /^\[mcp_servers\.databricks\]$/ || $0 ~ /^\[mcp_servers\.databricks\.env\]$/) {
+    skip = 1
+    next
+  }
+  if (skip && $0 ~ /^\[/) {
+    skip = 0
+  }
+  if (!skip) {
+    print
+  }
+}
+' "${PROJECT_CODEX_CONFIG}" > "${tmp_cfg}"
+
+MCP_COMMAND="uv"
+MCP_ARGS_1="run"
+MCP_ARGS_2="--directory"
+MCP_ARGS_3="${REPO_ROOT}/databricks-mcp-server"
+MCP_ARGS_4="python"
+MCP_ARGS_5="run_server.py"
+
+if ! command -v uv >/dev/null 2>&1; then
+  MCP_COMMAND="${PYTHON:-python3}"
+  MCP_ARGS_1="${REPO_ROOT}/databricks-mcp-server/run_server.py"
+  MCP_ARGS_2=""
+  MCP_ARGS_3=""
+  MCP_ARGS_4=""
+  MCP_ARGS_5=""
+fi
+
+{
+  echo ""
+  echo "[mcp_servers.databricks]"
+  echo "command = \"$(toml_escape "${MCP_COMMAND}")\""
+  if [[ "${MCP_COMMAND}" == "uv" ]]; then
+    echo "args = [\"$(toml_escape "${MCP_ARGS_1}")\", \"$(toml_escape "${MCP_ARGS_2}")\", \"$(toml_escape "${MCP_ARGS_3}")\", \"$(toml_escape "${MCP_ARGS_4}")\", \"$(toml_escape "${MCP_ARGS_5}")\"]"
+  else
+    echo "args = [\"$(toml_escape "${MCP_ARGS_1}")\"]"
+  fi
+  echo ""
+  echo "[mcp_servers.databricks.env]"
+  if [[ -n "${PROFILE}" ]]; then
+    echo "DATABRICKS_CONFIG_PROFILE = \"$(toml_escape "${PROFILE}")\""
+  fi
+} >> "${tmp_cfg}"
+
+mv "${tmp_cfg}" "${PROJECT_CODEX_CONFIG}"
+
+echo ""
+echo "Updated MCP config: ${PROJECT_CODEX_CONFIG}"
+echo "Restart Codex to pick up newly installed skills and MCP config."
diff --git a/databricks-codex/tests/__init__.py b/databricks-codex/tests/__init__.py
new file mode 100644
index 00000000..2bae3393
--- /dev/null
+++ b/databricks-codex/tests/__init__.py
@@ -0,0 +1 @@
+"""Tests for databricks-codex package."""
diff --git a/databricks-codex/tests/conftest.py b/databricks-codex/tests/conftest.py
new file mode 100644
index 00000000..664f3039
--- /dev/null
+++ b/databricks-codex/tests/conftest.py
@@ -0,0 +1,147 @@
+"""Shared test fixtures for databricks-codex tests."""
+
+import os
+import tempfile
+from pathlib import Path
+from unittest.mock import MagicMock, AsyncMock, patch
+
+import pytest
+
+from databricks_codex.config import CodexConfigManager, CodexConfig, MCPServerConfig
+from databricks_codex.executor import CodexExecutor
+from databricks_codex.auth import CodexAuthStatus
+from databricks_codex.models import CodexAuthMethod, ExecutionStatus, ExecutionResult
+
+
+@pytest.fixture
+def temp_config_dir():
+    """Create a temporary directory for config files."""
+    with tempfile.TemporaryDirectory() as tmpdir:
+        yield Path(tmpdir)
+
+
+@pytest.fixture
+def mock_config_manager(temp_config_dir):
+    """Config manager with temporary directory."""
+    config_path = temp_config_dir / "config.toml"
+    return CodexConfigManager(config_path=config_path)
+
+
+@pytest.fixture
+def sample_config():
+    """Sample configuration for testing."""
+    return CodexConfig(
+        mcp_servers={
+            "databricks": MCPServerConfig(
+                command="/usr/bin/python",
+                args=["-m", "databricks_mcp_server"],
+                env={"DATABRICKS_CONFIG_PROFILE": "DEFAULT"},
+            )
+        }
+    )
+
+
+@pytest.fixture
+def empty_config():
+    """Empty configuration for testing."""
+    return CodexConfig()
+
+
+@pytest.fixture
+def mock_executor():
+    """Mock executor for unit tests."""
+    executor = MagicMock(spec=CodexExecutor)
+    executor.exec_sync = MagicMock(
+        return_value=ExecutionResult(
+            status=ExecutionStatus.COMPLETED,
+            stdout="Success",
+            exit_code=0,
+        )
+    )
+    executor.exec_async = AsyncMock(
+        return_value=ExecutionResult(
+            status=ExecutionStatus.COMPLETED,
+            stdout="Success",
+            exit_code=0,
+        )
+    )
+    return executor
+
+
+@pytest.fixture
+def mock_subprocess_run():
+    """Mock subprocess.run for testing CLI calls."""
+    with patch("subprocess.run") as mock_run:
+        mock_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Success",
+            stderr="",
+        )
+        yield mock_run
+
+
+@pytest.fixture
+def mock_subprocess_run_not_found():
+    """Mock subprocess.run that raises FileNotFoundError."""
+    with patch("subprocess.run") as mock_run:
+        mock_run.side_effect = FileNotFoundError("codex not found")
+        yield mock_run
+
+
+@pytest.fixture
+def mock_subprocess_run_timeout():
+    """Mock subprocess.run that times out."""
+    import subprocess
+    with patch("subprocess.run") as mock_run:
+        mock_run.side_effect = subprocess.TimeoutExpired(cmd="codex", timeout=30)
+        yield mock_run
+
+
+@pytest.fixture
+def authenticated_status():
+    """Mock authenticated status."""
+    return CodexAuthStatus(
+        method=CodexAuthMethod.CHATGPT_OAUTH,
+        is_authenticated=True,
+        username="test@example.com",
+    )
+
+
+@pytest.fixture
+def unauthenticated_status():
+    """Mock unauthenticated status."""
+    return CodexAuthStatus(
+        method=CodexAuthMethod.NONE,
+        is_authenticated=False,
+        error="Not logged in",
+    )
+
+
+@pytest.fixture
+def mock_databricks_env():
+    """Mock Databricks environment variables."""
+    env_vars = {
+        "DATABRICKS_HOST": "https://test.cloud.databricks.com",
+        "DATABRICKS_TOKEN": "dapi_test_token",
+    }
+    with patch.dict(os.environ, env_vars):
+        yield env_vars
+
+
+@pytest.fixture
+def clean_env():
+    """Clean environment without Databricks variables."""
+    env_to_remove = ["DATABRICKS_HOST", "DATABRICKS_TOKEN", "DATABRICKS_CONFIG_PROFILE"]
+    original_env = {k: os.environ.get(k) for k in env_to_remove}
+
+    for key in env_to_remove:
+        os.environ.pop(key, None)
+
+    yield
+
+    # Restore original environment
+    for key, value in original_env.items():
+        if value is not None:
+            os.environ[key] = value
+        else:
+            os.environ.pop(key, None)
diff --git a/databricks-codex/tests/integration/__init__.py b/databricks-codex/tests/integration/__init__.py
new file mode 100644
index 00000000..0dcba393
--- /dev/null
+++ b/databricks-codex/tests/integration/__init__.py
@@ -0,0 +1,6 @@
+"""Integration tests for databricks-codex.
+
+These tests require:
+- Codex CLI installed and authenticated
+- Databricks connection configured
+"""
diff --git a/databricks-codex/tests/integration/conftest.py b/databricks-codex/tests/integration/conftest.py
new file mode 100644
index 00000000..6738f33a
--- /dev/null
+++ b/databricks-codex/tests/integration/conftest.py
@@ -0,0 +1,112 @@
+"""Integration test fixtures.
+
+These tests require:
+- Codex CLI installed and authenticated
+- Databricks connection configured
+"""
+
+import logging
+import os
+
+import pytest
+
+logger = logging.getLogger(__name__)
+
+
+def pytest_configure(config):
+    """Register integration markers."""
+    config.addinivalue_line(
+        "markers", "integration: marks tests as integration tests (require Codex + Databricks)"
+    )
+    config.addinivalue_line(
+        "markers", "databricks_codex_itest: databricks-codex-itest label for asset-creation tests"
+    )
+
+
+@pytest.fixture(scope="session")
+def codex_installed():
+    """Verify Codex CLI is installed."""
+    import subprocess
+
+    try:
+        result = subprocess.run(
+            ["codex", "--version"],
+            capture_output=True,
+            text=True,
+            timeout=10,
+        )
+        if result.returncode != 0:
+            pytest.skip("Codex CLI not working properly")
+        logger.info(f"Codex CLI version: {result.stdout.strip()}")
+        return True
+    except FileNotFoundError:
+        pytest.skip("Codex CLI not installed")
+    except subprocess.TimeoutExpired:
+        pytest.skip("Codex CLI timed out")
+
+
+@pytest.fixture(scope="session")
+def codex_authenticated(codex_installed):
+    """Verify Codex is authenticated."""
+    from databricks_codex.auth import check_codex_auth
+
+    status = check_codex_auth()
+    if not status.is_authenticated:
+        pytest.skip(f"Codex not authenticated: {status.error}")
+    return status
+
+
+@pytest.fixture(scope="session")
+def databricks_connected():
+    """Verify Databricks connection from shell environment variables only."""
+    host = os.environ.get("DATABRICKS_HOST")
+    token = os.environ.get("DATABRICKS_TOKEN")
+
+    if not host or not token:
+        pytest.skip("Set DATABRICKS_HOST and DATABRICKS_TOKEN in shell environment")
+
+    try:
+        from databricks.sdk import WorkspaceClient
+        from databricks.sdk import config as sdk_config
+
+        # Keep fixture-level connectivity checks fast and explicit.
+        cfg = sdk_config.Config(
+            host=host,
+            token=token,
+            http_timeout_seconds=10,
+            retry_timeout_seconds=20,
+        )
+        client = WorkspaceClient(config=cfg)
+        user = client.current_user.me()
+        logger.info(f"Databricks connected via environment as: {user.user_name}")
+        return {"host": client.config.host, "token": token, "client": client}
+    except ImportError:
+        pytest.skip("databricks-sdk not installed")
+    except Exception as e:
+        pytest.skip(f"Databricks env configured but not reachable: {e}")
+
+
+@pytest.fixture(scope="session")
+def executor(codex_authenticated, databricks_connected):
+    """Create executor with both Codex and Databricks configured."""
+    from databricks_codex.executor import CodexExecutor
+
+    return CodexExecutor()
+
+
+@pytest.fixture(scope="function")
+def cleanup_operations(executor):
+    """Cleanup operations created during tests."""
+    created_ops = []
+
+    def register(op_id: str):
+        created_ops.append(op_id)
+
+    yield register
+
+    for op_id in created_ops:
+        try:
+            executor.clear_operation(op_id)
+            logger.info(f"Cleaned up operation: {op_id}")
+        except Exception as e:
+            logger.warning(f"Failed to cleanup operation {op_id}: {e}")
diff --git a/databricks-codex/tests/integration/test_codex_exec.py b/databricks-codex/tests/integration/test_codex_exec.py
new file mode 100644
index 00000000..69ba079f
--- /dev/null
+++ b/databricks-codex/tests/integration/test_codex_exec.py
@@ -0,0 +1,141 @@
+"""Integration tests for Codex executor.
+
+These tests require Codex CLI to be installed and authenticated.
+"""
+
+import pytest
+
+from databricks_codex.executor import CodexExecutor
+from databricks_codex.models import CodexExecOptions, ExecutionStatus, SandboxMode
+
+
+@pytest.mark.integration
+class TestCodexExecutorIntegration:
+    """Integration tests requiring live Codex."""
+
+    def test_exec_simple_prompt(self, executor):
+        """Execute a simple prompt through Codex."""
+        options = CodexExecOptions(
+            prompt="Echo 'hello world' to the console",
+            sandbox_mode=SandboxMode.READ_ONLY,
+            timeout=60,
+        )
+
+        result = executor.exec_sync(options)
+
+        # Should complete (may succeed or fail depending on Codex state)
+        assert result.status in [
+            ExecutionStatus.COMPLETED,
+            ExecutionStatus.FAILED,
+            ExecutionStatus.TIMEOUT,
+        ]
+        assert result.elapsed_seconds >= 0
+
+    def test_exec_with_databricks_context(self, executor, databricks_connected):
+        """Execute with Databricks environment injected."""
+        options = CodexExecOptions(
+            prompt="Print the value of DATABRICKS_HOST environment variable",
+            sandbox_mode=SandboxMode.READ_ONLY,
+            inject_databricks_env=True,
+            timeout=60,
+        )
+
+        result = executor.exec_sync(options)
+
+        # Verify execution attempted
+        assert result.status in [
+            ExecutionStatus.COMPLETED,
+            ExecutionStatus.FAILED,
+            ExecutionStatus.TIMEOUT,
+        ]
+
+    def test_exec_sandbox_modes(self, executor):
+        """Test different sandbox modes."""
+        for mode in [SandboxMode.READ_ONLY, SandboxMode.WORKSPACE_WRITE]:
+            options = CodexExecOptions(
+                prompt="List files in current directory",
+                sandbox_mode=mode,
+                timeout=30,
+            )
+
+            result = executor.exec_sync(options)
+
+            # Should at least attempt execution
+            assert result.status in [
+                ExecutionStatus.COMPLETED,
+                ExecutionStatus.FAILED,
+                ExecutionStatus.TIMEOUT,
+            ]
+
+    @pytest.mark.asyncio
+    async def test_exec_async_short(self, executor):
+        """Async execution for short operations."""
+        options = CodexExecOptions(
+            prompt="Echo hello",
+            timeout=30,
+        )
+
+        result = await executor.exec_async(options)
+
+        assert result.status in [
+            ExecutionStatus.COMPLETED,
+            ExecutionStatus.FAILED,
+            ExecutionStatus.TIMEOUT,
+            ExecutionStatus.RUNNING,  # If handed off
+        ]
+
+
+@pytest.mark.integration
+class TestCodexExecutorTimeout:
+    """Tests for timeout behavior."""
+
+    def test_short_timeout(self, executor):
+        """Test that short timeout is respected."""
+        options = CodexExecOptions(
+            prompt="This is a test prompt that should timeout",
+            timeout=1,  # Very short timeout
+        )
+
+        result = executor.exec_sync(options)
+
+        # Should timeout or complete quickly
+        assert result.status in [ExecutionStatus.TIMEOUT, ExecutionStatus.COMPLETED]
+
+
+@pytest.mark.integration
+class TestCodexExecutorEnvironment:
+    """Tests for environment handling."""
+
+    def test_custom_env_vars(self, executor):
+        """Test custom environment variables are passed."""
+        options = CodexExecOptions(
+            prompt="Print MY_CUSTOM_VAR",
+            env_vars={"MY_CUSTOM_VAR": "test_value_12345"},
+            timeout=30,
+        )
+
+        result = executor.exec_sync(options)
+
+        # Execution should complete
+        assert result.status in [
+            ExecutionStatus.COMPLETED,
+            ExecutionStatus.FAILED,
+            ExecutionStatus.TIMEOUT,
+        ]
+
+    def test_databricks_profile(self, executor, databricks_connected):
+        """Test Databricks profile is respected."""
+        options = CodexExecOptions(
+            prompt="Show Databricks config profile",
+            databricks_profile="DEFAULT",
+            inject_databricks_env=True,
+            timeout=30,
+        )
+
+        result = executor.exec_sync(options)
+
+        assert result.status in [
+            ExecutionStatus.COMPLETED,
+            ExecutionStatus.FAILED,
+            ExecutionStatus.TIMEOUT,
+        ]
diff --git a/databricks-codex/tests/integration/test_databricks_assets.py b/databricks-codex/tests/integration/test_databricks_assets.py
new file mode 100644
index 00000000..79407907
--- /dev/null
+++ b/databricks-codex/tests/integration/test_databricks_assets.py
@@ -0,0 +1,376 @@
+"""Integration tests that create Databricks assets.
+
+These tests are labeled databricks-codex-itest via the marker below.
+"""
+
+import base64
+import logging
+import os
+import time
+import uuid
+
+import pytest
+
+from databricks.sdk import WorkspaceClient
+from databricks.sdk import config as sdk_config
+from databricks.sdk.service.compute import Environment
+from databricks.sdk.service.dashboards import Dashboard
+from databricks.sdk.service.jobs import JobEnvironment, NotebookTask, Source, Task
+from databricks.sdk.service.pipelines import NotebookLibrary, PipelineLibrary
+from databricks.sdk.service.workspace import ImportFormat, Language
+
+LABEL = "databricks-codex-itest"
+CATALOG = "samples"
+SCHEMA = "bakehouse"
+PIPELINE_CATALOG = "main"
+PIPELINE_SCHEMA = "default"
+DEFAULT_HTTP_TIMEOUT_SECONDS = 30
+DEFAULT_RETRY_TIMEOUT_SECONDS = 120
+
+logger = logging.getLogger(__name__)
+
+
+def _unique_name(prefix: str) -> str:
+    return f"{prefix}-{uuid.uuid4().hex[:8]}"
+
+
+def _keep_assets() -> bool:
+    return os.environ.get("KEEP_ITEST_ASSETS", "").strip().lower() in {"1", "true", "yes"}
+
+
+def _get_client(databricks_connected):
+    if isinstance(databricks_connected, dict) and "client" in databricks_connected:
+        return databricks_connected["client"]
+
+    if "host" not in databricks_connected or "token" not in databricks_connected:
+        raise ValueError("databricks_connected must include either client or host/token")
+
+    cfg = sdk_config.Config(
+        host=databricks_connected["host"],
+        token=databricks_connected["token"],
+        http_timeout_seconds=DEFAULT_HTTP_TIMEOUT_SECONDS,
+        retry_timeout_seconds=DEFAULT_RETRY_TIMEOUT_SECONDS,
+    )
+    return WorkspaceClient(config=cfg)
+
+
+def _pick_warehouse(client):
+    for wh in client.warehouses.list():
+        wh_id = getattr(wh, "id", None) or getattr(wh, "warehouse_id", None)
+        if wh_id:
+            return wh
+    return None
+
+
+def _pick_cluster(client):
+    running = []
+    any_cluster = []
+    for cl in client.clusters.list():
+        any_cluster.append(cl)
+        state = getattr(cl, "state", None)
+        state_val = getattr(state, "value", state)
+        if state_val in {"RUNNING", "RESIZING"}:
+            running.append(cl)
+    if running:
+        return running[0]
+    if any_cluster:
+        return any_cluster[0]
+    return None
+
+
+def _wait_statement(client, statement_id: str, timeout_seconds: int = 30):
+    deadline = time.time() + timeout_seconds
+    last = None
+    while time.time() < deadline:
+        last = client.statement_execution.get_statement(statement_id)
+        status = getattr(last, "status", None)
+        state = getattr(status, "state", None)
+        state_value = getattr(state, "value", state)
+        if state_value in {"SUCCEEDED", "FAILED", "CANCELED", "CLOSED", "ERROR"}:
+            return last
+        time.sleep(1)
+    return last
+
+
+def _run_sql_statement(client, warehouse_id: str, sql: str, timeout_seconds: int = 60):
+    resp = client.statement_execution.execute_statement(
+        sql,
+        warehouse_id=warehouse_id,
+        wait_timeout="20s",
+    )
+    stmt = _wait_statement(client, resp.statement_id, timeout_seconds=timeout_seconds)
+    state = getattr(getattr(stmt, "status", None), "state", None)
+    state_value = getattr(state, "value", state)
+    return stmt, state_value
+
+
+@pytest.mark.integration
+@pytest.mark.databricks_codex_itest
+@pytest.mark.timeout(180)
+def test_run_simple_sql_script(databricks_connected):
+    """Run a simple SQL script on a SQL warehouse."""
+    logger.info("Starting SQL execution test using catalog=%s schema=%s", CATALOG, SCHEMA)
+    client = _get_client(databricks_connected)
+    warehouse = _pick_warehouse(client)
+    if not warehouse:
+        pytest.skip("No SQL warehouse available for statement execution")
+
+    warehouse_id = getattr(warehouse, "id", None) or getattr(warehouse, "warehouse_id", None)
+    resp = client.statement_execution.execute_statement(
+        "SELECT 1 AS one",
+        warehouse_id=warehouse_id,
+        catalog=CATALOG,
+        schema=SCHEMA,
+        wait_timeout="10s",
+    )
+    stmt = _wait_statement(client, resp.statement_id, timeout_seconds=30)
+    state = getattr(getattr(stmt, "status", None), "state", None)
+    state_value = getattr(state, "value", state)
+    assert state_value == "SUCCEEDED"
+
+
+@pytest.mark.integration
+@pytest.mark.databricks_codex_itest
+@pytest.mark.timeout(180)
+def test_create_databricks_job(databricks_connected):
+    """Create a Databricks job on serverless workflows compute."""
+    logger.info("Starting serverless job creation test")
+    client = _get_client(databricks_connected)
+    job_id = None
+    notebook_path = None
+    try:
+        user = client.current_user.me()
+        home_dir = f"/Users/{user.user_name}"
+        notebook_path = f"{home_dir}/{_unique_name('codex_job_notebook')}"
+        notebook_content = base64.b64encode(
+            b'print("hello from serverless job")\n'
+        ).decode("ascii")
+        client.workspace.import_(
+            path=notebook_path,
+            format=ImportFormat.SOURCE,
+            language=Language.PYTHON,
+            content=notebook_content,
+            overwrite=True,
+        )
+        task = Task(
+            task_key="codex_job_task",
+            notebook_task=NotebookTask(
+                notebook_path=notebook_path,
+                source=Source.WORKSPACE,
+            ),
+            environment_key="default",
+        )
+        env = JobEnvironment(
+            environment_key="default",
+            spec=Environment(environment_version="2"),
+        )
+        resp = client.jobs.create(
+            name=_unique_name("codex-itest-job"),
+            tasks=[task],
+            environments=[env],
+        )
+        job_id = resp.job_id
+        assert job_id is not None
+    except Exception as exc:
+        pytest.skip(f"Serverless job creation not available: {exc}")
+    finally:
+        if job_id is not None:
+            try:
+                client.jobs.delete(job_id=job_id)
+            except Exception:
+                pass
+        if notebook_path is not None:
+            try:
+                client.workspace.delete(path=notebook_path, recursive=False)
+            except Exception:
+                pass
+
+
+@pytest.mark.integration
+@pytest.mark.databricks_codex_itest
+@pytest.mark.timeout(180)
+def test_create_pipeline(databricks_connected):
+    """Create a simple DLT pipeline definition."""
+    logger.info(
+        "Starting pipeline creation test using catalog=%s schema=%s",
+        PIPELINE_CATALOG,
+        PIPELINE_SCHEMA,
+    )
+    client = _get_client(databricks_connected)
+    pipeline_id = None
+    user = client.current_user.me()
+    base_dir = f"/Users/{user.user_name}/{_unique_name('codex')}"
+    notebook_path = f"{base_dir}/codex_itest_pipeline"
+    try:
+        notebook_content = base64.b64encode(b"# DLT pipeline placeholder\n").decode("ascii")
+        client.workspace.mkdirs(base_dir)
+        client.workspace.import_(
+            path=notebook_path,
+            format=ImportFormat.SOURCE,
+            language=Language.PYTHON,
+            content=notebook_content,
+            overwrite=True,
+        )
+        library = PipelineLibrary(notebook=NotebookLibrary(path=notebook_path))
+        resp = client.pipelines.create(
+            name=_unique_name("codex-itest-pipeline"),
+            development=True,
+            serverless=True,
+            catalog=PIPELINE_CATALOG,
+            schema=PIPELINE_SCHEMA,
+            libraries=[library],
+        )
+        pipeline_id = resp.pipeline_id
+        assert pipeline_id is not None
+    except Exception as exc:
+        pytest.skip(f"Pipeline creation not available: {exc}")
+    finally:
+        if pipeline_id is not None:
+            try:
+                client.pipelines.delete(pipeline_id=pipeline_id)
+            except Exception:
+                pass
+        try:
+            client.workspace.delete(path=notebook_path, recursive=False)
+        except Exception:
+            pass
+        try:
+            client.workspace.delete(path=base_dir, recursive=True)
+        except Exception:
+            pass
+
+
+@pytest.mark.integration
+@pytest.mark.databricks_codex_itest
+@pytest.mark.timeout(180)
+def test_create_catalog(databricks_connected):
+    """Create and delete a Unity Catalog catalog."""
+    logger.info("Starting catalog creation test")
+    client = _get_client(databricks_connected)
+    warehouse = _pick_warehouse(client)
+    if not warehouse:
+        pytest.skip("No SQL warehouse available for catalog creation")
+
+    warehouse_id = getattr(warehouse, "id", None) or getattr(warehouse, "warehouse_id", None)
+    # Keep identifier SQL-safe without quoting.
+    name = _unique_name("codex_itest_catalog").replace("-", "_")
+    try:
+        _, create_state = _run_sql_statement(
+            client,
+            warehouse_id=warehouse_id,
+            sql=f"CREATE CATALOG {name} COMMENT 'codex integration test catalog'",
+            timeout_seconds=90,
+        )
+        assert create_state == "SUCCEEDED"
+    except Exception as exc:
+        pytest.skip(f"Catalog creation not available: {exc}")
+    finally:
+        try:
+            _run_sql_statement(
+                client,
+                warehouse_id=warehouse_id,
+                sql=f"DROP CATALOG IF EXISTS {name} CASCADE",
+                timeout_seconds=90,
+            )
+        except Exception:
+            pass
+
+
+@pytest.mark.integration
+@pytest.mark.databricks_codex_itest
+@pytest.mark.timeout(180)
+def test_create_genie_space(databricks_connected):
+    """Create a Genie room by cloning a template space."""
+    logger.info("Starting Genie space creation test")
+    client = _get_client(databricks_connected)
+    try:
+        spaces_response = client.genie.list_spaces()
+        spaces = getattr(spaces_response, "spaces", None)
+        if spaces is None and hasattr(spaces_response, "__iter__"):
+            spaces = list(spaces_response)
+        if spaces is None:
+            spaces = []
+    except Exception as exc:
+        pytest.skip(f"Genie not available: {exc}")
+    if not spaces:
+        pytest.skip("No Genie spaces available to use as a template")
+
+    template = None
+    for space in spaces:
+        try:
+            candidate = client.genie.get_space(space.space_id, include_serialized_space=True)
+        except Exception:
+            continue
+        warehouse_id = getattr(candidate, "warehouse_id", None)
+        serialized_space = getattr(candidate, "serialized_space", None)
+        if warehouse_id and serialized_space:
+            template = candidate
+            break
+    if template is None:
+        pytest.skip("No accessible Genie template space with warehouse_id + serialized_space")
+
+    warehouse_id = template.warehouse_id
+    serialized_space = template.serialized_space
+
+    new_space = None
+    try:
+        new_space = client.genie.create_space(
+            warehouse_id=warehouse_id,
+            serialized_space=serialized_space,
+            title=_unique_name("codex-itest-genie"),
+            description="codex integration test space",
+        )
+        assert new_space.space_id
+    except Exception as exc:
+        pytest.skip(f"Genie space creation not available: {exc}")
+    finally:
+        if new_space is not None:
+            try:
+                client.genie.trash_space(space_id=new_space.space_id)
+            except Exception:
+                pass
+
+
+@pytest.mark.integration
+@pytest.mark.databricks_codex_itest
+@pytest.mark.timeout(180)
+def test_create_ai_bi_dashboard(databricks_connected):
+    """Create an AI/BI (Lakeview) dashboard by cloning a template dashboard."""
+    logger.info("Starting Lakeview dashboard creation test using catalog=%s schema=%s", CATALOG, SCHEMA)
+    client = _get_client(databricks_connected)
+    try:
+        dashboards = list(client.lakeview.list())
+    except Exception as exc:
+        pytest.skip(f"Lakeview not available: {exc}")
+    if not dashboards:
+        pytest.skip("No Lakeview dashboards available to use as a template")
+
+    template = client.lakeview.get(dashboards[0].dashboard_id)
+    serialized = getattr(template, "serialized_dashboard", None)
+    if not serialized:
+        pytest.skip("Template dashboard missing serialized_dashboard")
+
+    created = None
+    try:
+        created = client.lakeview.create(
+            dashboard=Dashboard(
+                display_name=_unique_name("codex-itest-dashboard"),
+                serialized_dashboard=serialized,
+                warehouse_id=getattr(template, "warehouse_id", None),
+            ),
+            dataset_catalog=CATALOG,
+            dataset_schema=SCHEMA,
+        )
+        assert created.dashboard_id
+        host = str(getattr(client.config, "host", "")).rstrip("/")
+        dashboard_url = f"{host}/dashboardsv3/{created.dashboard_id}" if host else created.dashboard_id
+        print(f"Created dashboard ID: {created.dashboard_id}")
+        print(f"Dashboard URL: {dashboard_url}")
+    except Exception as exc:
+        pytest.skip(f"Lakeview dashboard creation not available: {exc}")
+    finally:
+        if created is not None and not _keep_assets():
+            try:
+                client.lakeview.trash(dashboard_id=created.dashboard_id)
+            except Exception:
+                pass
diff --git a/databricks-codex/tests/integration/test_install_and_dashboard_prompt.py b/databricks-codex/tests/integration/test_install_and_dashboard_prompt.py
new file mode 100644
index 00000000..2beb4384
--- /dev/null
+++ b/databricks-codex/tests/integration/test_install_and_dashboard_prompt.py
@@ -0,0 +1,162 @@
+"""Integration tests for Codex skill/MCP installer and dashboard prompting."""
+
+import os
+import shutil
+import subprocess
+import json
+from pathlib import Path
+
+import pytest
+
+from databricks_codex.models import CodexExecOptions, ExecutionStatus, SandboxMode
+
+try:
+    import tomllib  # type: ignore[attr-defined]
+except ModuleNotFoundError:  # pragma: no cover - py39 fallback
+    import tomli as tomllib  # type: ignore[no-redef]
+
+
+def _repo_root() -> Path:
+    return Path(__file__).resolve().parents[3]
+
+
+def _installer_script() -> Path:
+    return _repo_root() / "databricks-codex" / "scripts" / "install_codex_skills_and_mcp.sh"
+
+
+def _project_codex_config() -> Path:
+    return _repo_root() / ".codex" / "config.toml"
+
+
+def _first_databricks_profile() -> str:
+    result = subprocess.run(
+        ["databricks", "auth", "profiles", "--output", "json"],
+        capture_output=True,
+        text=True,
+        timeout=20,
+        check=False,
+    )
+    if result.returncode != 0:
+        pytest.skip(f"Unable to read Databricks profiles: {result.stderr.strip()}")
+    data = json.loads(result.stdout)
+    profiles = data.get("profiles") or []
+    if not profiles:
+        pytest.skip("No Databricks CLI profiles configured")
+    profile_name = profiles[0].get("name")
+    if not profile_name:
+        pytest.skip("First Databricks profile has no name")
+    return profile_name
+
+
+def _run_installer(codex_home: Path, args: list[str]) -> subprocess.CompletedProcess[str]:
+    env = os.environ.copy()
+    env["CODEX_HOME"] = str(codex_home)
+    return subprocess.run(
+        ["bash", str(_installer_script()), *args],
+        cwd=str(_repo_root()),
+        capture_output=True,
+        text=True,
+        timeout=120,
+        env=env,
+        check=False,
+    )
+
+
+def _seed_codex_auth(codex_home: Path) -> None:
+    """Copy local Codex auth artifacts into isolated CODEX_HOME for integration tests."""
+    src_home = Path.home() / ".codex"
+    if not src_home.exists():
+        pytest.skip("Local ~/.codex not found; cannot seed Codex auth for isolated CODEX_HOME")
+
+    codex_home.mkdir(parents=True, exist_ok=True)
+    copied = 0
+    for filename in ("auth.json", "credentials.json", "oauth.json", "config.toml"):
+        src = src_home / filename
+        dst = codex_home / filename
+        if src.exists():
+            shutil.copy2(src, dst)
+            copied += 1
+
+    if copied == 0:
+        pytest.skip("No Codex auth artifacts found in ~/.codex to seed isolated CODEX_HOME")
+
+
+@pytest.mark.integration
+def test_install_script_uses_first_databricks_profile_by_default(tmp_path):
+    """Installer should use the first Databricks CLI profile when --profile is omitted."""
+    config_path = _project_codex_config()
+    original_config = config_path.read_bytes() if config_path.exists() else None
+    codex_home = tmp_path / "codex-home"
+    first_profile = _first_databricks_profile()
+
+    try:
+        _seed_codex_auth(codex_home)
+        result = _run_installer(codex_home, ["databricks-dbsql", "aibi-dashboards"])
+        assert result.returncode == 0, result.stderr or result.stdout
+
+        skill_a = codex_home / "skills" / "databricks-dbsql" / "SKILL.md"
+        skill_b = codex_home / "skills" / "aibi-dashboards" / "SKILL.md"
+        assert skill_a.exists(), "databricks-dbsql skill was not installed"
+        assert skill_b.exists(), "aibi-dashboards skill was not installed"
+
+        config_data = tomllib.loads(config_path.read_text())
+        server = config_data["mcp_servers"]["databricks"]
+        assert server["command"] in {"uv", "python3", "python"}
+        assert server.get("args")
+        assert server.get("env", {}).get("DATABRICKS_CONFIG_PROFILE") == first_profile
+    finally:
+        if original_config is None:
+            config_path.unlink(missing_ok=True)
+        else:
+            config_path.write_bytes(original_config)
+        shutil.rmtree(codex_home, ignore_errors=True)
+
+
+@pytest.mark.integration
+@pytest.mark.timeout(240)
+def test_codex_prompt_generates_dashboard_sql_after_install(executor, tmp_path):
+    """Install skills/MCP, then send a Codex prompt to generate dashboard SQL."""
+    config_path = _project_codex_config()
+    original_config = config_path.read_bytes() if config_path.exists() else None
+    codex_home = tmp_path / "codex-home"
+    output_file = _repo_root() / "databricks-codex" / "tests" / ".tmp_dashboard_queries.sql"
+    first_profile = _first_databricks_profile()
+
+    try:
+        _seed_codex_auth(codex_home)
+        install_result = _run_installer(codex_home, ["aibi-dashboards", "databricks-dbsql"])
+        assert install_result.returncode == 0, install_result.stderr or install_result.stdout
+
+        output_file.parent.mkdir(parents=True, exist_ok=True)
+        output_file.unlink(missing_ok=True)
+
+        prompt = (
+            "Create a file at databricks-codex/tests/.tmp_dashboard_queries.sql with exactly three "
+            "Databricks SQL queries for an AI/BI dashboard over users.rlgarris.loan_data_2: "
+            "(1) total loan applications and average loan amount, "
+            "(2) loan_status distribution, "
+            "(3) top 10 occupations by average applicant_income. "
+            "Output only the file creation action."
+        )
+        options = CodexExecOptions(
+            prompt=prompt,
+            sandbox_mode=SandboxMode.WORKSPACE_WRITE,
+            timeout=180,
+            working_dir=str(_repo_root()),
+            env_vars={"CODEX_HOME": str(codex_home)},
+            inject_databricks_env=True,
+            databricks_profile=first_profile,
+        )
+        result = executor.exec_sync(options)
+        assert result.status == ExecutionStatus.COMPLETED, result.stderr or result.stdout
+        assert output_file.exists(), "Codex did not create dashboard SQL file"
+        sql_text = output_file.read_text()
+        assert "users.rlgarris.loan_data_2" in sql_text
+        assert "loan_status" in sql_text
+    finally:
+        output_file.unlink(missing_ok=True)
+        if original_config is None:
+            config_path.unlink(missing_ok=True)
+        else:
+            config_path.write_bytes(original_config)
+        shutil.rmtree(codex_home, ignore_errors=True)
diff --git a/databricks-codex/tests/test_auth.py b/databricks-codex/tests/test_auth.py
new file mode 100644
index 00000000..c031e637
--- /dev/null
+++ b/databricks-codex/tests/test_auth.py
@@ -0,0 +1,250 @@
+"""Tests for authentication utilities."""
+
+import subprocess
+from unittest.mock import patch, MagicMock
+
+import pytest
+
+from databricks_codex.auth import (
+    CodexAuthStatus,
+    check_codex_auth,
+    login_codex,
+    logout_codex,
+    get_combined_auth_context,
+    get_databricks_env,
+)
+from databricks_codex.models import CodexAuthMethod
+
+
+class TestCodexAuthStatus:
+    """Tests for CodexAuthStatus dataclass."""
+
+    def test_authenticated_status(self):
+        """Test authenticated status."""
+        status = CodexAuthStatus(
+            method=CodexAuthMethod.CHATGPT_OAUTH,
+            is_authenticated=True,
+            username="user@example.com",
+        )
+
+        assert status.is_authenticated is True
+        assert status.method == CodexAuthMethod.CHATGPT_OAUTH
+        assert status.username == "user@example.com"
+        assert status.error is None
+
+    def test_unauthenticated_status(self):
+        """Test unauthenticated status."""
+        status = CodexAuthStatus(
+            method=CodexAuthMethod.NONE,
+            is_authenticated=False,
+            error="Not logged in",
+        )
+
+        assert status.is_authenticated is False
+        assert status.error == "Not logged in"
+
+
+class TestCheckCodexAuth:
+    """Tests for check_codex_auth function."""
+
+    def test_codex_authenticated(self, mock_subprocess_run):
+        """Test when Codex is authenticated."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="codex 1.0.0",
+            stderr="",
+        )
+
+        status = check_codex_auth()
+
+        assert status.is_authenticated is True
+        mock_subprocess_run.assert_called_once()
+
+    def test_codex_not_installed(self, mock_subprocess_run_not_found):
+        """Test when Codex CLI is not installed."""
+        status = check_codex_auth()
+
+        assert status.is_authenticated is False
+        assert "not found" in status.error.lower()
+
+    def test_codex_timeout(self, mock_subprocess_run_timeout):
+        """Test when Codex CLI times out."""
+        status = check_codex_auth()
+
+        assert status.is_authenticated is False
+        assert "timed out" in status.error.lower()
+
+    def test_codex_error(self, mock_subprocess_run):
+        """Test when Codex returns error."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=1,
+            stdout="",
+            stderr="Authentication error",
+        )
+
+        status = check_codex_auth()
+
+        assert status.is_authenticated is False
+
+
+class TestLoginCodex:
+    """Tests for login_codex function."""
+
+    def test_login_oauth_success(self, mock_subprocess_run):
+        """Test successful OAuth login."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Logged in successfully",
+            stderr="",
+        )
+
+        status = login_codex(method=CodexAuthMethod.CHATGPT_OAUTH)
+
+        assert status.is_authenticated is True
+        # Should call login then version check
+        assert mock_subprocess_run.call_count >= 1
+
+    def test_login_device_code(self, mock_subprocess_run):
+        """Test device code login adds flag."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Success",
+            stderr="",
+        )
+
+        login_codex(method=CodexAuthMethod.DEVICE_CODE)
+
+        # Check that --device-auth flag was used
+        calls = mock_subprocess_run.call_args_list
+        login_call = calls[0]
+        cmd = login_call[0][0]  # First positional arg
+        assert "--device-auth" in cmd
+
+    def test_login_api_key_without_key(self, mock_subprocess_run):
+        """Test API key login without providing key."""
+        status = login_codex(method=CodexAuthMethod.API_KEY, api_key=None)
+
+        assert status.is_authenticated is False
+        assert "API key required" in status.error
+
+    def test_login_api_key_with_key(self, mock_subprocess_run):
+        """Test API key login with key provided."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Success",
+            stderr="",
+        )
+
+        status = login_codex(method=CodexAuthMethod.API_KEY, api_key="sk-test-key")
+
+        # Check that key was passed as input
+        calls = mock_subprocess_run.call_args_list
+        login_call = calls[0]
+        assert login_call[1].get("input") == "sk-test-key"
+
+    def test_login_failure(self, mock_subprocess_run):
+        """Test login failure."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=1,
+            stdout="",
+            stderr="Invalid credentials",
+        )
+
+        status = login_codex()
+
+        assert status.is_authenticated is False
+        assert "Invalid credentials" in status.error
+
+    def test_login_not_installed(self, mock_subprocess_run_not_found):
+        """Test login when Codex not installed."""
+        status = login_codex()
+
+        assert status.is_authenticated is False
+        assert "not found" in status.error.lower()
+
+
+class TestLogoutCodex:
+    """Tests for logout_codex function."""
+
+    def test_logout_success(self, mock_subprocess_run):
+        """Test successful logout."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Logged out",
+            stderr="",
+        )
+
+        result = logout_codex()
+
+        assert result is True
+
+    def test_logout_failure(self, mock_subprocess_run):
+        """Test logout failure."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=1,
+            stdout="",
+            stderr="Error",
+        )
+
+        result = logout_codex()
+
+        assert result is False
+
+    def test_logout_not_installed(self, mock_subprocess_run_not_found):
+        """Test logout when Codex not installed."""
+        result = logout_codex()
+
+        assert result is False
+
+
+class TestGetCombinedAuthContext:
+    """Tests for get_combined_auth_context function."""
+
+    def test_from_env_vars(self, mock_databricks_env):
+        """Test getting auth from environment variables."""
+        host, token = get_combined_auth_context()
+
+        assert host == "https://test.cloud.databricks.com"
+        assert token == "dapi_test_token"
+
+    def test_no_credentials(self, clean_env):
+        """Test when no credentials available."""
+        # Mock both SDK import paths to fail
+        with patch.dict("sys.modules", {"databricks_tools_core.auth": None, "databricks_tools_core": None}):
+            with patch.dict("sys.modules", {"databricks.sdk": None, "databricks": None}):
+                host, token = get_combined_auth_context()
+
+        # May return None if no credentials found
+        # This depends on whether SDK is installed and configured
+        # In test env without SDK, should return (None, None)
+
+    def test_env_vars_take_priority(self, mock_databricks_env):
+        """Test that env vars take priority over SDK."""
+        host, token = get_combined_auth_context()
+
+        # Should use env vars, not SDK
+        assert host == mock_databricks_env["DATABRICKS_HOST"]
+        assert token == mock_databricks_env["DATABRICKS_TOKEN"]
+
+
+class TestGetDatabricksEnv:
+    """Tests for get_databricks_env function."""
+
+    def test_with_credentials(self, mock_databricks_env):
+        """Test getting env with credentials."""
+        env = get_databricks_env()
+
+        assert "DATABRICKS_HOST" in env
+        assert "DATABRICKS_TOKEN" in env
+
+    def test_with_profile(self, mock_databricks_env):
+        """Test getting env with non-default profile."""
+        env = get_databricks_env(profile="PROD")
+
+        assert env.get("DATABRICKS_CONFIG_PROFILE") == "PROD"
+
+    def test_default_profile_not_added(self, mock_databricks_env):
+        """Test that DEFAULT profile doesn't add env var."""
+        env = get_databricks_env(profile="DEFAULT")
+
+        assert "DATABRICKS_CONFIG_PROFILE" not in env
diff --git a/databricks-codex/tests/test_config.py b/databricks-codex/tests/test_config.py
new file mode 100644
index 00000000..8d4fb98e
--- /dev/null
+++ b/databricks-codex/tests/test_config.py
@@ -0,0 +1,194 @@
+"""Tests for configuration management."""
+
+import pytest
+from pathlib import Path
+
+from databricks_codex.config import (
+    CodexConfigManager,
+    CodexConfig,
+    MCPServerConfig,
+)
+
+
+class TestMCPServerConfig:
+    """Tests for MCPServerConfig model."""
+
+    def test_minimal_config(self):
+        """Test minimal server config."""
+        config = MCPServerConfig(command="python")
+
+        assert config.command == "python"
+        assert config.args == []
+        assert config.env == {}
+
+    def test_full_config(self):
+        """Test full server config."""
+        config = MCPServerConfig(
+            command="/usr/bin/python3",
+            args=["-m", "databricks_mcp_server"],
+            env={"KEY": "value"},
+        )
+
+        assert config.command == "/usr/bin/python3"
+        assert config.args == ["-m", "databricks_mcp_server"]
+        assert config.env == {"KEY": "value"}
+
+
+class TestCodexConfig:
+    """Tests for CodexConfig model."""
+
+    def test_empty_config(self):
+        """Test empty configuration."""
+        config = CodexConfig()
+
+        assert config.mcp_servers == {}
+
+    def test_config_with_servers(self):
+        """Test configuration with MCP servers."""
+        config = CodexConfig(
+            mcp_servers={
+                "databricks": MCPServerConfig(command="python"),
+                "other": MCPServerConfig(command="node"),
+            }
+        )
+
+        assert len(config.mcp_servers) == 2
+        assert "databricks" in config.mcp_servers
+        assert "other" in config.mcp_servers
+
+
+class TestCodexConfigManager:
+    """Tests for CodexConfigManager."""
+
+    def test_read_nonexistent_returns_empty(self, mock_config_manager):
+        """Reading non-existent config returns empty config."""
+        config = mock_config_manager.read()
+
+        assert isinstance(config, CodexConfig)
+        assert len(config.mcp_servers) == 0
+
+    def test_write_and_read_roundtrip(self, mock_config_manager, sample_config):
+        """Config can be written and read back."""
+        mock_config_manager.write(sample_config)
+
+        read_config = mock_config_manager.read()
+
+        assert "databricks" in read_config.mcp_servers
+        assert read_config.mcp_servers["databricks"].command == "/usr/bin/python"
+        assert read_config.mcp_servers["databricks"].args == ["-m", "databricks_mcp_server"]
+
+    def test_write_empty_config(self, mock_config_manager):
+        """Writing empty config creates valid file."""
+        config = CodexConfig()
+        mock_config_manager.write(config)
+
+        read_config = mock_config_manager.read()
+        assert read_config.mcp_servers == {}
+
+    def test_configure_databricks_mcp(self, mock_config_manager):
+        """Configure Databricks MCP server."""
+        mock_config_manager.configure_databricks_mcp(
+            profile="PROD",
+            python_path="/custom/python",
+            mcp_entry="/custom/run_server.py",
+        )
+
+        config = mock_config_manager.read()
+
+        assert "databricks" in config.mcp_servers
+        assert config.mcp_servers["databricks"].command == "/custom/python"
+        assert config.mcp_servers["databricks"].args == ["/custom/run_server.py"]
+        assert config.mcp_servers["databricks"].env.get("DATABRICKS_CONFIG_PROFILE") == "PROD"
+
+    def test_configure_databricks_mcp_default_profile(self, mock_config_manager):
+        """Configure with DEFAULT profile doesn't add env var."""
+        mock_config_manager.configure_databricks_mcp(
+            profile="DEFAULT",
+            python_path="/python",
+            mcp_entry="/server.py",
+        )
+
+        config = mock_config_manager.read()
+
+        # DEFAULT profile shouldn't add env var
+        assert "DATABRICKS_CONFIG_PROFILE" not in config.mcp_servers["databricks"].env
+
+    def test_remove_databricks_mcp(self, mock_config_manager, sample_config):
+        """Remove Databricks MCP configuration."""
+        mock_config_manager.write(sample_config)
+
+        removed = mock_config_manager.remove_databricks_mcp()
+        assert removed is True
+
+        config = mock_config_manager.read()
+        assert "databricks" not in config.mcp_servers
+
+    def test_remove_databricks_mcp_not_exists(self, mock_config_manager):
+        """Remove returns False if not configured."""
+        removed = mock_config_manager.remove_databricks_mcp()
+        assert removed is False
+
+    def test_has_databricks_mcp(self, mock_config_manager, sample_config):
+        """Check if Databricks MCP is configured."""
+        assert mock_config_manager.has_databricks_mcp() is False
+
+        mock_config_manager.write(sample_config)
+
+        assert mock_config_manager.has_databricks_mcp() is True
+
+    def test_get_databricks_mcp_config(self, mock_config_manager, sample_config):
+        """Get Databricks MCP configuration."""
+        assert mock_config_manager.get_databricks_mcp_config() is None
+
+        mock_config_manager.write(sample_config)
+
+        config = mock_config_manager.get_databricks_mcp_config()
+        assert config is not None
+        assert config.command == "/usr/bin/python"
+
+    def test_atomic_write_creates_directory(self, temp_config_dir):
+        """Write creates parent directory if needed."""
+        nested_path = temp_config_dir / "nested" / "deep" / "config.toml"
+        manager = CodexConfigManager(config_path=nested_path)
+
+        config = CodexConfig(
+            mcp_servers={"test": MCPServerConfig(command="test")}
+        )
+        manager.write(config)
+
+        assert nested_path.exists()
+
+    def test_scope_global(self):
+        """Test global scope path."""
+        manager = CodexConfigManager(scope="global")
+        assert manager.config_path == Path.home() / ".codex" / "config.toml"
+
+    def test_scope_project(self):
+        """Test project scope path."""
+        manager = CodexConfigManager(scope="project")
+        assert manager.config_path == Path(".codex") / "config.toml"
+
+    def test_custom_path_overrides_scope(self, temp_config_dir):
+        """Custom path overrides scope setting."""
+        custom_path = temp_config_dir / "custom.toml"
+        manager = CodexConfigManager(config_path=custom_path, scope="global")
+
+        assert manager.config_path == custom_path
+
+    def test_multiple_servers(self, mock_config_manager):
+        """Config can have multiple MCP servers."""
+        config = CodexConfig(
+            mcp_servers={
+                "databricks": MCPServerConfig(command="python", args=["-m", "db"]),
+                "github": MCPServerConfig(command="node", args=["gh-server.js"]),
+                "slack": MCPServerConfig(command="python", args=["-m", "slack"]),
+            }
+        )
+
+        mock_config_manager.write(config)
+        read_config = mock_config_manager.read()
+
+        assert len(read_config.mcp_servers) == 3
+        assert "databricks" in read_config.mcp_servers
+        assert "github" in read_config.mcp_servers
+        assert "slack" in read_config.mcp_servers
diff --git a/databricks-codex/tests/test_executor.py b/databricks-codex/tests/test_executor.py
new file mode 100644
index 00000000..3f894afa
--- /dev/null
+++ b/databricks-codex/tests/test_executor.py
@@ -0,0 +1,319 @@
+"""Tests for Codex executor."""
+
+import subprocess
+from unittest.mock import patch, MagicMock
+
+import pytest
+
+from databricks_codex.executor import (
+    CodexExecutor,
+    SAFE_EXECUTION_THRESHOLD,
+)
+from databricks_codex.models import (
+    CodexExecOptions,
+    ExecutionResult,
+    ExecutionStatus,
+    SandboxMode,
+)
+
+
+class TestCodexExecutor:
+    """Tests for CodexExecutor class."""
+
+    @pytest.fixture
+    def executor(self):
+        """Create executor instance for testing."""
+        return CodexExecutor(default_timeout=60)
+
+    def test_init_defaults(self):
+        """Test executor initialization with defaults."""
+        executor = CodexExecutor()
+
+        assert executor.default_timeout == 300
+        assert executor._operations == {}
+
+    def test_init_custom(self):
+        """Test executor initialization with custom values."""
+        executor = CodexExecutor(default_timeout=600, max_workers=8)
+
+        assert executor.default_timeout == 600
+
+
+class TestExecSync:
+    """Tests for exec_sync method."""
+
+    @pytest.fixture
+    def executor(self):
+        return CodexExecutor()
+
+    def test_exec_sync_success(self, executor, mock_subprocess_run):
+        """Test successful synchronous execution."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Hello, world!",
+            stderr="",
+        )
+
+        options = CodexExecOptions(prompt="echo hello")
+        result = executor.exec_sync(options)
+
+        assert result.status == ExecutionStatus.COMPLETED
+        assert result.stdout == "Hello, world!"
+        assert result.exit_code == 0
+        assert result.elapsed_seconds >= 0
+
+    def test_exec_sync_failure(self, executor, mock_subprocess_run):
+        """Test failed execution."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=1,
+            stdout="",
+            stderr="Error occurred",
+        )
+
+        options = CodexExecOptions(prompt="failing command")
+        result = executor.exec_sync(options)
+
+        assert result.status == ExecutionStatus.FAILED
+        assert result.stderr == "Error occurred"
+        assert result.exit_code == 1
+
+    def test_exec_sync_timeout(self, executor, mock_subprocess_run_timeout):
+        """Test execution timeout."""
+        options = CodexExecOptions(prompt="slow command", timeout=1)
+        result = executor.exec_sync(options)
+
+        assert result.status == ExecutionStatus.TIMEOUT
+
+    def test_exec_sync_not_installed(self, executor, mock_subprocess_run_not_found):
+        """Test when Codex not installed."""
+        options = CodexExecOptions(prompt="test")
+        result = executor.exec_sync(options)
+
+        assert result.status == ExecutionStatus.FAILED
+        assert "not found" in result.stderr.lower()
+
+    def test_exec_sync_with_sandbox_mode(self, executor, mock_subprocess_run):
+        """Test sandbox mode is passed correctly."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0, stdout="", stderr=""
+        )
+
+        options = CodexExecOptions(
+            prompt="test",
+            sandbox_mode=SandboxMode.WORKSPACE_WRITE,
+        )
+        executor.exec_sync(options)
+
+        # Check command includes sandbox flag
+        call_args = mock_subprocess_run.call_args
+        cmd = call_args[0][0]
+        assert "--sandbox" in cmd
+        assert "workspace-write" in cmd
+
+    def test_exec_sync_read_only_no_flag(self, executor, mock_subprocess_run):
+        """Test read-only mode doesn't add extra flag."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0, stdout="", stderr=""
+        )
+
+        options = CodexExecOptions(
+            prompt="test",
+            sandbox_mode=SandboxMode.READ_ONLY,
+        )
+        executor.exec_sync(options)
+
+        call_args = mock_subprocess_run.call_args
+        cmd = call_args[0][0]
+        # read-only is default, shouldn't add --sandbox flag
+        assert "--sandbox" not in cmd
+
+    def test_exec_sync_with_model(self, executor, mock_subprocess_run):
+        """Test model parameter is passed."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0, stdout="", stderr=""
+        )
+
+        options = CodexExecOptions(prompt="test", model="gpt-4")
+        executor.exec_sync(options)
+
+        call_args = mock_subprocess_run.call_args
+        cmd = call_args[0][0]
+        assert "--model" in cmd
+        assert "gpt-4" in cmd
+
+    def test_exec_sync_with_working_dir(self, executor, mock_subprocess_run):
+        """Test working directory is passed."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0, stdout="", stderr=""
+        )
+
+        options = CodexExecOptions(prompt="test", working_dir="/tmp")
+        executor.exec_sync(options)
+
+        call_args = mock_subprocess_run.call_args
+        cmd = call_args[0][0]
+        assert "--cd" in cmd
+        assert "/tmp" in cmd
+
+
+class TestBuildCommand:
+    """Tests for _build_command method."""
+
+    @pytest.fixture
+    def executor(self):
+        return CodexExecutor()
+
+    def test_basic_command(self, executor):
+        """Test basic command building."""
+        options = CodexExecOptions(prompt="test prompt")
+        cmd = executor._build_command(options)
+
+        assert cmd[0] == "codex"
+        assert cmd[1] == "exec"
+        assert "test prompt" in cmd
+
+    def test_command_with_all_options(self, executor):
+        """Test command with all options."""
+        options = CodexExecOptions(
+            prompt="full test",
+            sandbox_mode=SandboxMode.FULL_ACCESS,
+            model="gpt-4",
+            working_dir="/workspace",
+        )
+        cmd = executor._build_command(options)
+
+        assert "--sandbox" in cmd
+        assert "danger-full-access" in cmd
+        assert "--model" in cmd
+        assert "gpt-4" in cmd
+        assert "--cd" in cmd
+        assert "/workspace" in cmd
+
+
+class TestBuildEnv:
+    """Tests for _build_env method."""
+
+    @pytest.fixture
+    def executor(self):
+        return CodexExecutor()
+
+    def test_env_includes_databricks(self, executor, mock_databricks_env):
+        """Test environment includes Databricks credentials."""
+        options = CodexExecOptions(prompt="test", inject_databricks_env=True)
+        env = executor._build_env(options)
+
+        assert "DATABRICKS_HOST" in env
+        assert "DATABRICKS_TOKEN" in env
+
+    def test_env_without_databricks(self, executor, mock_databricks_env):
+        """Test environment without Databricks injection."""
+        options = CodexExecOptions(prompt="test", inject_databricks_env=False)
+        env = executor._build_env(options)
+
+        # Should still have env vars from outer scope but not from our injection
+        # The mock_databricks_env fixture sets them, so they'd be in os.environ
+        # which we copy. This test verifies the option is respected.
+
+    def test_env_with_custom_vars(self, executor, mock_databricks_env):
+        """Test custom environment variables are added."""
+        options = CodexExecOptions(
+            prompt="test",
+            env_vars={"CUSTOM_VAR": "custom_value"},
+        )
+        env = executor._build_env(options)
+
+        assert env.get("CUSTOM_VAR") == "custom_value"
+
+    def test_env_with_profile(self, executor, mock_databricks_env):
+        """Test non-default profile adds env var."""
+        options = CodexExecOptions(
+            prompt="test",
+            databricks_profile="PROD",
+        )
+        env = executor._build_env(options)
+
+        assert env.get("DATABRICKS_CONFIG_PROFILE") == "PROD"
+
+
+class TestOperationTracking:
+    """Tests for operation tracking."""
+
+    @pytest.fixture
+    def executor(self):
+        return CodexExecutor()
+
+    def test_get_operation_not_found(self, executor):
+        """Test getting non-existent operation."""
+        result = executor.get_operation("nonexistent")
+        assert result is None
+
+    def test_list_operations_empty(self, executor):
+        """Test listing operations when empty."""
+        ops = executor.list_operations()
+        assert ops == {}
+
+    def test_clear_operation_not_found(self, executor):
+        """Test clearing non-existent operation."""
+        result = executor.clear_operation("nonexistent")
+        assert result is False
+
+
+class TestExecutorContextManager:
+    """Tests for context manager usage."""
+
+    def test_context_manager(self, mock_subprocess_run):
+        """Test executor as context manager."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0, stdout="", stderr=""
+        )
+
+        with CodexExecutor() as executor:
+            options = CodexExecOptions(prompt="test")
+            result = executor.exec_sync(options)
+            assert result.status == ExecutionStatus.COMPLETED
+
+    def test_shutdown(self):
+        """Test explicit shutdown."""
+        executor = CodexExecutor()
+        executor.shutdown(wait=True)
+        # Should not raise
+
+
+class TestExecAsync:
+    """Tests for exec_async method."""
+
+    @pytest.fixture
+    def executor(self):
+        return CodexExecutor()
+
+    @pytest.mark.asyncio
+    async def test_exec_async_fast(self, executor, mock_subprocess_run):
+        """Test fast async execution completes normally."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Fast result",
+            stderr="",
+        )
+
+        options = CodexExecOptions(prompt="fast command")
+        result = await executor.exec_async(options)
+
+        assert result.status == ExecutionStatus.COMPLETED
+        assert result.stdout == "Fast result"
+
+    @pytest.mark.asyncio
+    async def test_exec_async_with_callback(self, executor, mock_subprocess_run):
+        """Test async execution with callback."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0, stdout="", stderr=""
+        )
+
+        callback_called = []
+
+        def callback(result):
+            callback_called.append(result)
+
+        options = CodexExecOptions(prompt="test")
+        await executor.exec_async(options, callback=callback)
+
+        assert len(callback_called) == 1
diff --git a/databricks-codex/tests/test_mcp_client.py b/databricks-codex/tests/test_mcp_client.py
new file mode 100644
index 00000000..e55336fd
--- /dev/null
+++ b/databricks-codex/tests/test_mcp_client.py
@@ -0,0 +1,299 @@
+"""Tests for MCP client."""
+
+import asyncio
+import json
+from unittest.mock import AsyncMock, MagicMock, patch
+
+import pytest
+
+from databricks_codex.mcp_client import (
+    CodexMCPClient,
+    MCPClientConfig,
+    MCPError,
+)
+from databricks_codex.models import MCPToolInfo
+
+
+class TestMCPClientConfig:
+    """Tests for MCPClientConfig model."""
+
+    def test_default_config(self):
+        """Test default configuration."""
+        config = MCPClientConfig()
+
+        assert config.command is None
+        assert config.args == []
+        assert config.env == {}
+        assert config.url is None
+        assert config.timeout == 120
+
+    def test_stdio_config(self):
+        """Test stdio transport configuration."""
+        config = MCPClientConfig(
+            command="codex",
+            args=["mcp-server"],
+            env={"KEY": "value"},
+        )
+
+        assert config.command == "codex"
+        assert config.args == ["mcp-server"]
+        assert config.env == {"KEY": "value"}
+
+    def test_http_config(self):
+        """Test HTTP transport configuration."""
+        config = MCPClientConfig(
+            url="https://localhost:8080",
+            bearer_token_env_var="MCP_TOKEN",
+            timeout=60,
+        )
+
+        assert config.url == "https://localhost:8080"
+        assert config.bearer_token_env_var == "MCP_TOKEN"
+        assert config.timeout == 60
+
+
+class TestMCPError:
+    """Tests for MCPError exception."""
+
+    def test_error_creation(self):
+        """Test MCP error creation."""
+        error = MCPError(code=-32600, message="Invalid request")
+
+        assert error.code == -32600
+        assert error.message == "Invalid request"
+        assert error.data is None
+
+    def test_error_with_data(self):
+        """Test MCP error with additional data."""
+        error = MCPError(
+            code=-32602,
+            message="Invalid params",
+            data={"param": "name"},
+        )
+
+        assert error.data == {"param": "name"}
+
+
+class TestCodexMCPClient:
+    """Tests for CodexMCPClient class."""
+
+    def test_init_default(self):
+        """Test default initialization."""
+        client = CodexMCPClient()
+
+        assert client.config.command == "codex"
+        assert client.config.args == ["mcp-server"]
+        assert client._connected is False
+
+    def test_init_custom_config(self):
+        """Test initialization with custom config."""
+        config = MCPClientConfig(command="custom-codex", args=["--debug"])
+        client = CodexMCPClient(config=config)
+
+        assert client.config.command == "custom-codex"
+        assert client.config.args == ["--debug"]
+
+    def test_is_connected_initial(self):
+        """Test is_connected property initially false."""
+        client = CodexMCPClient()
+        assert client.is_connected is False
+
+
+class TestMCPClientAsync:
+    """Async tests for MCP client."""
+
+    @pytest.fixture
+    def mock_process(self):
+        """Create mock subprocess."""
+        process = MagicMock()
+        process.stdin = MagicMock()
+        process.stdin.write = MagicMock()
+        process.stdin.drain = AsyncMock()
+        process.stdout = MagicMock()
+        process.stdout.readline = AsyncMock(
+            return_value=json.dumps({
+                "jsonrpc": "2.0",
+                "id": 1,
+                "result": {"protocolVersion": "2024-11-05"},
+            }).encode() + b"\n"
+        )
+        process.stderr = MagicMock()
+        process.terminate = MagicMock()
+        process.wait = AsyncMock()
+        return process
+
+    @pytest.mark.asyncio
+    async def test_connect_stdio(self, mock_process):
+        """Test connecting via stdio transport."""
+        with patch("asyncio.create_subprocess_exec", return_value=mock_process) as mock_exec:
+            client = CodexMCPClient()
+
+            # Mock the reader task to not actually run
+            with patch.object(client, "_read_responses", return_value=None):
+                # We need to handle the initialization request
+                responses = [
+                    json.dumps({
+                        "jsonrpc": "2.0",
+                        "id": 1,
+                        "result": {"protocolVersion": "2024-11-05"},
+                    }).encode() + b"\n",
+                ]
+                mock_process.stdout.readline = AsyncMock(side_effect=responses)
+
+                # Can't fully test without running event loop properly
+                # This is a simplified test
+
+    @pytest.mark.asyncio
+    async def test_disconnect(self):
+        """Test disconnecting client."""
+        client = CodexMCPClient()
+        client._connected = True
+
+        await client.disconnect()
+
+        assert client._connected is False
+
+    @pytest.mark.asyncio
+    async def test_context_manager(self):
+        """Test async context manager."""
+        with patch.object(CodexMCPClient, "connect", new_callable=AsyncMock) as mock_connect:
+            with patch.object(CodexMCPClient, "disconnect", new_callable=AsyncMock) as mock_disconnect:
+                async with CodexMCPClient() as client:
+                    pass
+
+                mock_connect.assert_called_once()
+                mock_disconnect.assert_called_once()
+
+
+class TestMCPClientToolOperations:
+    """Tests for tool operations."""
+
+    @pytest.mark.asyncio
+    async def test_list_tools_parsing(self):
+        """Test parsing tool list response."""
+        client = CodexMCPClient()
+
+        # Mock the _send_request method
+        with patch.object(client, "_send_request", new_callable=AsyncMock) as mock_send:
+            mock_send.return_value = {
+                "tools": [
+                    {
+                        "name": "test_tool",
+                        "description": "A test tool",
+                        "inputSchema": {"type": "object"},
+                    },
+                    {
+                        "name": "another_tool",
+                        "description": "Another tool",
+                    },
+                ]
+            }
+
+            tools = await client.list_tools()
+
+            assert len(tools) == 2
+            assert tools[0].name == "test_tool"
+            assert tools[0].description == "A test tool"
+            assert tools[0].input_schema == {"type": "object"}
+            assert tools[1].name == "another_tool"
+            assert tools[1].input_schema == {}
+
+    @pytest.mark.asyncio
+    async def test_list_tools_empty(self):
+        """Test empty tool list."""
+        client = CodexMCPClient()
+
+        with patch.object(client, "_send_request", new_callable=AsyncMock) as mock_send:
+            mock_send.return_value = {"tools": []}
+
+            tools = await client.list_tools()
+
+            assert tools == []
+
+    @pytest.mark.asyncio
+    async def test_call_tool_json_response(self):
+        """Test calling tool with JSON response."""
+        client = CodexMCPClient()
+
+        with patch.object(client, "_send_request", new_callable=AsyncMock) as mock_send:
+            mock_send.return_value = {
+                "content": [
+                    {
+                        "type": "text",
+                        "text": '{"result": "success", "data": [1, 2, 3]}',
+                    }
+                ]
+            }
+
+            result = await client.call_tool("test_tool", {"arg": "value"})
+
+            assert result == {"result": "success", "data": [1, 2, 3]}
+            mock_send.assert_called_with(
+                "tools/call",
+                {"name": "test_tool", "arguments": {"arg": "value"}},
+                timeout=None,
+            )
+
+    @pytest.mark.asyncio
+    async def test_call_tool_text_response(self):
+        """Test calling tool with plain text response."""
+        client = CodexMCPClient()
+
+        with patch.object(client, "_send_request", new_callable=AsyncMock) as mock_send:
+            mock_send.return_value = {
+                "content": [
+                    {
+                        "type": "text",
+                        "text": "Plain text response",
+                    }
+                ]
+            }
+
+            result = await client.call_tool("test_tool")
+
+            assert result == {"text": "Plain text response"}
+
+    @pytest.mark.asyncio
+    async def test_call_tool_empty_arguments(self):
+        """Test calling tool without arguments."""
+        client = CodexMCPClient()
+
+        with patch.object(client, "_send_request", new_callable=AsyncMock) as mock_send:
+            mock_send.return_value = {"content": []}
+
+            await client.call_tool("test_tool")
+
+            mock_send.assert_called_with(
+                "tools/call",
+                {"name": "test_tool", "arguments": {}},
+                timeout=None,
+            )
+
+    @pytest.mark.asyncio
+    async def test_call_tool_with_timeout(self):
+        """Test calling tool with custom timeout."""
+        client = CodexMCPClient()
+
+        with patch.object(client, "_send_request", new_callable=AsyncMock) as mock_send:
+            mock_send.return_value = {"content": []}
+
+            await client.call_tool("test_tool", timeout=30)
+
+            mock_send.assert_called_with(
+                "tools/call",
+                {"name": "test_tool", "arguments": {}},
+                timeout=30,
+            )
+
+
+class TestMCPClientHTTP:
+    """Tests for HTTP transport."""
+
+    @pytest.mark.asyncio
+    async def test_http_not_connected_error(self):
+        """Test error when calling without connection."""
+        config = MCPClientConfig(url="https://localhost:8080")
+        client = CodexMCPClient(config=config)
+
+        with pytest.raises(RuntimeError, match="Not connected"):
+            await client._send_http({"test": "request"}, timeout=10)
diff --git a/databricks-codex/tests/test_models.py b/databricks-codex/tests/test_models.py
new file mode 100644
index 00000000..e8d84f77
--- /dev/null
+++ b/databricks-codex/tests/test_models.py
@@ -0,0 +1,216 @@
+"""Tests for data models."""
+
+import pytest
+from datetime import datetime
+
+from databricks_codex.models import (
+    CodexToolCategory,
+    TransportType,
+    SandboxMode,
+    ExecutionStatus,
+    CodexAuthMethod,
+    ExecutionResult,
+    MCPToolInfo,
+    CodexExecOptions,
+    CodexToolCall,
+    CodexResponse,
+    DatabricksContext,
+    CodexIntegrationConfig,
+)
+
+
+class TestEnums:
+    """Tests for enum types."""
+
+    def test_sandbox_mode_values(self):
+        """Test SandboxMode enum values."""
+        assert SandboxMode.READ_ONLY.value == "read-only"
+        assert SandboxMode.WORKSPACE_WRITE.value == "workspace-write"
+        assert SandboxMode.FULL_ACCESS.value == "danger-full-access"
+
+    def test_execution_status_values(self):
+        """Test ExecutionStatus enum values."""
+        assert ExecutionStatus.PENDING.value == "pending"
+        assert ExecutionStatus.RUNNING.value == "running"
+        assert ExecutionStatus.COMPLETED.value == "completed"
+        assert ExecutionStatus.FAILED.value == "failed"
+        assert ExecutionStatus.TIMEOUT.value == "timeout"
+
+    def test_auth_method_values(self):
+        """Test CodexAuthMethod enum values."""
+        assert CodexAuthMethod.CHATGPT_OAUTH.value == "chatgpt"
+        assert CodexAuthMethod.DEVICE_CODE.value == "device"
+        assert CodexAuthMethod.API_KEY.value == "api_key"
+        assert CodexAuthMethod.NONE.value == "none"
+
+    def test_transport_type_values(self):
+        """Test TransportType enum values."""
+        assert TransportType.STDIO.value == "stdio"
+        assert TransportType.HTTP.value == "http"
+
+    def test_tool_category_values(self):
+        """Test CodexToolCategory enum values."""
+        assert CodexToolCategory.GENERATION.value == "generation"
+        assert CodexToolCategory.ANALYSIS.value == "analysis"
+
+
+class TestDataclasses:
+    """Tests for dataclass types."""
+
+    def test_execution_result_defaults(self):
+        """Test ExecutionResult default values."""
+        result = ExecutionResult(status=ExecutionStatus.COMPLETED)
+
+        assert result.status == ExecutionStatus.COMPLETED
+        assert result.stdout == ""
+        assert result.stderr == ""
+        assert result.exit_code is None
+        assert result.elapsed_seconds == 0.0
+        assert result.operation_id is None
+
+    def test_execution_result_with_values(self):
+        """Test ExecutionResult with custom values."""
+        result = ExecutionResult(
+            status=ExecutionStatus.FAILED,
+            stdout="output",
+            stderr="error",
+            exit_code=1,
+            elapsed_seconds=5.5,
+            operation_id="abc123",
+        )
+
+        assert result.status == ExecutionStatus.FAILED
+        assert result.stdout == "output"
+        assert result.stderr == "error"
+        assert result.exit_code == 1
+        assert result.elapsed_seconds == 5.5
+        assert result.operation_id == "abc123"
+
+    def test_mcp_tool_info(self):
+        """Test MCPToolInfo dataclass."""
+        tool = MCPToolInfo(
+            name="test_tool",
+            description="A test tool",
+            input_schema={"type": "object"},
+        )
+
+        assert tool.name == "test_tool"
+        assert tool.description == "A test tool"
+        assert tool.input_schema == {"type": "object"}
+
+    def test_mcp_tool_info_defaults(self):
+        """Test MCPToolInfo default values."""
+        tool = MCPToolInfo(name="minimal", description="")
+
+        assert tool.name == "minimal"
+        assert tool.input_schema == {}
+
+
+class TestPydanticModels:
+    """Tests for Pydantic models."""
+
+    def test_codex_exec_options_defaults(self):
+        """Test CodexExecOptions default values."""
+        options = CodexExecOptions(prompt="test prompt")
+
+        assert options.prompt == "test prompt"
+        assert options.working_dir is None
+        assert options.sandbox_mode == SandboxMode.READ_ONLY
+        assert options.model is None
+        assert options.timeout == 300
+        assert options.env_vars == {}
+        assert options.databricks_profile == "DEFAULT"
+        assert options.inject_databricks_env is True
+
+    def test_codex_exec_options_custom(self):
+        """Test CodexExecOptions with custom values."""
+        options = CodexExecOptions(
+            prompt="custom prompt",
+            working_dir="/tmp",
+            sandbox_mode=SandboxMode.WORKSPACE_WRITE,
+            model="gpt-4",
+            timeout=600,
+            env_vars={"KEY": "value"},
+            databricks_profile="PROD",
+            inject_databricks_env=False,
+        )
+
+        assert options.prompt == "custom prompt"
+        assert options.working_dir == "/tmp"
+        assert options.sandbox_mode == SandboxMode.WORKSPACE_WRITE
+        assert options.model == "gpt-4"
+        assert options.timeout == 600
+        assert options.env_vars == {"KEY": "value"}
+        assert options.databricks_profile == "PROD"
+        assert options.inject_databricks_env is False
+
+    def test_codex_tool_call(self):
+        """Test CodexToolCall model."""
+        call = CodexToolCall(
+            id="call_123",
+            name="test_tool",
+            arguments={"arg1": "value1"},
+        )
+
+        assert call.id == "call_123"
+        assert call.name == "test_tool"
+        assert call.arguments == {"arg1": "value1"}
+        assert isinstance(call.timestamp, datetime)
+
+    def test_codex_response_success(self):
+        """Test CodexResponse for success."""
+        response = CodexResponse(
+            success=True,
+            content="Result content",
+            elapsed_ms=100,
+        )
+
+        assert response.success is True
+        assert response.content == "Result content"
+        assert response.tool_calls == []
+        assert response.error is None
+        assert response.elapsed_ms == 100
+
+    def test_codex_response_failure(self):
+        """Test CodexResponse for failure."""
+        response = CodexResponse(
+            success=False,
+            error="Something went wrong",
+        )
+
+        assert response.success is False
+        assert response.error == "Something went wrong"
+
+    def test_databricks_context_defaults(self):
+        """Test DatabricksContext default values."""
+        ctx = DatabricksContext()
+
+        assert ctx.host is None
+        assert ctx.profile == "DEFAULT"
+        assert ctx.catalog is None
+        assert ctx.schema_name is None
+        assert ctx.warehouse_id is None
+
+    def test_codex_integration_config(self):
+        """Test CodexIntegrationConfig model."""
+        config = CodexIntegrationConfig(
+            codex_path="/custom/codex",
+            default_timeout=600,
+            sandbox_mode="workspace-write",
+        )
+
+        assert config.codex_path == "/custom/codex"
+        assert config.default_timeout == 600
+        assert config.sandbox_mode == "workspace-write"
+        assert isinstance(config.databricks_context, DatabricksContext)
+
+    def test_model_serialization(self):
+        """Test model serialization to dict."""
+        options = CodexExecOptions(
+            prompt="test",
+            sandbox_mode=SandboxMode.READ_ONLY,
+        )
+
+        data = options.model_dump()
+        assert data["prompt"] == "test"
+        assert data["sandbox_mode"] == SandboxMode.READ_ONLY
diff --git a/databricks-codex/tests/test_session.py b/databricks-codex/tests/test_session.py
new file mode 100644
index 00000000..16d85d08
--- /dev/null
+++ b/databricks-codex/tests/test_session.py
@@ -0,0 +1,354 @@
+"""Tests for session management."""
+
+from datetime import datetime
+from pathlib import Path
+from unittest.mock import MagicMock, patch
+
+import pytest
+
+from databricks_codex.session import (
+    CodexSession,
+    SessionManager,
+)
+
+
+class TestCodexSession:
+    """Tests for CodexSession dataclass."""
+
+    def test_minimal_session(self):
+        """Test minimal session creation."""
+        session = CodexSession(
+            session_id="abc123",
+            created_at=datetime.now(),
+        )
+
+        assert session.session_id == "abc123"
+        assert session.last_activity is None
+        assert session.project_dir is None
+        assert session.metadata == {}
+
+    def test_full_session(self):
+        """Test session with all fields."""
+        now = datetime.now()
+        session = CodexSession(
+            session_id="abc123",
+            created_at=now,
+            last_activity=now,
+            project_dir=Path("/workspace"),
+            metadata={"key": "value"},
+        )
+
+        assert session.session_id == "abc123"
+        assert session.project_dir == Path("/workspace")
+        assert session.metadata == {"key": "value"}
+
+
+class TestSessionManager:
+    """Tests for SessionManager class."""
+
+    @pytest.fixture
+    def manager(self, tmp_path):
+        """Create session manager with temp directory."""
+        return SessionManager(session_dir=tmp_path / "sessions")
+
+    def test_init_default(self):
+        """Test default initialization."""
+        manager = SessionManager()
+        assert manager.session_dir == Path.home() / ".codex" / "sessions"
+
+    def test_init_custom_dir(self, tmp_path):
+        """Test initialization with custom directory."""
+        custom_dir = tmp_path / "custom"
+        manager = SessionManager(session_dir=custom_dir)
+        assert manager.session_dir == custom_dir
+
+
+class TestListSessions:
+    """Tests for list_sessions method."""
+
+    @pytest.fixture
+    def manager(self, tmp_path):
+        return SessionManager(session_dir=tmp_path / "sessions")
+
+    def test_list_sessions_success(self, manager, mock_subprocess_run):
+        """Test listing sessions successfully."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="abc12345-def6-7890-abcd-ef1234567890\nxyz98765-uvw4-3210-zyxw-vu0987654321",
+            stderr="",
+        )
+
+        sessions = manager.list_sessions()
+
+        # Should parse UUID-like patterns
+        assert len(sessions) >= 0  # May or may not find patterns
+
+    def test_list_sessions_codex_not_found(self, manager, mock_subprocess_run_not_found):
+        """Test list sessions when Codex not installed."""
+        sessions = manager.list_sessions()
+        assert sessions == []
+
+    def test_list_sessions_timeout(self, manager, mock_subprocess_run_timeout):
+        """Test list sessions timeout."""
+        sessions = manager.list_sessions()
+        assert sessions == []
+
+    def test_list_sessions_with_limit(self, manager, mock_subprocess_run):
+        """Test list sessions respects limit."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="session1\nsession2\nsession3\nsession4\nsession5",
+            stderr="",
+        )
+
+        sessions = manager.list_sessions(limit=2)
+
+        assert len(sessions) <= 2
+
+
+class TestGetLastSession:
+    """Tests for get_last_session method."""
+
+    @pytest.fixture
+    def manager(self, tmp_path):
+        return SessionManager(session_dir=tmp_path / "sessions")
+
+    def test_get_last_session(self, manager, mock_subprocess_run):
+        """Test getting last session."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="abc12345-def6-7890-abcd-ef1234567890",
+            stderr="",
+        )
+
+        session = manager.get_last_session()
+
+        # May or may not find a session depending on output parsing
+
+    def test_get_last_session_none(self, manager, mock_subprocess_run):
+        """Test getting last session when none exist."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="",
+            stderr="",
+        )
+
+        session = manager.get_last_session()
+        assert session is None
+
+
+class TestResumeSession:
+    """Tests for resume_session method."""
+
+    @pytest.fixture
+    def manager(self, tmp_path):
+        return SessionManager(session_dir=tmp_path / "sessions")
+
+    def test_resume_session_by_id(self, manager, mock_subprocess_run):
+        """Test resuming session by ID."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Resumed session abc123",
+            stderr="",
+        )
+
+        result = manager.resume_session("abc123")
+
+        assert result is not None
+        mock_subprocess_run.assert_called_once()
+        cmd = mock_subprocess_run.call_args[0][0]
+        assert "resume" in cmd
+        assert "abc123" in cmd
+
+    def test_resume_session_last(self, manager, mock_subprocess_run):
+        """Test resuming last session."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Resumed last session",
+            stderr="",
+        )
+
+        result = manager.resume_session(last=True)
+
+        cmd = mock_subprocess_run.call_args[0][0]
+        assert "--last" in cmd
+
+    def test_resume_session_no_args(self, manager):
+        """Test resume without session_id or last flag."""
+        result = manager.resume_session()
+        assert result is None
+
+    def test_resume_session_failure(self, manager, mock_subprocess_run):
+        """Test resume session failure."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=1,
+            stdout="",
+            stderr="Session not found",
+        )
+
+        result = manager.resume_session("nonexistent")
+        assert result is None
+
+    def test_resume_session_not_installed(self, manager, mock_subprocess_run_not_found):
+        """Test resume when Codex not installed."""
+        result = manager.resume_session("abc123")
+        assert result is None
+
+
+class TestForkSession:
+    """Tests for fork_session method."""
+
+    @pytest.fixture
+    def manager(self, tmp_path):
+        return SessionManager(session_dir=tmp_path / "sessions")
+
+    def test_fork_session_basic(self, manager, mock_subprocess_run):
+        """Test basic session fork."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Forked to new123",
+            stderr="",
+        )
+
+        result = manager.fork_session("original123")
+
+        cmd = mock_subprocess_run.call_args[0][0]
+        assert "fork" in cmd
+        assert "original123" in cmd
+
+    def test_fork_session_with_prompt(self, manager, mock_subprocess_run):
+        """Test forking with new prompt."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="Forked",
+            stderr="",
+        )
+
+        result = manager.fork_session("original", new_prompt="Continue with tests")
+
+        cmd = mock_subprocess_run.call_args[0][0]
+        assert "--prompt" in cmd
+        assert "Continue with tests" in cmd
+
+    def test_fork_session_failure(self, manager, mock_subprocess_run):
+        """Test fork session failure."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=1,
+            stdout="",
+            stderr="Fork failed",
+        )
+
+        result = manager.fork_session("original")
+        assert result is None
+
+    def test_fork_session_not_installed(self, manager, mock_subprocess_run_not_found):
+        """Test fork when Codex not installed."""
+        result = manager.fork_session("original")
+        assert result is None
+
+
+class TestSessionExists:
+    """Tests for session_exists method."""
+
+    @pytest.fixture
+    def manager(self, tmp_path):
+        return SessionManager(session_dir=tmp_path / "sessions")
+
+    def test_session_exists_true(self, manager, mock_subprocess_run):
+        """Test session exists check - positive."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="abc123",
+            stderr="",
+        )
+
+        # This depends on parsing - may need adjustment
+        exists = manager.session_exists("abc123")
+        # Result depends on implementation
+
+    def test_session_exists_false(self, manager, mock_subprocess_run):
+        """Test session exists check - negative."""
+        mock_subprocess_run.return_value = MagicMock(
+            returncode=0,
+            stdout="other-session",
+            stderr="",
+        )
+
+        exists = manager.session_exists("nonexistent")
+        # Result depends on implementation
+
+
+class TestClearSessions:
+    """Tests for clear_sessions method."""
+
+    @pytest.fixture
+    def manager(self, tmp_path):
+        return SessionManager(session_dir=tmp_path / "sessions")
+
+    def test_clear_sessions_not_implemented(self, manager):
+        """Test clear sessions returns 0 (not implemented)."""
+        result = manager.clear_sessions()
+        assert result == 0
+
+
+class TestParseSessionOutput:
+    """Tests for _parse_session_output method."""
+
+    @pytest.fixture
+    def manager(self, tmp_path):
+        return SessionManager(session_dir=tmp_path / "sessions")
+
+    def test_parse_uuid_format(self, manager):
+        """Test parsing UUID format session IDs."""
+        output = "abc12345-def6-7890-abcd-ef1234567890"
+        sessions = manager._parse_session_output(output)
+
+        assert len(sessions) == 1
+        assert sessions[0].session_id == "abc12345-def6-7890-abcd-ef1234567890"
+
+    def test_parse_multiple_sessions(self, manager):
+        """Test parsing multiple sessions."""
+        output = """Session: abc12345-def6-7890-abcd-ef1234567890
+Session: xyz98765-uvw4-3210-zyxw-vu0987654321"""
+        sessions = manager._parse_session_output(output)
+
+        assert len(sessions) == 2
+
+    def test_parse_empty_output(self, manager):
+        """Test parsing empty output."""
+        sessions = manager._parse_session_output("")
+        assert sessions == []
+
+    def test_parse_short_hex_format(self, manager):
+        """Test parsing short hex format IDs."""
+        output = "abcd1234"
+        sessions = manager._parse_session_output(output)
+
+        assert len(sessions) == 1
+
+
+class TestExtractSessionId:
+    """Tests for _extract_session_id method."""
+
+    @pytest.fixture
+    def manager(self, tmp_path):
+        return SessionManager(session_dir=tmp_path / "sessions")
+
+    def test_extract_uuid(self, manager):
+        """Test extracting UUID format ID."""
+        output = "Created session abc12345-def6-7890-abcd-ef1234567890"
+        session_id = manager._extract_session_id(output)
+
+        assert session_id == "abc12345-def6-7890-abcd-ef1234567890"
+
+    def test_extract_short_id(self, manager):
+        """Test extracting short hex ID."""
+        output = "Forked to abcd1234"
+        session_id = manager._extract_session_id(output)
+
+        assert session_id is not None
+
+    def test_extract_from_empty(self, manager):
+        """Test extracting from empty output."""
+        session_id = manager._extract_session_id("")
+        assert session_id is None