Blame: sdk/python/docs/api-reference.md - openai/codex

openai / codex UNCLAIMED

Lightweight coding agent that runs in your terminal

Add Python SDK public API and examples (#14446) ## TL;DR WIP esp the examples Thin the Python SDK public surface so the wrapper layer returns canonical app-server generated models directly. - keeps `Codex` / `AsyncCodex` / `Thread` / `Turn` and input helpers, but removes alias-only type layers and custom result models - `metadata` now returns `InitializeResponse` and `run()` returns the generated app-server `Turn` - updates docs, examples, notebook, and tests to use canonical generated types and regenerates `v2_all.py` against current schema - keeps the pinned runtime-package integration flow and real integration coverage ## Validation - `PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests` - `GH_TOKEN="$(gh auth token)" RUN_REAL_CODEX_TESTS=1 PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests -rs` --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-17 16:05:56 -07:00			`# Codex App Server SDK — API Reference`

			Public surface of `codex_app_server` for app-server v2.

Add Python SDK thread.run convenience methods (#15088) ## TL;DR Add `thread.run(...)` / `async thread.run(...)` convenience methods to the Python SDK for the common case. - add `RunInput = Input \| str` and `RunResult` with `final_response`, collected `items`, and optional `usage` - keep `thread.turn(...)` strict and lower-level for streaming, steering, interrupting, and raw generated `Turn` access - update Python SDK docs, quickstart examples, and tests for the sync and async convenience flows ## Validation - `python3 -m pytest sdk/python/tests/test_public_api_signatures.py sdk/python/tests/test_public_api_runtime_behavior.py` - `python3 -m pytest sdk/python/tests/test_real_app_server_integration.py -k 'thread_run_convenience or async_thread_run_convenience'` (skipped in this environment) --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-18 17:57:48 -07:00			This SDK surface is experimental. The current implementation intentionally allows only one active turn consumer (`Thread.run()`, `TurnHandle.stream()`, or `TurnHandle.run()`) per client instance at a time.
Add Python SDK public API and examples (#14446) ## TL;DR WIP esp the examples Thin the Python SDK public surface so the wrapper layer returns canonical app-server generated models directly. - keeps `Codex` / `AsyncCodex` / `Thread` / `Turn` and input helpers, but removes alias-only type layers and custom result models - `metadata` now returns `InitializeResponse` and `run()` returns the generated app-server `Turn` - updates docs, examples, notebook, and tests to use canonical generated types and regenerates `v2_all.py` against current schema - keeps the pinned runtime-package integration flow and real integration coverage ## Validation - `PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests` - `GH_TOKEN="$(gh auth token)" RUN_REAL_CODEX_TESTS=1 PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests -rs` --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-17 16:05:56 -07:00
			`## Package Entry`

			```python
			`from codex_app_server import (`
			`Codex,`
			`AsyncCodex,`
Add Python SDK thread.run convenience methods (#15088) ## TL;DR Add `thread.run(...)` / `async thread.run(...)` convenience methods to the Python SDK for the common case. - add `RunInput = Input \| str` and `RunResult` with `final_response`, collected `items`, and optional `usage` - keep `thread.turn(...)` strict and lower-level for streaming, steering, interrupting, and raw generated `Turn` access - update Python SDK docs, quickstart examples, and tests for the sync and async convenience flows ## Validation - `python3 -m pytest sdk/python/tests/test_public_api_signatures.py sdk/python/tests/test_public_api_runtime_behavior.py` - `python3 -m pytest sdk/python/tests/test_real_app_server_integration.py -k 'thread_run_convenience or async_thread_run_convenience'` (skipped in this environment) --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-18 17:57:48 -07:00			`RunResult,`
Add Python SDK public API and examples (#14446) ## TL;DR WIP esp the examples Thin the Python SDK public surface so the wrapper layer returns canonical app-server generated models directly. - keeps `Codex` / `AsyncCodex` / `Thread` / `Turn` and input helpers, but removes alias-only type layers and custom result models - `metadata` now returns `InitializeResponse` and `run()` returns the generated app-server `Turn` - updates docs, examples, notebook, and tests to use canonical generated types and regenerates `v2_all.py` against current schema - keeps the pinned runtime-package integration flow and real integration coverage ## Validation - `PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests` - `GH_TOKEN="$(gh auth token)" RUN_REAL_CODEX_TESTS=1 PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests -rs` --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-17 16:05:56 -07:00			`Thread,`
			`AsyncThread,`
			`TurnHandle,`
			`AsyncTurnHandle,`
			`InitializeResponse,`
			`Input,`
			`InputItem,`
			`TextInput,`
			`ImageInput,`
			`LocalImageInput,`
			`SkillInput,`
			`MentionInput,`
			`TurnStatus,`
			`)`
Add Python SDK thread.run convenience methods (#15088) ## TL;DR Add `thread.run(...)` / `async thread.run(...)` convenience methods to the Python SDK for the common case. - add `RunInput = Input \| str` and `RunResult` with `final_response`, collected `items`, and optional `usage` - keep `thread.turn(...)` strict and lower-level for streaming, steering, interrupting, and raw generated `Turn` access - update Python SDK docs, quickstart examples, and tests for the sync and async convenience flows ## Validation - `python3 -m pytest sdk/python/tests/test_public_api_signatures.py sdk/python/tests/test_public_api_runtime_behavior.py` - `python3 -m pytest sdk/python/tests/test_real_app_server_integration.py -k 'thread_run_convenience or async_thread_run_convenience'` (skipped in this environment) --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-18 17:57:48 -07:00			`from codex_app_server.generated.v2_all import ThreadItem, ThreadTokenUsage`
Add Python SDK public API and examples (#14446) ## TL;DR WIP esp the examples Thin the Python SDK public surface so the wrapper layer returns canonical app-server generated models directly. - keeps `Codex` / `AsyncCodex` / `Thread` / `Turn` and input helpers, but removes alias-only type layers and custom result models - `metadata` now returns `InitializeResponse` and `run()` returns the generated app-server `Turn` - updates docs, examples, notebook, and tests to use canonical generated types and regenerates `v2_all.py` against current schema - keeps the pinned runtime-package integration flow and real integration coverage ## Validation - `PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests` - `GH_TOKEN="$(gh auth token)" RUN_REAL_CODEX_TESTS=1 PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests -rs` --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-17 16:05:56 -07:00			```

			- Version: `codex_app_server.__version__`
			`- Requires Python >= 3.10`
			- Canonical generated app-server models live in `codex_app_server.generated.v2_all`

			`## Codex (sync)`

			```python
			`Codex(config: AppServerConfig \| None = None)`
			```

			`Properties/methods:`

			- `metadata -> InitializeResponse`
			- `close() -> None`
			- `thread_start(*, approval_policy=None, base_instructions=None, config=None, cwd=None, developer_instructions=None, ephemeral=None, model=None, model_provider=None, personality=None, sandbox=None) -> Thread`
			- `thread_list(*, archived=None, cursor=None, cwd=None, limit=None, model_providers=None, sort_key=None, source_kinds=None) -> ThreadListResponse`
			- `thread_resume(thread_id: str, *, approval_policy=None, base_instructions=None, config=None, cwd=None, developer_instructions=None, model=None, model_provider=None, personality=None, sandbox=None) -> Thread`
			- `thread_fork(thread_id: str, *, approval_policy=None, base_instructions=None, config=None, cwd=None, developer_instructions=None, model=None, model_provider=None, sandbox=None) -> Thread`
			- `thread_archive(thread_id: str) -> ThreadArchiveResponse`
			- `thread_unarchive(thread_id: str) -> Thread`
			- `models(*, include_hidden: bool = False) -> ModelListResponse`

			`Context manager:`

			```python
			`with Codex() as codex:`
			`...`
			```

			`## AsyncCodex (async parity)`

			```python
			`AsyncCodex(config: AppServerConfig \| None = None)`
			```

			`Preferred usage:`

			```python
			`async with AsyncCodex() as codex:`
			`...`
			```

			`AsyncCodex` initializes lazily. Context entry is the standard path because it
			`ensures startup and shutdown are paired explicitly.`

			`Properties/methods:`

			- `metadata -> InitializeResponse`
			- `close() -> Awaitable[None]`
			- `thread_start(*, approval_policy=None, base_instructions=None, config=None, cwd=None, developer_instructions=None, ephemeral=None, model=None, model_provider=None, personality=None, sandbox=None) -> Awaitable[AsyncThread]`
			- `thread_list(*, archived=None, cursor=None, cwd=None, limit=None, model_providers=None, sort_key=None, source_kinds=None) -> Awaitable[ThreadListResponse]`
			- `thread_resume(thread_id: str, *, approval_policy=None, base_instructions=None, config=None, cwd=None, developer_instructions=None, model=None, model_provider=None, personality=None, sandbox=None) -> Awaitable[AsyncThread]`
			- `thread_fork(thread_id: str, *, approval_policy=None, base_instructions=None, config=None, cwd=None, developer_instructions=None, ephemeral=None, model=None, model_provider=None, sandbox=None) -> Awaitable[AsyncThread]`
			- `thread_archive(thread_id: str) -> Awaitable[ThreadArchiveResponse]`
			- `thread_unarchive(thread_id: str) -> Awaitable[AsyncThread]`
			- `models(*, include_hidden: bool = False) -> Awaitable[ModelListResponse]`

			`Async context manager:`

			```python
			`async with AsyncCodex() as codex:`
			`...`
			```

			`## Thread / AsyncThread`

			`Thread` and `AsyncThread` share the same shape and intent.

			`### Thread`

Add Python SDK thread.run convenience methods (#15088) ## TL;DR Add `thread.run(...)` / `async thread.run(...)` convenience methods to the Python SDK for the common case. - add `RunInput = Input \| str` and `RunResult` with `final_response`, collected `items`, and optional `usage` - keep `thread.turn(...)` strict and lower-level for streaming, steering, interrupting, and raw generated `Turn` access - update Python SDK docs, quickstart examples, and tests for the sync and async convenience flows ## Validation - `python3 -m pytest sdk/python/tests/test_public_api_signatures.py sdk/python/tests/test_public_api_runtime_behavior.py` - `python3 -m pytest sdk/python/tests/test_real_app_server_integration.py -k 'thread_run_convenience or async_thread_run_convenience'` (skipped in this environment) --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-18 17:57:48 -07:00			- `run(input: str \| Input, *, approval_policy=None, approvals_reviewer=None, cwd=None, effort=None, model=None, output_schema=None, personality=None, sandbox_policy=None, service_tier=None, summary=None) -> RunResult`
Add Python SDK public API and examples (#14446) ## TL;DR WIP esp the examples Thin the Python SDK public surface so the wrapper layer returns canonical app-server generated models directly. - keeps `Codex` / `AsyncCodex` / `Thread` / `Turn` and input helpers, but removes alias-only type layers and custom result models - `metadata` now returns `InitializeResponse` and `run()` returns the generated app-server `Turn` - updates docs, examples, notebook, and tests to use canonical generated types and regenerates `v2_all.py` against current schema - keeps the pinned runtime-package integration flow and real integration coverage ## Validation - `PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests` - `GH_TOKEN="$(gh auth token)" RUN_REAL_CODEX_TESTS=1 PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests -rs` --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-17 16:05:56 -07:00			- `turn(input: Input, *, approval_policy=None, cwd=None, effort=None, model=None, output_schema=None, personality=None, sandbox_policy=None, summary=None) -> TurnHandle`
			- `read(*, include_turns: bool = False) -> ThreadReadResponse`
			- `set_name(name: str) -> ThreadSetNameResponse`
			- `compact() -> ThreadCompactStartResponse`

			`### AsyncThread`

Add Python SDK thread.run convenience methods (#15088) ## TL;DR Add `thread.run(...)` / `async thread.run(...)` convenience methods to the Python SDK for the common case. - add `RunInput = Input \| str` and `RunResult` with `final_response`, collected `items`, and optional `usage` - keep `thread.turn(...)` strict and lower-level for streaming, steering, interrupting, and raw generated `Turn` access - update Python SDK docs, quickstart examples, and tests for the sync and async convenience flows ## Validation - `python3 -m pytest sdk/python/tests/test_public_api_signatures.py sdk/python/tests/test_public_api_runtime_behavior.py` - `python3 -m pytest sdk/python/tests/test_real_app_server_integration.py -k 'thread_run_convenience or async_thread_run_convenience'` (skipped in this environment) --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-18 17:57:48 -07:00			- `run(input: str \| Input, *, approval_policy=None, approvals_reviewer=None, cwd=None, effort=None, model=None, output_schema=None, personality=None, sandbox_policy=None, service_tier=None, summary=None) -> Awaitable[RunResult]`
Add Python SDK public API and examples (#14446) ## TL;DR WIP esp the examples Thin the Python SDK public surface so the wrapper layer returns canonical app-server generated models directly. - keeps `Codex` / `AsyncCodex` / `Thread` / `Turn` and input helpers, but removes alias-only type layers and custom result models - `metadata` now returns `InitializeResponse` and `run()` returns the generated app-server `Turn` - updates docs, examples, notebook, and tests to use canonical generated types and regenerates `v2_all.py` against current schema - keeps the pinned runtime-package integration flow and real integration coverage ## Validation - `PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests` - `GH_TOKEN="$(gh auth token)" RUN_REAL_CODEX_TESTS=1 PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests -rs` --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-17 16:05:56 -07:00			- `turn(input: Input, *, approval_policy=None, cwd=None, effort=None, model=None, output_schema=None, personality=None, sandbox_policy=None, summary=None) -> Awaitable[AsyncTurnHandle]`
			- `read(*, include_turns: bool = False) -> Awaitable[ThreadReadResponse]`
			- `set_name(name: str) -> Awaitable[ThreadSetNameResponse]`
			- `compact() -> Awaitable[ThreadCompactStartResponse]`

Add Python SDK thread.run convenience methods (#15088) ## TL;DR Add `thread.run(...)` / `async thread.run(...)` convenience methods to the Python SDK for the common case. - add `RunInput = Input \| str` and `RunResult` with `final_response`, collected `items`, and optional `usage` - keep `thread.turn(...)` strict and lower-level for streaming, steering, interrupting, and raw generated `Turn` access - update Python SDK docs, quickstart examples, and tests for the sync and async convenience flows ## Validation - `python3 -m pytest sdk/python/tests/test_public_api_signatures.py sdk/python/tests/test_public_api_runtime_behavior.py` - `python3 -m pytest sdk/python/tests/test_real_app_server_integration.py -k 'thread_run_convenience or async_thread_run_convenience'` (skipped in this environment) --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-18 17:57:48 -07:00			`run(...)` is the common-case convenience path. It accepts plain strings, starts
			`the turn, consumes notifications until completion, and returns a small result`
			`object with:`

			- `final_response: str \| None`
			- `items: list[ThreadItem]`
			- `usage: ThreadTokenUsage \| None`

			`final_response` is `None` when the turn finishes without a final-answer or
			`phase-less assistant message item.`

			Use `turn(...)` when you need low-level turn control (`stream()`, `steer()`,
			`interrupt()`) or the canonical generated `Turn` from `TurnHandle.run()`.

Add Python SDK public API and examples (#14446) ## TL;DR WIP esp the examples Thin the Python SDK public surface so the wrapper layer returns canonical app-server generated models directly. - keeps `Codex` / `AsyncCodex` / `Thread` / `Turn` and input helpers, but removes alias-only type layers and custom result models - `metadata` now returns `InitializeResponse` and `run()` returns the generated app-server `Turn` - updates docs, examples, notebook, and tests to use canonical generated types and regenerates `v2_all.py` against current schema - keeps the pinned runtime-package integration flow and real integration coverage ## Validation - `PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests` - `GH_TOKEN="$(gh auth token)" RUN_REAL_CODEX_TESTS=1 PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests -rs` --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-17 16:05:56 -07:00			`## TurnHandle / AsyncTurnHandle`

			`### TurnHandle`

			- `steer(input: Input) -> TurnSteerResponse`
			- `interrupt() -> TurnInterruptResponse`
			- `stream() -> Iterator[Notification]`
			- `run() -> codex_app_server.generated.v2_all.Turn`

			`Behavior notes:`

			- `stream()` and `run()` are exclusive per client instance in the current experimental build
			- starting a second turn consumer on the same `Codex` instance raises `RuntimeError`

			`### AsyncTurnHandle`

			- `steer(input: Input) -> Awaitable[TurnSteerResponse]`
			- `interrupt() -> Awaitable[TurnInterruptResponse]`
			- `stream() -> AsyncIterator[Notification]`
			- `run() -> Awaitable[codex_app_server.generated.v2_all.Turn]`

			`Behavior notes:`

			- `stream()` and `run()` are exclusive per client instance in the current experimental build
			- starting a second turn consumer on the same `AsyncCodex` instance raises `RuntimeError`

			`## Inputs`

			```python
			`@dataclass class TextInput: text: str`
			`@dataclass class ImageInput: url: str`
			`@dataclass class LocalImageInput: path: str`
			`@dataclass class SkillInput: name: str; path: str`
			`@dataclass class MentionInput: name: str; path: str`

			`InputItem = TextInput \| ImageInput \| LocalImageInput \| SkillInput \| MentionInput`
			`Input = list[InputItem] \| InputItem`
			```

			`## Generated Models`

			`The SDK wrappers return and accept canonical generated app-server models wherever possible:`

			```python
			`from codex_app_server.generated.v2_all import (`
			`AskForApproval,`
			`ThreadReadResponse,`
			`Turn,`
			`TurnStartParams,`
			`TurnStatus,`
			`)`
			```

			`## Retry + errors`

			```python
			`from codex_app_server import (`
			`retry_on_overload,`
			`JsonRpcError,`
			`MethodNotFoundError,`
			`InvalidParamsError,`
			`ServerBusyError,`
			`is_retryable_error,`
			`)`
			```

			- `retry_on_overload(...)` retries transient overload errors with exponential backoff + jitter.
			- `is_retryable_error(exc)` checks if an exception is transient/overload-like.

			`## Example`

			```python
Add Python SDK thread.run convenience methods (#15088) ## TL;DR Add `thread.run(...)` / `async thread.run(...)` convenience methods to the Python SDK for the common case. - add `RunInput = Input \| str` and `RunResult` with `final_response`, collected `items`, and optional `usage` - keep `thread.turn(...)` strict and lower-level for streaming, steering, interrupting, and raw generated `Turn` access - update Python SDK docs, quickstart examples, and tests for the sync and async convenience flows ## Validation - `python3 -m pytest sdk/python/tests/test_public_api_signatures.py sdk/python/tests/test_public_api_runtime_behavior.py` - `python3 -m pytest sdk/python/tests/test_real_app_server_integration.py -k 'thread_run_convenience or async_thread_run_convenience'` (skipped in this environment) --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-18 17:57:48 -07:00			`from codex_app_server import Codex`
Add Python SDK public API and examples (#14446) ## TL;DR WIP esp the examples Thin the Python SDK public surface so the wrapper layer returns canonical app-server generated models directly. - keeps `Codex` / `AsyncCodex` / `Thread` / `Turn` and input helpers, but removes alias-only type layers and custom result models - `metadata` now returns `InitializeResponse` and `run()` returns the generated app-server `Turn` - updates docs, examples, notebook, and tests to use canonical generated types and regenerates `v2_all.py` against current schema - keeps the pinned runtime-package integration flow and real integration coverage ## Validation - `PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests` - `GH_TOKEN="$(gh auth token)" RUN_REAL_CODEX_TESTS=1 PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests -rs` --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-17 16:05:56 -07:00
			`with Codex() as codex:`
			`thread = codex.thread_start(model="gpt-5.4", config={"model_reasoning_effort": "high"})`
Add Python SDK thread.run convenience methods (#15088) ## TL;DR Add `thread.run(...)` / `async thread.run(...)` convenience methods to the Python SDK for the common case. - add `RunInput = Input \| str` and `RunResult` with `final_response`, collected `items`, and optional `usage` - keep `thread.turn(...)` strict and lower-level for streaming, steering, interrupting, and raw generated `Turn` access - update Python SDK docs, quickstart examples, and tests for the sync and async convenience flows ## Validation - `python3 -m pytest sdk/python/tests/test_public_api_signatures.py sdk/python/tests/test_public_api_runtime_behavior.py` - `python3 -m pytest sdk/python/tests/test_real_app_server_integration.py -k 'thread_run_convenience or async_thread_run_convenience'` (skipped in this environment) --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-18 17:57:48 -07:00			`result = thread.run("Say hello in one sentence.")`
			`print(result.final_response)`
Add Python SDK public API and examples (#14446) ## TL;DR WIP esp the examples Thin the Python SDK public surface so the wrapper layer returns canonical app-server generated models directly. - keeps `Codex` / `AsyncCodex` / `Thread` / `Turn` and input helpers, but removes alias-only type layers and custom result models - `metadata` now returns `InitializeResponse` and `run()` returns the generated app-server `Turn` - updates docs, examples, notebook, and tests to use canonical generated types and regenerates `v2_all.py` against current schema - keeps the pinned runtime-package integration flow and real integration coverage ## Validation - `PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests` - `GH_TOKEN="$(gh auth token)" RUN_REAL_CODEX_TESTS=1 PYTHONPATH=sdk/python/src python3 -m pytest sdk/python/tests -rs` --------- Co-authored-by: Codex <noreply@openai.com> 2026-03-17 16:05:56 -07:00			```