Blame: tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py - BerriAI/litellm

BerriAI / litellm UNCLAIMED

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]

0 0 1 Python

Normal View History Raw

(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`import json`
			`import os`
			`import sys`
			`from datetime import datetime`
			`from typing import AsyncIterator, Dict, Any`
			`import asyncio`
			`import unittest.mock`
			`from unittest.mock import AsyncMock, MagicMock`

			`sys.path.insert(`
working anthropic API tests 2025-03-26 17:34:41 -07:00			`0, os.path.abspath("../../..")`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`) # Adds the parent directory to the system path`
			`import litellm`
			`import pytest`
			`from dotenv import load_dotenv`
			`from litellm.llms.anthropic.experimental_pass_through.messages.handler import (`
			`anthropic_messages,`
			`)`
working anthropic API tests 2025-03-26 17:34:41 -07:00
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`from typing import Optional`
			`from litellm.types.utils import StandardLoggingPayload`
			`from litellm.integrations.custom_logger import CustomLogger`
			`from litellm.llms.custom_httpx.http_handler import AsyncHTTPHandler`
			`from litellm.router import Router`
			`import importlib`
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`from litellm.llms.bedrock.base_aws_llm import BaseAWSLLM`
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`from base_anthropic_unified_messages_test import BaseAnthropicMessagesTest`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`# Load environment variables`
			`load_dotenv()`


			`@pytest.fixture(scope="session")`
			`def event_loop():`
			`"""Create an instance of the default event loop for each test session."""`
			`loop = asyncio.get_event_loop_policy().new_event_loop()`
			`yield loop`
			`loop.close()`


			`@pytest.fixture(scope="function", autouse=True)`
			`def setup_and_teardown(event_loop): # Add event_loop as a dependency`
			`curr_dir = os.getcwd()`
			`sys.path.insert(0, os.path.abspath("../.."))`

			`import litellm`
			`from litellm import Router`

			`importlib.reload(litellm)`

			`# Set the event loop from the fixture`
			`asyncio.set_event_loop(event_loop)`

			`print(litellm)`
			`yield`

			`# Clean up any pending tasks`
			`pending = asyncio.all_tasks(event_loop)`
			`for task in pending:`
			`task.cancel()`

			`# Run the event loop until all tasks are cancelled`
			`if pending:`
			`event_loop.run_until_complete(asyncio.gather(*pending, return_exceptions=True))`


			`def _validate_anthropic_response(response: Dict[str, Any]):`
			`assert "id" in response`
			`assert "content" in response`
			`assert "model" in response`
			`assert response["role"] == "assistant"`


[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`class TestAnthropicDirectAPI(BaseAnthropicMessagesTest):`
			`"""Tests for direct Anthropic API calls"""`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`@property`
			`def model_config(self) -> Dict[str, Any]:`
			`return {`
			`"model": "claude-3-haiku-20240307",`
			`"api_key": os.getenv("ANTHROPIC_API_KEY"),`
			`}`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Cost tracking and logging via the /v1/messages API are not working when using Claude Code (#11928) * add test_anthropic_messages_litellm_router_streaming_with_logging to base tests * move test * fixes for base ant tests * working bedrock ant logging * use BaseAnthropicMessagesStreamingIterator * use common iterator for messages streaming * TestAnthropicDirectAPI * test_anthropic_claude3_transformation.py * fix code QA checks * fix logging for anthropic messages in SLP * fix TestAnthropicOpenAIAPI * remove hard coded usage for adapter * test_anthropic_messages_litellm_router_streaming_with_logging 2025-06-20 18:08:35 -07:00			`@property`
			`def expected_model_name_in_logging(self) -> str:`
			`"""`
			`This is the model name that is expected to be in the logging payload`
			`"""`
			`return "claude-3-haiku-20240307"`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`class TestAnthropicBedrockAPI(BaseAnthropicMessagesTest):`
			`"""Tests for Anthropic via Bedrock"""`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`@property`
			`def model_config(self) -> Dict[str, Any]:`
			`return {`
			`"model": "bedrock/us.anthropic.claude-3-5-sonnet-20240620-v1:0",`
			`}`
[Bug Fix] Cost tracking and logging via the /v1/messages API are not working when using Claude Code (#11928) * add test_anthropic_messages_litellm_router_streaming_with_logging to base tests * move test * fixes for base ant tests * working bedrock ant logging * use BaseAnthropicMessagesStreamingIterator * use common iterator for messages streaming * TestAnthropicDirectAPI * test_anthropic_claude3_transformation.py * fix code QA checks * fix logging for anthropic messages in SLP * fix TestAnthropicOpenAIAPI * remove hard coded usage for adapter * test_anthropic_messages_litellm_router_streaming_with_logging 2025-06-20 18:08:35 -07:00
			`@property`
			`def expected_model_name_in_logging(self) -> str:`
			`"""`
			`This is the model name that is expected to be in the logging payload`
			`"""`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`return "bedrock/us.anthropic.claude-3-5-sonnet-20240620-v1:0"`
[Feat] Add streaming support for using bedrock invoke models with /v1/messages (#10710) * add basic bedrock transform * test_anthropic_messages_streaming_bedrock_invoke * fix: typing ant * fix: get async response iterator * fix: code quality check 2025-05-09 18:56:23 -07:00

[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`class TestAnthropicOpenAIAPI(BaseAnthropicMessagesTest):`
			`"""Tests for OpenAI via Anthropic messages interface"""`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`@property`
			`def model_config(self) -> Dict[str, Any]:`
			`return {`
			`"model": "openai/gpt-4o-mini",`
[Bug Fix] Cost tracking and logging via the /v1/messages API are not working when using Claude Code (#11928) * add test_anthropic_messages_litellm_router_streaming_with_logging to base tests * move test * fixes for base ant tests * working bedrock ant logging * use BaseAnthropicMessagesStreamingIterator * use common iterator for messages streaming * TestAnthropicDirectAPI * test_anthropic_claude3_transformation.py * fix code QA checks * fix logging for anthropic messages in SLP * fix TestAnthropicOpenAIAPI * remove hard coded usage for adapter * test_anthropic_messages_litellm_router_streaming_with_logging 2025-06-20 18:08:35 -07:00			`"client": None,`
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`}`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Cost tracking and logging via the /v1/messages API are not working when using Claude Code (#11928) * add test_anthropic_messages_litellm_router_streaming_with_logging to base tests * move test * fixes for base ant tests * working bedrock ant logging * use BaseAnthropicMessagesStreamingIterator * use common iterator for messages streaming * TestAnthropicDirectAPI * test_anthropic_claude3_transformation.py * fix code QA checks * fix logging for anthropic messages in SLP * fix TestAnthropicOpenAIAPI * remove hard coded usage for adapter * test_anthropic_messages_litellm_router_streaming_with_logging 2025-06-20 18:08:35 -07:00			`@property`
			`def expected_model_name_in_logging(self) -> str:`
			`"""`
			`This is the model name that is expected to be in the logging payload`
			`"""`
			`return "gpt-4o-mini"`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Cost tracking and logging via the /v1/messages API are not working when using Claude Code (#11928) * add test_anthropic_messages_litellm_router_streaming_with_logging to base tests * move test * fixes for base ant tests * working bedrock ant logging * use BaseAnthropicMessagesStreamingIterator * use common iterator for messages streaming * TestAnthropicDirectAPI * test_anthropic_claude3_transformation.py * fix code QA checks * fix logging for anthropic messages in SLP * fix TestAnthropicOpenAIAPI * remove hard coded usage for adapter * test_anthropic_messages_litellm_router_streaming_with_logging 2025-06-20 18:08:35 -07:00			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_litellm_router_streaming_with_logging(self):`
			`"""`
			`Test the anthropic_messages with streaming request`
			`"""`
			`pass`
[Feat] Add streaming support for using bedrock invoke models with /v1/messages (#10710) * add basic bedrock transform * test_anthropic_messages_streaming_bedrock_invoke * fix: typing ant * fix: get async response iterator * fix: code quality check 2025-05-09 18:56:23 -07:00

(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_streaming_with_bad_request():`
			`"""`
			`Test the anthropic_messages with streaming request`
			`"""`
			`try:`
use new anthropic interface 2025-03-31 14:31:09 -07:00			`response = await litellm.anthropic.messages.acreate(`
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`messages=[{"role": "user", "content": "hi"}],`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`api_key=os.getenv("ANTHROPIC_API_KEY"),`
			`model="claude-3-haiku-20240307",`
			`max_tokens=100,`
			`stream=True,`
			`)`
			`print(response)`
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`if isinstance(response, AsyncIterator):`
			`async for chunk in response:`
			`print("chunk=", chunk)`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`except Exception as e:`
			`print("got exception", e)`
			`print("vars", vars(e))`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`if hasattr(e, "status_code"):`
			`assert getattr(e, "status_code") == 400`
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`else:`
			`assert isinstance(e, Exception)`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00

			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_router_streaming_with_bad_request():`
			`"""`
			`Test the anthropic_messages with streaming request`
			`"""`
			`try:`
			`router = Router(`
			`model_list=[`
			`{`
			`"model_name": "claude-special-alias",`
			`"litellm_params": {`
			`"model": "claude-3-haiku-20240307",`
			`"api_key": os.getenv("ANTHROPIC_API_KEY"),`
			`},`
			`}`
			`]`
			`)`

			`response = await router.aanthropic_messages(`
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`messages=[{"role": "user", "content": "hi"}],`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`model="claude-special-alias",`
			`max_tokens=100,`
			`stream=True,`
			`)`
			`print(response)`
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`if isinstance(response, AsyncIterator):`
			`async for chunk in response:`
			`print("chunk=", chunk)`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`except Exception as e:`
			`print("got exception", e)`
			`print("vars", vars(e))`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`if hasattr(e, "status_code"):`
			`assert getattr(e, "status_code") == 400`
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`else:`
			`assert isinstance(e, Exception)`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00

			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_litellm_router_non_streaming():`
			`"""`
			`Test the anthropic_messages with non-streaming request`
			`"""`
			`litellm._turn_on_debug()`
			`router = Router(`
			`model_list=[`
			`{`
			`"model_name": "claude-special-alias",`
			`"litellm_params": {`
			`"model": "claude-3-haiku-20240307",`
			`"api_key": os.getenv("ANTHROPIC_API_KEY"),`
			`},`
			`}`
			`]`
			`)`

			`# Set up test parameters`
			`messages = [{"role": "user", "content": "Hello, can you tell me a short joke?"}]`

			`# Call the handler`
			`response = await router.aanthropic_messages(`
			`messages=messages,`
			`model="claude-special-alias",`
			`max_tokens=100,`
			`)`

			`# Verify response`
			`assert "id" in response`
			`assert "content" in response`
			`assert "model" in response`
			`assert response["role"] == "assistant"`

			`print(f"Non-streaming response: {json.dumps(response, indent=2)}")`
			`return response`

Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_litellm_router_routing_strategy():`
			`"""`
			`Test the anthropic_messages with routing strategy + non-streaming request`
			`"""`
			`litellm._turn_on_debug()`
			`router = Router(`
			`model_list=[`
			`{`
			`"model_name": "claude-special-alias",`
			`"litellm_params": {`
			`"model": "claude-3-haiku-20240307",`
			`"api_key": os.getenv("ANTHROPIC_API_KEY"),`
			`},`
			`}`
			`],`
			`routing_strategy="latency-based-routing",`
			`)`

			`# Set up test parameters`
			`messages = [{"role": "user", "content": "Hello, can you tell me a short joke?"}]`

			`# Call the handler`
			`response = await router.aanthropic_messages(`
			`messages=messages,`
			`model="claude-special-alias",`
			`max_tokens=100,`
			`metadata={`
			`"user_id": "hello",`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`},`
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`)`

			`# Verify response`
			`assert "id" in response`
			`assert "content" in response`
			`assert "model" in response`
			`assert response["role"] == "assistant"`

			`print(f"Non-streaming response: {json.dumps(response, indent=2)}")`
[Bug fix] - Fix /messages fallback from Anthropic API -> Bedrock API (#13946) * use helper get_provider_specific_headers * fix get_provider_specific_headers * test_anthropic_messages_fallbacks * bedrock/us.anthropic.claude-sonnet-4 * fix: get_provider_specific_headers * TestProviderSpecificHeaderUtils * test_anthropic_messages_fallbacks 2025-08-25 13:44:54 -07:00			`return response`

			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_fallbacks():`
			`"""`
			`E2E test the anthropic_messages fallbacks from Anthropic API to Bedrock`
			`"""`
			`litellm._turn_on_debug()`
			`router = Router(`
			`model_list=[`
			`{`
			`"model_name": "anthropic/claude-opus-4-20250514",`
			`"litellm_params": {`
			`"model": "anthropic/claude-opus-4-20250514",`
			`"api_key": "bad-key",`
			`},`
			`},`
			`{`
			`"model_name": "bedrock/us.anthropic.claude-sonnet-4-20250514-v1:0",`
			`"litellm_params": {`
			`"model": "bedrock/us.anthropic.claude-sonnet-4-20250514-v1:0",`
			`},`
			`}`
			`],`
			`fallbacks=[`
			`{`
			`"anthropic/claude-opus-4-20250514":`
			`["bedrock/us.anthropic.claude-sonnet-4-20250514-v1:0"]`
			`}`
			`]`
			`)`

			`# Set up test parameters`
			`messages = [{"role": "user", "content": "Hello, can you tell me a short joke?"}]`

			`# Call the handler`
			`response = await router.aanthropic_messages(`
			`messages=messages,`
			`model="anthropic/claude-opus-4-20250514",`
			`max_tokens=100,`
			`metadata={`
			`"user_id": "hello",`
			`},`
			`)`

			`# Verify response`
			`assert "id" in response`
			`assert "content" in response`
			`assert "model" in response`
			`assert response["role"] == "assistant"`

			`print(f"Non-streaming response: {json.dumps(response, indent=2)}")`
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`return response`


			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_litellm_router_latency_metadata_tracking():`
			`"""`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`Test the anthropic_messages with routing strategy and verify that _latency_per_deployment`
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`field is passed in litellm_metadata when calling litellm.anthropic_messages`
			`"""`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`with unittest.mock.patch("litellm.anthropic_messages") as mock_anthropic_messages:`
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`# Mock the return value`
			`mock_response = {`
			`"id": "msg_123456",`
			`"type": "message",`
			`"role": "assistant",`
			`"content": [{"type": "text", "text": "Here's a joke for you!"}],`
			`"model": "claude-3-haiku-20240307",`
			`"stop_reason": "end_turn",`
			`"usage": {"input_tokens": 10, "output_tokens": 20},`
			`}`
			`mock_anthropic_messages.return_value = mock_response`
			`# Set the __name__ attribute that the router expects`
			`mock_anthropic_messages.__name__ = "anthropic_messages"`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`MODEL_GROUP = "claude-special-alias"`
			`router = Router(`
			`model_list=[`
			`{`
			`"model_name": MODEL_GROUP,`
			`"litellm_params": {`
			`"model": "claude-3-haiku-20240307",`
			`"api_key": os.getenv("ANTHROPIC_API_KEY"),`
			`},`
			`}`
			`],`
			`routing_strategy="latency-based-routing",`
			`)`

			`# Set up test parameters`
			`messages = [{"role": "user", "content": "Hello, can you tell me a short joke?"}]`

			`# Call the handler`
			`response = await router.aanthropic_messages(`
			`messages=messages,`
			`model=MODEL_GROUP,`
			`max_tokens=100,`
			`metadata={`
			`"user_id": "hello",`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`},`
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`)`

			`# Verify response`
			`assert response == mock_response`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`# Verify that litellm.anthropic_messages was called`
			`mock_anthropic_messages.assert_called_once()`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`# Get the call arguments`
			`call_args = mock_anthropic_messages.call_args`
			`call_kwargs = call_args.kwargs`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`print("Call kwargs:", json.dumps(call_kwargs, indent=2, default=str))`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`# Verify that litellm_metadata was passed and contains _latency_per_deployment`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`assert (`
			`"litellm_metadata" in call_kwargs`
			`), "litellm_metadata should be passed to anthropic_messages"`

[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`litellm_metadata = call_kwargs["litellm_metadata"]`
			`assert litellm_metadata is not None, "litellm_metadata should not be None"`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`assert isinstance(`
			`litellm_metadata, dict`
			`), "litellm_metadata should be a dictionary"`

[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`# Verify _latency_per_deployment is present`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`assert (`
			`"_latency_per_deployment" in litellm_metadata`
			`), "litellm_metadata should contain _latency_per_deployment field"`

[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`# Verify the structure of _latency_per_deployment`
			`latency_per_deployment = litellm_metadata["_latency_per_deployment"]`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`assert isinstance(`
			`latency_per_deployment, dict`
			`), "_latency_per_deployment should be a dictionary"`

[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`print(f"✅ Latency per deployment data: {latency_per_deployment}")`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`# Verify other expected fields in litellm_metadata`
			`assert "model_group" in litellm_metadata`
			`assert litellm_metadata["model_group"] == MODEL_GROUP`
			`assert "deployment" in litellm_metadata`
			`assert "model_info" in litellm_metadata`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`# Verify other call parameters`
			`assert call_kwargs["model"] == "claude-3-haiku-20240307"`
			`assert call_kwargs["messages"] == messages`
			`assert call_kwargs["max_tokens"] == 100`
			`assert call_kwargs["metadata"] == {"user_id": "hello"}`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
			`print(`
			`"✅ Successfully verified that _latency_per_deployment is passed in litellm_metadata to anthropic_messages"`
			`)`

[Bug Fix] Using /messages with lowest latency routing (#12180) * add validate_anthropic_api_metadata * fixes for lowest latency deployment * add _select_metadata_field * test_anthropic_messages_litellm_router_latency_metadata_tracking 2025-06-30 15:57:19 -07:00			`return response`

(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00
			`class TestCustomLogger(CustomLogger):`
			`def __init__(self):`
			`super().__init__()`
			`self.logged_standard_logging_payload: Optional[StandardLoggingPayload] = None`

			`async def async_log_success_event(self, kwargs, response_obj, start_time, end_time):`
			`print("inside async_log_success_event")`
			`self.logged_standard_logging_payload = kwargs.get("standard_logging_object")`

			`pass`


			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_litellm_router_non_streaming_with_logging():`
			`"""`
			`Test the anthropic_messages with non-streaming request`

			`- Ensure Cost + Usage is tracked`
			`"""`
			`test_custom_logger = TestCustomLogger()`
			`litellm.callbacks = [test_custom_logger]`
			`litellm._turn_on_debug()`
[Bug Fix] Fix model_group tracked for /v1/messages and /moderations (#11933) * fixes _get_router_metadata_variable_name * fixes _update_kwargs_before_fallbacks * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_moderations_api_logging * fix _pass_through_moderation_endpoint_factory 2025-06-20 14:51:50 -07:00			`MODEL_GROUP = "claude-special-alias"`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`router = Router(`
			`model_list=[`
			`{`
[Bug Fix] Fix model_group tracked for /v1/messages and /moderations (#11933) * fixes _get_router_metadata_variable_name * fixes _update_kwargs_before_fallbacks * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_moderations_api_logging * fix _pass_through_moderation_endpoint_factory 2025-06-20 14:51:50 -07:00			`"model_name": MODEL_GROUP,`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`"litellm_params": {`
			`"model": "claude-3-haiku-20240307",`
			`"api_key": os.getenv("ANTHROPIC_API_KEY"),`
			`},`
			`}`
			`]`
			`)`

			`# Set up test parameters`
			`messages = [{"role": "user", "content": "Hello, can you tell me a short joke?"}]`

			`# Call the handler`
			`response = await router.aanthropic_messages(`
			`messages=messages,`
[Bug Fix] Fix model_group tracked for /v1/messages and /moderations (#11933) * fixes _get_router_metadata_variable_name * fixes _update_kwargs_before_fallbacks * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_moderations_api_logging * fix _pass_through_moderation_endpoint_factory 2025-06-20 14:51:50 -07:00			`model=MODEL_GROUP,`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`max_tokens=100,`
			`)`

			`# Verify response`
			`_validate_anthropic_response(response)`

			`print(f"Non-streaming response: {json.dumps(response, indent=2)}")`

			`await asyncio.sleep(1)`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
			`assert (`
			`test_custom_logger.logged_standard_logging_payload is not None`
			`), "Logging payload should not be None"`
			`print(`
			`"tracked standard logging payload",`
			`json.dumps(`
			`test_custom_logger.logged_standard_logging_payload, indent=4, default=str`
			`),`
			`)`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`assert test_custom_logger.logged_standard_logging_payload["messages"] == messages`
			`assert test_custom_logger.logged_standard_logging_payload["response"] is not None`
			`assert (`
			`test_custom_logger.logged_standard_logging_payload["model"]`
			`== "claude-3-haiku-20240307"`
			`)`

			`# check logged usage + spend`
			`assert test_custom_logger.logged_standard_logging_payload["response_cost"] > 0`
			`assert (`
			`test_custom_logger.logged_standard_logging_payload["prompt_tokens"]`
			`== response["usage"]["input_tokens"]`
			`)`
			`assert (`
			`test_custom_logger.logged_standard_logging_payload["completion_tokens"]`
			`== response["usage"]["output_tokens"]`
			`)`

[Bug Fix] Fix model_group tracked for /v1/messages and /moderations (#11933) * fixes _get_router_metadata_variable_name * fixes _update_kwargs_before_fallbacks * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_moderations_api_logging * fix _pass_through_moderation_endpoint_factory 2025-06-20 14:51:50 -07:00			`# assert model_group`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`assert (`
			`test_custom_logger.logged_standard_logging_payload["model_group"] == MODEL_GROUP`
			`)`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00

			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_with_extra_headers():`
			`"""`
			`Test the anthropic_messages with extra headers`
			`"""`
			`# Get API key from environment`
			`api_key = os.getenv("ANTHROPIC_API_KEY", "fake-api-key")`

			`# Set up test parameters`
			`messages = [{"role": "user", "content": "Hello, can you tell me a short joke?"}]`
			`extra_headers = {`
			`"anthropic-version": "custom-version-for-test",`
			`}`

			`# Create a mock response`
			`mock_response = MagicMock()`
			`mock_response.raise_for_status = MagicMock()`
			`mock_response.json.return_value = {`
			`"id": "msg_123456",`
			`"type": "message",`
			`"role": "assistant",`
			`"content": [`
			`{`
			`"type": "text",`
			`"text": "Why did the chicken cross the road? To get to the other side!",`
			`}`
			`],`
			`"model": "claude-3-haiku-20240307",`
			`"stop_reason": "end_turn",`
			`"usage": {"input_tokens": 10, "output_tokens": 20},`
			`}`

			`# Create a mock client with AsyncMock for the post method`
			`mock_client = MagicMock(spec=AsyncHTTPHandler)`
			`mock_client.post = AsyncMock(return_value=mock_response)`

			`# Call the handler with extra_headers and our mocked client`
use new anthropic interface 2025-03-31 14:31:09 -07:00			`response = await litellm.anthropic.messages.acreate(`
(Refactor) `/v1/messages` to follow simpler logic for Anthropic API spec (#9013) * anthropic_messages_handler v0 * fix /messages * working messages with router methods * test_anthropic_messages_handler_litellm_router_non_streaming * test_anthropic_messages_litellm_router_non_streaming_with_logging * AnthropicMessagesConfig * _handle_anthropic_messages_response_logging * working with /v1/messages endpoint * working /v1/messages endpoint * refactor to use router factory function * use aanthropic_messages * use BaseConfig for Anthropic /v1/messages * track api key, team on /v1/messages endpoint * fix get_logging_payload * BaseAnthropicMessagesTest * align test config * test_anthropic_messages_with_thinking * test_anthropic_streaming_with_thinking * fix - display anthropic url for debugging * test_bad_request_error_handling * test_anthropic_messages_router_streaming_with_bad_request * fix ProxyException * test_bad_request_error_handling_streaming * use provider_specific_header * test_anthropic_messages_with_extra_headers * test_anthropic_messages_to_wildcard_model * fix gcs pub sub test * standard_logging_payload * fix unit testing for anthopic /v1/messages support * fix pass through anthropic messages api * delete dead code * fix anthropic pass through response * revert change to spend tracking utils * fix get_litellm_metadata_from_kwargs * fix spend logs payload json * proxy_pass_through_endpoint_tests * TestAnthropicPassthroughBasic * fix pass through tests * test_async_vertex_proxy_route_api_key_auth * _handle_anthropic_messages_response_logging * vertex_credentials * test_set_default_vertex_config * test_anthropic_messages_litellm_router_non_streaming_with_logging * test_ageneric_api_call_with_fallbacks_basic * test__aadapter_completion 2025-03-06 00:43:08 -08:00			`messages=messages,`
			`api_key=api_key,`
			`model="claude-3-haiku-20240307",`
			`max_tokens=100,`
			`client=mock_client,`
			`provider_specific_header={`
			`"custom_llm_provider": "anthropic",`
			`"extra_headers": extra_headers,`
			`},`
			`)`

			`# Verify the post method was called with the right parameters`
			`mock_client.post.assert_called_once()`
			`call_kwargs = mock_client.post.call_args.kwargs`

			`# Verify headers were passed correctly`
			`headers = call_kwargs.get("headers", {})`
			`print("HEADERS IN REQUEST", headers)`
			`for key, value in extra_headers.items():`
			`assert key in headers`
			`assert headers[key] == value`

			`# Verify the response was processed correctly`
			`assert response == mock_response.json.return_value`

			`return response`
[Refactor] Anthropic /v1/messages endpoint - Refactor to use base llm http handler and transformations (#10677) * fix: add transform_anthropic_messages_request * fix: add get_requested_response_api_optional_param * fix: use base llm http handler for anthropic messages * fix: add anthropic transform response * fix: transform_anthropic_messages_response * fix: fixes for anthropic messages * fix: code qa fixes * fix: pass thinking to anthropic * fix: linting * fixes 2025-05-08 17:56:50 -07:00

Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# @pytest.mark.asyncio`
			`# async def test_bedrock_messages_api_header_forwarding():`
			`# """`
			`# Test that headers from kwargs (set by proxy's add_headers_to_llm_call_by_model_group)`
			`# are correctly passed to validate_anthropic_messages_environment for Bedrock Invoke API.`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# This verifies that forward_client_headers_to_llm_api works for Bedrock Invoke API (Messages API).`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# Issue: When calling Anthropic models via the Messages API, LiteLLM makes a call to`
			`# Bedrock's Invoke API, and custom headers were not being forwarded, even though`
			`# they worked correctly for Chat Completions API with Bedrock's Converse API.`
			`# """`
			`# from litellm.llms.custom_httpx.llm_http_handler import BaseLLMHTTPHandler`
			`# from litellm.litellm_core_utils.litellm_logging import Logging as LiteLLMLoggingObj`
			`# from litellm.types.router import GenericLiteLLMParams`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# handler = BaseLLMHTTPHandler()`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# # Headers that would be set by the proxy when forward_client_headers_to_llm_api is configured`
			`# custom_headers = {`
			`# "X-Custom-Header": "CustomValue",`
			`# "X-Request-ID": "req-123",`
			`# }`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# # Mock the provider config`
			`# mock_provider_config = MagicMock()`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# # We'll check what headers are passed to this method`
			`# mock_provider_config.validate_anthropic_messages_environment.return_value = (`
			`# {"Authorization": "Bearer test"},`
			`# "https://bedrock-runtime.us-east-1.amazonaws.com/invoke"`
			`# )`
			`# mock_provider_config.transform_anthropic_messages_request.return_value = {"model": "test"}`
			`# mock_provider_config.get_complete_url.return_value = "https://test.com"`
			`# mock_provider_config.sign_request.return_value = ({}, None)`
			`# mock_provider_config.transform_anthropic_messages_response.return_value = {"id": "test"}`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# # Mock HTTP client to prevent actual network calls`
			`# with unittest.mock.patch("litellm.llms.custom_httpx.llm_http_handler.get_async_httpx_client") as mock_get_client:`
			`# mock_http_client = AsyncMock()`
			`# mock_response = MagicMock()`
			`# mock_response.status_code = 200`
			`# mock_response.json.return_value = {"id": "test", "content": []}`
			`# mock_response.text = "{}"`
			`# mock_http_client.post.return_value = mock_response`
			`# mock_get_client.return_value = mock_http_client`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# # Mock logging object`
			`# mock_logging_obj = MagicMock(spec=LiteLLMLoggingObj)`
			`# mock_logging_obj.model_call_details = {}`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# # Call the handler with headers in kwargs`
			`# try:`
			`# await handler.async_anthropic_messages_handler(`
			`# model="bedrock/anthropic.claude-3-5-sonnet-20241022-v2:0",`
			`# messages=[{"role": "user", "content": "Hello"}],`
			`# anthropic_messages_provider_config=mock_provider_config,`
			`# anthropic_messages_optional_request_params={"max_tokens": 100},`
			`# custom_llm_provider="bedrock",`
			`# litellm_params=GenericLiteLLMParams(`
			`# api_key="test-key",`
			`# aws_region_name="us-east-1"`
			`# ),`
			`# logging_obj=mock_logging_obj,`
			`# api_key="test-key",`
			`# stream=False,`
			`# kwargs={"headers": custom_headers} # Headers set by proxy`
			`# )`
			`# except Exception:`
			`# pass # Ignore errors, we're only checking if headers were passed`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# # Verify that validate_anthropic_messages_environment was called`
			`# assert mock_provider_config.validate_anthropic_messages_environment.called`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# # Get the headers that were passed`
			`# call_args = mock_provider_config.validate_anthropic_messages_environment.call_args`
			`# passed_headers = call_args[1]["headers"]`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30
Fix test_bedrock_messages_api_header_forwarding 2026-02-11 16:56:34 +05:30			`# # The custom headers from kwargs should be in the passed headers`
			`# assert "X-Custom-Header" in passed_headers or "x-custom-header" in passed_headers`
			`# assert "X-Request-ID" in passed_headers or "x-request-id" in passed_headers`
Add Add per model group header forwarding for Bedrock Invoke API (#16042) 2025-10-31 08:40:17 +05:30

[Refactor] Anthropic /v1/messages endpoint - Refactor to use base llm http handler and transformations (#10677) * fix: add transform_anthropic_messages_request * fix: add get_requested_response_api_optional_param * fix: use base llm http handler for anthropic messages * fix: add anthropic transform response * fix: transform_anthropic_messages_response * fix: fixes for anthropic messages * fix: code qa fixes * fix: pass thinking to anthropic * fix: linting * fixes 2025-05-08 17:56:50 -07:00			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_with_thinking():`
			`"""`
			`Test the anthropic_messages with thinking`
			`"""`
			`# Get API key from environment`
			`api_key = os.getenv("ANTHROPIC_API_KEY", "fake-api-key")`

			`# Set up test parameters`
			`messages = [{"role": "user", "content": "Hello, can you tell me a short joke?"}]`

			`# Create a mock response`
			`mock_response = MagicMock()`
			`mock_response.raise_for_status = MagicMock()`
			`mock_response.json.return_value = {`
			`"id": "msg_123456",`
			`"type": "message",`
			`"role": "assistant",`
			`"content": [`
			`{`
			`"type": "text",`
			`"text": "Why did the chicken cross the road? To get to the other side!",`
			`}`
			`],`
			`"model": "claude-3-haiku-20240307",`
			`"stop_reason": "end_turn",`
			`"usage": {"input_tokens": 10, "output_tokens": 20},`
			`}`

			`# Create a mock client with AsyncMock for the post method`
			`mock_client = MagicMock(spec=AsyncHTTPHandler)`
			`mock_client.post = AsyncMock(return_value=mock_response)`

			`# Call the handler with extra_headers and our mocked client`
			`response = await litellm.anthropic.messages.acreate(`
			`messages=messages,`
			`api_key=api_key,`
			`model="claude-3-haiku-20240307",`
			`max_tokens=100,`
			`client=mock_client,`
			`thinking={"budget_tokens": 100},`
			`)`

			`# Verify the post method was called with the right parameters`
			`mock_client.post.assert_called_once()`
			`call_kwargs = mock_client.post.call_args.kwargs`
			`print("CALL KWARGS", call_kwargs)`

			`# Verify headers were passed correctly`
			`request_body = json.loads(call_kwargs.get("data", {}))`
			`print("REQUEST BODY", request_body)`
			`assert request_body["max_tokens"] == 100`
			`assert request_body["model"] == "claude-3-haiku-20240307"`
			`assert request_body["messages"] == messages`
			`assert request_body["thinking"] == {"budget_tokens": 100}`

			`# Verify the response was processed correctly`
			`assert response == mock_response.json.return_value`

			`return response`
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00

			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_bedrock_credentials_passthrough():`
			`"""`
			`Test that AWS credentials are correctly passed through to BaseAWSLLM.get_credentials`
			`when using anthropic.messages.acreate with a bedrock model`
			`"""`
			`# Mock the get_credentials method`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`with unittest.mock.patch.object(`
			`BaseAWSLLM, "get_credentials"`
			`) as mock_get_credentials:`
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`# Create a proper mock for credentials with the necessary attributes`
			`mock_credentials = unittest.mock.MagicMock()`
			`mock_credentials.access_key = "mock_access_key"`
			`mock_credentials.secret_key = "mock_secret_key"`
			`mock_credentials.token = "mock_session_token"`
			`mock_get_credentials.return_value = mock_credentials`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`# We also need to mock the actual AWS request signing to avoid real API calls`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`with unittest.mock.patch("botocore.auth.SigV4Auth.add_auth"):`
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`# Set up mock for AsyncHTTPHandler.post to avoid actual API calls`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`with unittest.mock.patch(`
			`"litellm.llms.custom_httpx.http_handler.AsyncHTTPHandler.post"`
			`) as mock_post:`
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`# Configure mock response`
			`mock_response = unittest.mock.MagicMock()`
			`mock_response.raise_for_status = unittest.mock.MagicMock()`
			`mock_response.json.return_value = {`
			`"id": "msg_bedrock_123",`
			`"type": "message",`
			`"role": "assistant",`
			`"content": [{"type": "text", "text": "This is a mock response"}],`
			`"model": "bedrock/us.anthropic.claude-3-5-sonnet-20240620-v1:0",`
			`"stop_reason": "end_turn",`
			`"usage": {"input_tokens": 10, "output_tokens": 20},`
			`}`
			`mock_post.return_value = mock_response`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`# Test AWS credentials parameters - separate from function call parameters`
			`aws_params = {`
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`"aws_access_key_id": "test_access_key",`
			`"aws_secret_access_key": "test_secret_key",`
			`"aws_session_token": "test_session_token",`
			`"aws_region_name": "us-west-2",`
			`"aws_role_name": "test_role_name",`
			`"aws_session_name": "test_session_name",`
			`"aws_profile_name": "test_profile",`
			`"aws_web_identity_token": "test_web_identity_token",`
			`"aws_sts_endpoint": "https://sts.test-region.amazonaws.com",`
			`}`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`# Call the function with AWS credentials`
			`await litellm.anthropic.messages.acreate(`
			`messages=[{"role": "user", "content": "Hello, test credentials"}],`
			`model="bedrock/us.anthropic.claude-3-5-sonnet-20240620-v1:0",`
			`max_tokens=100,`
test fix - test_anthropic_messages_passthrough.py 2025-06-30 21:56:19 -07:00			`**aws_params,`
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`)`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`# Verify get_credentials was called with the correct parameters`
			`mock_get_credentials.assert_called_once()`
			`call_args = mock_get_credentials.call_args[1]`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Fix]: /messages - allow using dynamic AWS params (#10769) * fix: dynamic AWS params added for messages routes * Update tests/pass_through_unit_tests/test_anthropic_messages_passthrough.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> 2025-05-12 14:09:17 -07:00			`# Assert that our test credentials were passed correctly`
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00			`for param_name, param_value in aws_params.items():`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`assert (`
			`call_args[param_name] == param_value`
			`), f"Parameter {param_name} was not passed correctly"`
[Fix] Allow using dynamic aws_region with /messages on Bedrock (#10779) * fix: fix get_complete_url * test: test_anthropic_messages_bedrock_dynamic_region 2025-05-12 20:22:38 -07:00

			`@pytest.mark.asyncio`
			`async def test_anthropic_messages_bedrock_dynamic_region():`
			`"""`
			`Test that when aws_region_name is provided, it is used in request url`
			`"""`
			`# Mock the HTTP response`
			`mock_response = MagicMock()`
			`mock_response.raise_for_status = MagicMock()`
			`mock_response.json.return_value = {`
			`"id": "msg_bedrock_123",`
			`"type": "message",`
			`"role": "assistant",`
			`"content": [{"type": "text", "text": "This is a mock response"}],`
			`"model": "bedrock/us.anthropic.claude-3-5-sonnet-20240620-v1:0",`
			`"stop_reason": "end_turn",`
			`"usage": {"input_tokens": 10, "output_tokens": 20},`
			`}`

			`# Create a mock client with AsyncMock for the post method`
			`mock_client = AsyncMock(spec=AsyncHTTPHandler)`
			`mock_client.post = AsyncMock(return_value=mock_response)`

			`# Patch necessary AWS components`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`with unittest.mock.patch(`
			`"botocore.auth.SigV4Auth.add_auth"`
			`), unittest.mock.patch.object(`
			`BaseAWSLLM, "get_credentials"`
			`) as mock_get_credentials:`

[Fix] Allow using dynamic aws_region with /messages on Bedrock (#10779) * fix: fix get_complete_url * test: test_anthropic_messages_bedrock_dynamic_region 2025-05-12 20:22:38 -07:00			`# Setup mock credentials`
			`mock_credentials = unittest.mock.MagicMock()`
			`mock_credentials.access_key = "test_access_key"`
			`mock_credentials.secret_key = "test_secret_key"`
			`mock_credentials.token = "test_session_token"`
			`mock_get_credentials.return_value = mock_credentials`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Fix] Allow using dynamic aws_region with /messages on Bedrock (#10779) * fix: fix get_complete_url * test: test_anthropic_messages_bedrock_dynamic_region 2025-05-12 20:22:38 -07:00			`# Test with specific region`
			`test_region = "us-east-1"`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Fix] Allow using dynamic aws_region with /messages on Bedrock (#10779) * fix: fix get_complete_url * test: test_anthropic_messages_bedrock_dynamic_region 2025-05-12 20:22:38 -07:00			`# Call anthropic.messages.acreate with aws_region_name`
			`response = await litellm.anthropic.messages.acreate(`
			`messages=[{"role": "user", "content": "Hello, test region"}],`
			`model="bedrock/us.anthropic.claude-3-5-sonnet-20240620-v1:0",`
			`max_tokens=100,`
			`aws_region_name=test_region,`
			`client=mock_client,`
			`)`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Fix] Allow using dynamic aws_region with /messages on Bedrock (#10779) * fix: fix get_complete_url * test: test_anthropic_messages_bedrock_dynamic_region 2025-05-12 20:22:38 -07:00			`# Verify response`
			`assert response == mock_response.json.return_value`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Fix] Allow using dynamic aws_region with /messages on Bedrock (#10779) * fix: fix get_complete_url * test: test_anthropic_messages_bedrock_dynamic_region 2025-05-12 20:22:38 -07:00			`# Verify the post method was called with the correct URL containing the region`
			`mock_client.post.assert_called_once()`
			`call_args = mock_client.post.call_args`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00
[Fix] Allow using dynamic aws_region with /messages on Bedrock (#10779) * fix: fix get_complete_url * test: test_anthropic_messages_bedrock_dynamic_region 2025-05-12 20:22:38 -07:00			`# Check that the URL contains the correct region`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`url = call_args.kwargs.get("url", "")`
			`assert (`
			`f"bedrock-runtime.{test_region}.amazonaws.com" in url`
			`), f"URL does not contain the correct region. URL: {url}"`

[Fix] Allow using dynamic aws_region with /messages on Bedrock (#10779) * fix: fix get_complete_url * test: test_anthropic_messages_bedrock_dynamic_region 2025-05-12 20:22:38 -07:00			`# Verify get_credentials was called with the correct region`
			`mock_get_credentials.assert_called_once()`
			`credentials_args = mock_get_credentials.call_args.kwargs`
Azure `api_version="preview"` support + Bedrock cost tracking via Anthropic `/v1/messages` (#13072) * fix(azure/chat/gpt_transformation.py): support api_version="preview" Fixes https://github.com/BerriAI/litellm/issues/12945 * Fix anthropic passthrough logging handler model fallback for streaming requests (#13022) * fix: anthropic passthrough logging handler model fallback for streaming requests - Add fallback logic to retrieve model from logging_obj.model_call_details when request_body.model is empty - Fixes issue #12933 where streaming requests to anthropic passthrough endpoints would crash due to missing model field - Ensures downstream logging and cost calculation work correctly for all streaming scenarios - Maintains backwards compatibility with existing non-streaming requests * test: add minimal tests for anthropic passthrough logging handler model fallback - Add unit tests for the model fallback logic in _handle_logging_anthropic_collected_chunks - Test existing behavior when request_body.model is present - Test fallback logic when request_body.model is empty but logging_obj.model_call_details has model - Test edge cases where both sources are empty or missing - Ensure backwards compatibility and graceful degradation * fix(anthropic_passthrough_logging_handler.py): add provider to model name (accurate cost tracking) * fix(anthropic_passthrough_logging_handler.py): don't reset custom llm provider, if already set * fix: fix check --------- Co-authored-by: Haggai Shachar <haggai.shachar@backline.ai> 2025-07-29 08:13:55 -07:00			`assert credentials_args.get("aws_region_name") == test_region`
[Fix] Allow using dynamic aws_region with /messages on Bedrock (#10779) * fix: fix get_complete_url * test: test_anthropic_messages_bedrock_dynamic_region 2025-05-12 20:22:38 -07:00
[Feat] Allow using litellm.completion with /v1/messages API Spec (use gpt-4, gemini etc with claude code) (#11502) * feat: add anthropic stream wrapper * feat: add AnthropicExperimentalPassThroughConfig * feat: working non streaming anthropic * feat: working streaming anthropic-litellm bridge * test - anthropic OpenAI bridge tests * fix: add sync support for anthropic_messages * fix: using is async check * fix: ensure streams are SSE * fix: imports * fix code qa check * fix: linting errors * test_sync_openai_messages * cleanup remove stash file 2025-06-06 20:35:53 -07:00
			`def test_sync_openai_messages():`
			`"""`
			`Test the anthropic_messages with sync request`
			`"""`
			`litellm._turn_on_debug()`
			`response = litellm.anthropic.messages.create(`
			`messages=[{"role": "user", "content": "Hello, can you tell me a short joke?"}],`
			`model="openai/gpt-4o-mini",`
			`max_tokens=100,`
			`)`
			`print("ANT response", response)`

			`assert response is not None`
			`assert isinstance(response, dict)`
[Fix] inconsistent response format in anthropic.messages.acreate() when using non anthropic providers (#20442) * _translate_openai_content_to_anthropic * test_response_format_consistency * test fixes unit tests * test fix * fix: use dict access for anthropic content blocks in tests (#20447) The translate_openai_response_to_anthropic method returns dicts, not objects. Changed .type/.text/.thinking attribute access to dict ['key'] access. --------- Co-authored-by: shin-bot-litellm <shin-bot-litellm@berri.ai> 2026-02-04 16:37:40 -08:00			`assert response["content"][0]["text"] is not None`