Releases: BerriAI/litellm
v1.81.6-nightly
What's Changed
- test(proxy): add regression tests for vertex passthrough model names … by @michelligabriele in #19855
- fix: guardrails issues streaming-response regex by @Harshit28j in #19901
- fix: add fix for migration issue and stable image by @Harshit28j in #19843
- fix: filter unsupported beta headers for AWS Bedrock Invoke API by @jayy-77 in #19877
- fix: allow tool_choice for Azure GPT-5 chat models by @jayy-77 in #19813
- fix: tool with anthropic #19800 by @zifeo in #19805
- inspect BadRequestError after all other policy types by @demoray in #19878
- fix(main): use local tiktoken cache in lazy loading by @Chesars in #19774
- fix(gemini): subtract implicit cached tokens from text_tokens for correct cost calculation by @Chesars in #19775
- fix Prompt Studio history to load tools and system messages by @naaa760 in #19920
- [Release Day] - Fixed CI/CD issues & changed processes by @AlexsanderHamir in #19902
- [Feat] - Search API add /list endpoint to list what search tools exist in router by @ishaan-jaff in #19969
- [Feature] UI - Tables: Reusable Table Sort Component by @yuneng-jiang in #19970
- [Feature] UI - Logs: Adding Error message search to ui spend logs by @yuneng-jiang in #19963
- [Feat] LiteLLM Vector Stores - Add permission management for users, teams by @ishaan-jaff in #19972
- feat: Add new OpenRouter models: xiaomi/mimo-v2-flash, `z-ai/glm-4.… by @rushilchugh01 in #19938
- fix gemini gemini-robotics-er-1.5-preview entry by @Sameerlite in #19974
- fix(vertex_ai): convert image URLs to base64 in tool messages for Anthropic by @Chesars in #19896
- Fix/router search tools v2 by @Harshit28j in #19840
- [Infra] Remove _experimental/out routes from gitignore + UI Build by @yuneng-jiang in #19976
- [Feature] UI - Usage Export: Breakdown by Teams and Keys by @yuneng-jiang in #19953
- Fix stream_chunk_builder to preserve images from streaming chunks by @Chesars in #19654
- fix(docker): add libsndfile to main Dockerfile for ARM64 audio processing by @Chesars in #19776
- fix(proxy): add datadog_llm_observability to /health/services allowed… by @michelligabriele in #19952
- fix(proxy): prevent provider-prefixed model leaks by @bcdonadio in #19943
- fix(hosted_vllm): route through base_llm_http_handler to support ssl_verify by @cfchase in #19893
- Add OpenRouter Kimi K2.5 by @ayim in #19872
- Add test to check if model map is correctly formatted by @Sameerlite in #19992
- Add validation of model cost map as job by @Sameerlite in #19993
- Fix model map path in validation test by @Sameerlite in #19994
- Fix: litellm_fix_robotic_model_map_entry by @Sameerlite in #19997
- oss staging 01/28/2026 by @krrishdholakia in #19906
- Add custom_llm_provider as gemini translation by @Sameerlite in #19988
- [Fix] Sorting for /v2/model/info by @yuneng-jiang in #19971
- [Feature] Bulk Update Keys Endpoint by @yuneng-jiang in #19886
- [Fix] error_code in Spend Logs metadata by @yuneng-jiang in #20015
- [Feature] UI - Spend Logs: Show Current Store and Retention Status by @yuneng-jiang in #20017
- [Feature] UI - New Badge Dot Render by @yuneng-jiang in #20024
- Add event-driven coordination for global spend query to prevent cache stampede by @AlexsanderHamir in #20030
- [Feat] New Model - amazon.nova-2-pro-preview-20251202-v1:0 by @ishaan-jaff in #20033
- [Feat] LiteLLM x Claude Agent SDK Integration by @ishaan-jaff in #20035
- [Docs] Claude Agents SDK x LiteLLM Guide by @ishaan-jaff in #20036
- fix: run prisma generate as nobody user in non-root container by @milan-berri in #20000
- merge main in passthrough by @Sameerlite in #20042
- Add /openai_passthrough route for openai passthrough requests: by @Sameerlite in #19989
- fix(gemini): support file retrieval in GoogleAIStudioFilesHandle by @varunsripad123 in #20018
- fix(ResponseAPILoggingUtils): extract input tokens details as dict by @nht1206 in #20046
- Fix max_input_tokens for gpt-5.2-codex by @genga6 in #20009
- Litellm oss staging 01 29 2026 by @krrishdholakia in #19975
- feat: add /delete endpoint support for gemini by @Sameerlite in #20055
- Fix: Batch and File user level permissions by @Sameerlite in #19981
- [Feat] Add cost tracking and usage object in aretrieve_batch call type by @Sameerlite in #19986
- Add routing of xai chat completions to responses when web search options is present by @Sameerlite in #20051
- Add disable flag for anthropic gemini cache translation by @Sameerlite in #20052
- fix aspectRatio mapping in image edit by @Sameerlite in #20053
- Fix: vllm embedding format by @Sameerlite in #20056
- Fix: remove unsupported prompt-caching-scope-2026-01-05 header for vertex ai by @Sameerlite in #20058
- [Feature] UI - Usage: Model Breakdown Per Key by @yuneng-jiang in #20039
- Add mock client factory pattern and mock support for PostHog, Helicone, and Braintrust integrations by @AlexsanderHamir in #19707
- Realtime API benchmarks by @ishaan-jaff in #20074
- fixes: ci pipeline router coverage failure by @Harshit28j in #20065
- [cookbook] - add section for using claude agent sdk + MCPs with LiteLLM by @ishaan-jaff in #20081
- [Feat] Add async_post_call_response_headers_hook to CustomLogger by @ishaan-jaff in #20083
- fix(proxy): resolve high CPU when router_settings in DB by avoiding REGISTRY.collect() in PrometheusServicesLogger by @AlexsanderHamir in #20087
- Revert logs view commits by @ishaan-jaff in #20090
- [Fix] UI - Navbar: Fixed Default Logo + Bound Logo Box by @yuneng-jiang in #20092
- [Refactor] UI - Navbar: User Dropdown by @yuneng-jiang in #20095
- [Feat] v2 - Logs view with side panel and improved UX by @ishaan-jaff in #20091
- [Feat] UI - New View to render "Tools" on Logs View by @ishaan-jaff in #20093
- [Feat] UI - Add Pretty print view of request/response by @ishaan-jaff in #20096
- fixed mcp tools instructions on ui to show comma separated str instead… by @shivamrawat1 in #20101
- litellm_fix: add missing timezone import to proxy_server.py by @shin-bot-litellm in #20121
- litellm_fix(proxy): reduce PLR0915 complexity (minimal) by @shin-bot-litellm in #20127
- litellm_fix(ui): remove unused ToolOutlined import by @shin-bot-litellm in #20129
- litellm_fix(e2e): disable bedrock-converse-claude-sonnet-4.5 in tests by @shin-bot-litellm in #20131
- litellm_fix(test): fix Azure AI cost calculator test - use Logging class by @shin-bot-litellm in #20134
- litellm_fix(test): fix Bedrock tool search header test regression by @shin-bot-litellm in #20135
- litellm_fix(test): allow comment field in schema and exclude robotics models from tpm check by @shin-bot-litellm in #20139
- litellm_docs: add missing environment variable documentation by @shin-bot-litellm in #20138
- litellm_fix(test): add acancel_batch to Azure SDK client initialization test by @shin-bot-litellm in #20143
- litellm_fix: handle unknown models in Azure AI cost calculator by @shin-bot-litellm in https:/...
v1.81.0-patch2
Full Changelog: v1.81.0-stable...v1.81.0-patch2
v1.81.5-nightly
What's Changed
- [Feature] UI - Virtual Keys: Auto Truncation of Table Values by @yuneng-jiang in #19718
- fix: args issue & refactor into helper function to reduce bloat for b… by @Harshit28j in #19720
- fix(proxy): use return value from CustomLogger.async_post_call_success_hook by @michelligabriele in #19670
- bump: version 1.81.3 → 1.81.4 by @AlexsanderHamir in #19793
- fix: server root path by @Harshit28j in #19790
- refactor: extract transport context creation into separate method by @AlexsanderHamir in #19794
- Make test_get_users_key_count deterministic by creating dedicated test user by @AlexsanderHamir in #19795
- Add test for Router.get_valid_args, fix router code coverage encoding by @AlexsanderHamir in #19797
- Fix test_mcp_server_manager_config_integration_with_database cancellation error by @AlexsanderHamir in #19801
- fix: make HTTPHandler mockable in OIDC secret manager tests by @AlexsanderHamir in #19803
- Messages api bedrock converse caching and pdf support by @colinlin-stripe in #19785
- [Feat] enable progress notifications for MCP tool calls by @houdataali in #19809
- [Feat] CLI Auth - Add configurable CLI JWT expiration via environment variable by @ishaan-jaff in #19780
- fix(ui): prevent clearing content filter patterns when editing guardrail by @yogeshwaran10 in #19671
- [Infra] CI/CD - Fixing Flaky Tests in OIDC and Email by @yuneng-jiang in #19819
- Fix(#19781): Unable to reset user max budget to unlimited by @yogeshwaran10 in #19796
- CI/CD: Increase retries and stabilize litellm_mapped_tests_core by @AlexsanderHamir in #19826
- Fix broken mocks in 6 flaky tests to prevent real API calls by @AlexsanderHamir in #19829
Full Changelog: v1.81.3-nightly...v1.81.5-nightly
v1.81.3.rc.5
What's Changed
- Fix/nova grounding by @jquinter in #19598
- add Vertex_AI llm credentials sensitive keywords "vertex_credentials" by @ruanjf in #19551
- Bugfix/19481 num retries env var type by @moh-dev-stack in #19507
- Fix gpt-image-1.5 cost calculation not including output image tokens by @Chesars in #19515
- chore: Add context for volcengine models (deepseek-v3-2, glm-4-7, kimi-k2-thinking) by @yfge in #19335
- feat(datadog): add agent support for LLM Observability by @Harshit28j in #19574
- feat: add datadog cost management support and fix startup callback issue by @Harshit28j in #19584
- fix: recursive pydantic issue by @Harshit28j in #19531
- feat(vercel_ai_gateway): add embeddings support by @Chesars in #19660
- feat: support role_mappings from environment variables by @formorter in #19498
- a2a agent Header-Based Context Propagation by @houdataali in #19504
- fix(ui): Change default key type from 'Default' to 'LLM API' for impr… by @priyam-that in #19516
- fix(rag): use router for completion in RAG query pipeline by @marcosgriselli in #19550
- [Fix] Password comparison with non-ASCII characters by @jgreek in #19568
- Feature/guardrail model argument by @jquinter in #19619
- fix: propagate JWT auth metadata to OTEL spans by @Harshit28j in #19627
- fix(docker): use correct schema path for prisma generation by @Harshit28j in #19631
- [Feature] UI - Virtual Keys: Auto Truncation of Table Values by @yuneng-jiang in #19718
- fix: args issue & refactor into helper function to reduce bloat for b… by @Harshit28j in #19720
- fix(proxy): support slashes in google generateContent model names by @jayy-77 in #19737
- Fix/non standard mcp url pattern by @jquinter in #19738
- add callbacks and labels to prometheus by @Harshit28j in #19708
- feat: add clientip and user agent in metrics by @Harshit28j in #19717
- fix: optimize logo fetching and resolve mcp import blockers by @Harshit28j in #19719
- feat: tpm-rpm limit in prometheus metrics by @Harshit28j in #19725
- add timeout to onyx guardrail by @tamirkiviti13 in #19731
- fix(proxy): use return value from CustomLogger.async_post_call_success_hook by @michelligabriele in #19670
- bump: version 1.81.3 → 1.81.4 by @AlexsanderHamir in #19793
- fix: server root path by @Harshit28j in #19790
- refactor: extract transport context creation into separate method by @AlexsanderHamir in #19794
- Make test_get_users_key_count deterministic by creating dedicated test user by @AlexsanderHamir in #19795
- Add test for Router.get_valid_args, fix router code coverage encoding by @AlexsanderHamir in #19797
- Fix test_mcp_server_manager_config_integration_with_database cancellation error by @AlexsanderHamir in #19801
- fix: make HTTPHandler mockable in OIDC secret manager tests by @AlexsanderHamir in #19803
- Messages api bedrock converse caching and pdf support by @colinlin-stripe in #19785
- [Feat] enable progress notifications for MCP tool calls by @houdataali in #19809
- [Feat] CLI Auth - Add configurable CLI JWT expiration via environment variable by @ishaan-jaff in #19780
- fix(ui): prevent clearing content filter patterns when editing guardrail by @yogeshwaran10 in #19671
- [Infra] CI/CD - Fixing Flaky Tests in OIDC and Email by @yuneng-jiang in #19819
- Fix(#19781): Unable to reset user max budget to unlimited by @yogeshwaran10 in #19796
- CI/CD: Increase retries and stabilize litellm_mapped_tests_core by @AlexsanderHamir in #19826
- Fix broken mocks in 6 flaky tests to prevent real API calls by @AlexsanderHamir in #19829
- [Feat] Add UI for /rag/ingest API - upload docs, pdfs etc to create vector stores by @ishaan-jaff in #19822
- docs(readme): add OpenAI Agents SDK to OSS Adopters by @Chesars in #19820
- [Fix] UI - Create Key: Expire Key Input Duration by @yuneng-jiang in #19807
- [Feature] UI - Feedback Prompts: Option To Hide Prompts by @yuneng-jiang in #19831
- [Feature] UI - Add Light/Dark Mode Switch for Development by @yuneng-jiang in #19804
- [Fix] UI - Internal User: Bulk Add by @yuneng-jiang in #19721
- Litellm release notes 01 26 2026 by @krrishdholakia in #19836
- Litellm release notes 01 26 2026 by @krrishdholakia in #19838
- feat: Add model_id label to Prometheus metrics (#18048) by @krrishdholakia in #19678
- fix(models): set gpt-5.2-codex mode to responses for Azure and OpenRouter by @Chesars in #19770
- fix(responses): fix custom_llm_provider for provider-specific params by @Chesars in #19798
- fix(azure): use generic cost calculator for audio token pricing by @Chesars in #19771
- fix(xai): correct cached token cost calculation for xAI models by @Chesars in #19772
- fix: token calculations and refactor by @Harshit28j in #19696
- fix(prometheus): safely handle None metadata in logging to prevent At… by @Harshit28j in #19691
- fix: resolve 'does not exist' migration errors as applied in setup_da… by @Harshit28j in #19281
- Fix /batches to return encoded ids (from managed objects table) by @ephrimstanley in #19040
- Fix: ensure function content is valid JSON for GigaChat by @natimofeev in #19232
- Fix: A2A Python SDK URL by @Sameerlite in #19832
- Litellm oss staging 01 23 2026 by @krrishdholakia in #19617
- Litellm oss staging 01 24 2026 by @krrishdholakia in #19692
- fix(proxy): support slashes in google generateContent model names (#1… by @krrishdholakia in #19753
- Add grok reasoning content by @Sameerlite in #19850
- Fix: Stream the download in chunks for image handling by @Sameerlite in #19847
- Add Gemini Robotics-ER 1.5 preview support by @Sameerlite in #19845
- [Docs] Add sarvam usage documentation by @Sameerlite in #19844
- Translate advanced-tool-use to Bedrock-specific headers for Claude Opus 4.5 by @Sameerlite in #19841
- feat: add feature to make silent calls by @Harshit28j in #19544
- fix(enterprise): correct error message for DISABLE_ADMIN_ENDPOINTS by @michelligabriele in #19861
- fix(proxy): handle agent parameter in /interactions endpoint by @michelligabriele in #19866
- [Fix] /team/member_add User Email and ID Verifications by @yuneng-jiang in #19814
- [Fix] SSO Email Case Sensitivity by @yuneng-jiang in #19799
- fix(bedrock): support tool search header translation for Sonnet 4.5 by @michelligabriele in #19871
- [Feat] RAG API - Add support for using s3 Vectors as Vector Store Provider for /rag/ingest by @ishaan-jaff in #19888
- [Feat] RAG API - Add s3_vectors as provider on /vector_store/search API + UI for creating + PDF support for /rag/ingest by @ishaan-jaff in #19895
- [Feature] Add sortBy and sortOrder params for /v2/model/info by @yuneng-jiang in #19903
- [Infra] CI/CD - Fixing UI Tests by @yuneng-jiang in #19905
- #19351 Add Init Containers in the community helm chart by @pragyasardana in #19816
- 📖 docs: remove enterprise requirement for guardrail monitoring by @ryewilson in #19833
- merge 01 27 by @Sameerlite in https://git...
v1.81.3.dev1
Full Changelog: v1.81.3.rc.4...v1.81.3.dev1
v1.81.0-patch
Full Changelog: v1.81.0-stable...v1.81.0-patch
v1.81.3.rc.4
Full Changelog: v1.81.3.rc.3...v1.81.3.rc.4
v1.81.3.rc.3
Full Changelog: v1.81.3.rc.2...v1.81.3.rc.3
v1.81.3.rc.2
Full Changelog: v1.81.3-nightly...v1.81.3.rc.2
v1.81.3.rc.1
Full Changelog: v1.81.3.rc...v1.81.3.rc.1