- Update default API version from 2024-05-01-preview to 2025-04-01-preview
(required for o-series and GPT-5 models)
- Remove NeedsRawMode override that always returned false, inheriting parent
logic that correctly skips temperature/top_p for reasoning models
- Add /responses route to deployment middleware for future v1 API support
Tested against Azure OpenAI gpt-5.2-chat deployment.
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>