Replace thinking with native Gemini thinking summaries (#400)
This switches to Gemini's native [thinking summaries](https://cloud.google.com/vertex-ai/generative-ai/docs/thinking#thought-summaries), which were recently added to the API. Why? The grafted thinking sometimes confused the model, especially Gemini 2.5 Flash, into putting dyad tags like `<dyad-write>` inside the `<think>` tags. It also improves the UX: you can see the native thoughts instead of waiting for the Gemini response to load without any feedback. I tried adding Anthropic extended thinking too, but it requires the temperature to be set to 1, which isn't ideal for Dyad's use case, where we need precise syntax following.
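As a rough illustration of why native summaries avoid the tag-mixing problem, here is a minimal sketch, not the code from this commit. It assumes the request enables summaries via `thinkingConfig.includeThoughts` and that thought-summary parts in the response carry a `thought: true` flag, as described in the linked docs. The `Part` interface and `splitThoughts` helper are hypothetical names introduced only for this example.

```typescript
// Hypothetical, simplified shape of a Gemini response part.
// Assumption: thought-summary parts are flagged with `thought: true`.
interface Part {
  text?: string;
  thought?: boolean;
}

// Request-side setting that asks the model to return thought summaries.
// Field names follow the linked docs; how it is wired into Dyad is not shown here.
const generationConfig = {
  thinkingConfig: { includeThoughts: true },
};

// Separate thought summaries from the actual answer, so dyad tags are only
// ever parsed from the answer text and never from the thoughts.
function splitThoughts(parts: Part[]): { thoughts: string; answer: string } {
  let thoughts = "";
  let answer = "";
  for (const part of parts) {
    if (!part.text) continue; // skip non-text parts
    if (part.thought) {
      thoughts += part.text;
    } else {
      answer += part.text;
    }
  }
  return { thoughts, answer };
}
```

Because the API marks thoughts structurally, the UI can stream `thoughts` as feedback while only `answer` is scanned for tags like `<dyad-write>`.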
33
e2e-tests/snapshots/gateway.spec.ts_gemini-2-5-flash-1.txt
Normal file
File diff suppressed because one or more lines are too long