moreminimore-vibe

Author	SHA1	Message	Date
Will Chen	d96e95c1da	Support web search (#1370) ## Summary by cubic Adds web search to Dyad Pro chats with a new UI, tag parsing, and a Pro Mode toggle that wires through to the engine. - New Features - Pro Mode toggle: “Web Search” (settings.enableProWebSearch). - New custom tags: dyad-web-search, dyad-web-search-result, dyad-read. - Collapsible Web Search Result UI with in-progress badge and markdown rendering. - Engine integration: passes enable_web_search and activates DyadEngine when web search is on.	2025-09-24 19:39:39 -07:00
Will Chen	a8e9caf7b0	Turbo models (#1249) ## Summary by cubic Adds “Dyad Turbo” models for Pro users and centralizes model/provider constants. Pro users can pick fast, cost‑effective models directly from the ModelPicker, with clearer labels and gating. - New Features - Added Dyad Turbo provider in ModelPicker with Qwen3 Coder and Kimi K2 (Pro only). - Turbo options are hidden for non‑Pro users; “Pro only” badge shown where applicable. - “Smart Auto” label now applies only to the Auto model to avoid confusion. - Refactors - Moved all model/provider constants into language_model_constants.ts and updated imports (helpers, client, thinking utils).	2025-09-10 15:59:54 -07:00
Will Chen	72acb31d59	More free models (#1244) ## Summary by cubic Adds support for free OpenRouter models and a new “Free (OpenRouter)” auto option that fails over across free models for reliability. Improves setup flow and UI with provider cards, a “Free” price badge, and an OpenRouter setup prompt in chat. - New Features - Added OpenRouter free models: Qwen3 Coder (free), DeepSeek v3 (free), DeepSeek v3.1 (free), marked with dollarSigns=0 and a “Free” badge. - New auto model: “Free (OpenRouter)” that uses a fallback client to cycle through free models with smart retry on transient errors. - New SetupProviderCard component and updated SetupBanner with dedicated Google and OpenRouter setup cards. - Chat shows an OpenRouter setup prompt when “Free (OpenRouter)” is selected and OpenRouter isn’t configured. - New PriceBadge component in ModelPicker to display “Free” or price tier. - E2E: added setup flow test and option to show the setup screen in tests. - Model updates: added DeepSeek v3.1, updated Kimi K2 to kimi-k2-0905, migrated providers to LanguageModelV2.	2025-09-10 14:20:17 -07:00
Adeniji Adekunle James	f8ec10ec6b	feat: add xAI (Grok) as AI provider (#1209) # Add xAI (Grok) Provider Support ## Overview This PR adds support for xAI's Grok models as an AI provider, focusing on coding-optimized models. ## Changes Made ### Provider Configuration (`language_model_helpers.ts`) - Added xAI to `MODEL_OPTIONS` with 3 coding-focused models: - `grok-code-fast-1`: Fast, economical coding model (256k context) - `grok-4`: Most capable flagship model (256k context) - `grok-3`: Powerful coding model (131k context) ## Dependencies - Requires `@ai-sdk/xai` package (already imported) - Uses existing provider pattern and infrastructure ## Why xAI for Coding? xAI's Grok models have shown impressive results in coding benchmarks: - Trained on high-quality programming datasets reflecting real-world tasks - Excels at agentic coding workflows with fast reasoning capabilities - Strong performance across multiple programming languages (TypeScript, Python, Java, Rust, C++, Go) - Achieved 70.8% on SWE-Bench-Verified using internal evaluation - Optimized for rapid iteration in development environments --- ## Summary by cubic Adds xAI (Grok) as a provider so users can pick Grok coding models in the app. Integrates provider config, client wiring, and schema updates. - New Features - Added xAI provider with env var mapping (XAI_API_KEY) and provider metadata. - Exposed models: grok-code-fast-1 (256k), grok-4 (256k), grok-3 (131k). - Hooked up get_model_client to use @ai-sdk/xai (createXai). - Included "xai" in validation schemas and model options. - Migration - Set XAI_API_KEY to enable xAI. --------- Co-authored-by: Will Chen <willchen90@gmail.com>	2025-09-08 23:01:59 -07:00
Samrat Jha	938595aab2	Add support for Amazon Bedrock provider (#1185) - follows existing patterns for AI SDK to provide Bedrock integration - Uses Bedrock's API token feature for authentication which provides a standard experience - bedrock provided models match the Anthropic provided models (for now) Disclaimer: The contributing docs are extremely sparse. I don't actually know how to build this and get this running in Electron ## Testing - AWS Bedrock provider is available for selection - The provider settings also show the right models and offer the right env variable to use --- ## Summary by cubic Adds AWS Bedrock as a provider so users can run Claude models via Bedrock with API token authentication. The settings now list Bedrock with supported models and a new env var. - New Features - New provider: bedrock using @ai-sdk/amazon-bedrock, wired into model client and schemas. - Models: Claude 4 Sonnet, Claude 3.7 Sonnet, Claude 3.5 Sonnet (Bedrock model IDs). - Settings: shows AWS Bedrock with correct models and env var AWS_BEARER_TOKEN_BEDROCK. - Default region: us-east-1. - Migration - Set AWS_BEARER_TOKEN_BEDROCK with your Bedrock API token. - Select AWS Bedrock in settings and pick a model. Co-authored-by: Samrat Jha <samratj@amazon.com> Co-authored-by: Will Chen <willchen90@gmail.com>	2025-09-08 22:52:12 -07:00
Md Rakibul Islam Rocky	4db6d63b72	Add Google Vertex AI provider (#1163) # Summary * Adds first-class Google Vertex AI provider using `@ai-sdk/google-vertex`. * Supports Gemini 2.5 models and partner MaaS (Model Garden) models via full publisher IDs. * New Vertex-specific settings UI for Project, Location, and Service Account JSON. * Implements a “thinking” toggle for Gemini 2.5 Flash * Pro: always on * Flash: toggleable * Flash Lite: none * Fixes “AI not found” for Vertex built-ins by mapping to `publishers/google` paths. * Hardens cross-platform file ops and ensures all tests pass. --- # What’s New ### Vertex AI Provider * Uses `@ai-sdk/google-vertex` with `googleAuthOptions.credentials` from pasted Service Account JSON. * Configurable project and location. * Base URL → `/projects/{project}/locations/{location}` * Built-ins: `publishers/google/models/<id>` * Partner MaaS: `publishers/<partner>/models/...` ### Built-in Vertex Models * `gemini-2.5-pro` * `gemini-2.5-flash` * `gemini-2.5-flash-lite` ### Thinking Behavior * Vertex + Google marked as thinking-capable. * Pro: always thinking * Flash: toggle in UI * Flash Lite: none ### Vertex Settings UI * New Google Vertex AI panel for Project ID, Location, Service Account JSON. * Keys encrypted like other secrets. --- # Fixes * Model resolution: built-ins auto-map to `publishers/google/models/<id>`. * Partner MaaS support: full publisher IDs work directly (e.g. DeepSeek). * Cross-platform paths: normalize file ops with `toPosixPath`, preserve `safeJoin` semantics. --- # Why This Is Better * Users can select Vertex alongside other providers. * More models available through Model Garden. * Dedicated setup UI reduces misconfig. * Thinking toggle gives control over cost vs. reasoning depth.
--- # Files Changed * Provider & Models: `language_model_helpers.ts`, `get_model_client.ts` * Streaming: `chat_stream_handlers.ts` * Schemas & Encryption: `schemas.ts`, `settings.ts` * Settings UI: `VertexConfiguration.tsx`, `ApiKeyConfiguration.tsx` * Models UI: `ModelsSection.tsx` (Flash toggle) * Setup Detection: `useLanguageModelProviders.ts` * Path Utils: `path_utils.ts`, `response_processor.ts` * Deps: `package.json` → `@ai-sdk/google-vertex@3.0.16` --- # Tests & Validation * TypeScript: `npm run ts` → ✅ * Lint: `npm run lint` → ✅ * Unit tests: `npm test` → ✅ 231 passed, 0 failed --- # Migration / Notes * No breaking changes. * For Vertex usage: * Ensure Vertex AI API is enabled. * Service Account needs `roles/aiplatform.user`. * Region must support model (e.g. `us-central1`). * Thinking toggle currently affects only Gemini 2.5 Flash. --- # Manual QA 1. Configure Vertex with Project/Location/Service Account JSON. 2. Test built-ins: * `gemini-2.5-pro` * `gemini-2.5-flash` (toggle on/off) * `gemini-2.5-flash-lite` 3. Test MaaS partner model (e.g., DeepSeek) via full publisher ID. 4. Verify other providers remain unaffected. --- ## Summary by cubic Adds a first-class Google Vertex AI provider with Gemini 2.5 models, a Vertex settings panel, and a “thinking” toggle for Gemini 2.5 Flash. Also fixes model resolution for Vertex and hardens cross-platform file operations. - New Features - Vertex AI provider via @ai-sdk/google-vertex with project, location, and service account JSON. - Built-in models: gemini-2.5-pro, gemini-2.5-flash, gemini-2.5-flash-lite. - “Thinking” support: Pro always on; Flash toggle in Models UI; Flash Lite none. - MaaS partners supported via full publisher paths (e.g., publishers/<partner>/models/...). - Vertex settings UI with encrypted service account key storage. - Bug Fixes - Built-in Vertex models auto-map to publishers/google/models/<id>.
- Consistent file ops across platforms using toPosixPath. - Vertex readiness detection requires project/location/service account JSON. - Streaming “thinking” behavior respects Vertex Flash toggle and Pro always-on. --------- Co-authored-by: Md Rakibul Islam Rocky <mdrirocky08@gmail.com> Co-authored-by: graphite-app[bot] <96075541+graphite-app[bot]@users.noreply.github.com> Co-authored-by: Will Chen <willchen90@gmail.com>	2025-09-08 22:41:12 -07:00
Will Chen	56d0e76790	Make balanced smart context option the default (#1186) ## Summary by cubic Set “balanced” as the default smart context mode. Users now get balanced when Smart Files Context is enabled and no mode is set; “conservative” must be explicitly selected. - Refactors - Default fallback to balanced in UI and engine (proSmartContextOption undefined -> "balanced"). - ProModeSelector saves "conservative" explicitly; selector reads undefined as balanced. - Updated schema and types to allow "balanced" | "conservative". - Engine payload now includes smart_context_mode with "balanced" by default; e2e tests and snapshots updated. - Migration - No action needed. Existing users without an explicit mode will use balanced by default; selecting conservative persists.	2025-09-04 11:06:46 -07:00
Tanner-Maasen	2ffbbbca8f	Add Azure OpenAI Custom Model Integration (#1001) Fixes #710 This PR implements comprehensive Azure OpenAI integration for Dyad, enabling users to leverage Azure OpenAI models through proper environment variable configuration. The implementation adds Azure as a supported provider with full integration into the existing language model architecture, including support for GPT-5 models. Key features include environment-based configuration using `AZURE_API_KEY` and `AZURE_RESOURCE_NAME`, specialized UI components that provide clear setup instructions and status indicators, and seamless integration with Dyad's existing provider system. The Azure provider leverages the @ai-sdk/azure package (v1.3.25) for compatibility with the current TypeScript language model interfaces. The implementation includes robust error handling for missing configuration, comprehensive test coverage with 9 new unit tests covering critical functionality like model client creation and error scenarios, and an E2E test for the Azure-specific settings UI. --------- Co-authored-by: graphite-app[bot] <96075541+graphite-app[bot]@users.noreply.github.com> Co-authored-by: Will Chen <willchen90@gmail.com>	2025-08-30 20:47:25 -07:00
Will Chen	4e9a927a7b	smart context v3 (#1022) ## Summary by cubic Adds Smart Context v3 with selectable modes (Off, Conservative, Balanced) and surfaces token savings in chat. Also improves token estimation by counting per-file tokens when Smart Context is enabled. - New Features - Smart Context selector in Pro settings with three options. Conservative is the default when enabled without an explicit choice. - New setting: proSmartContextOption ("balanced"); undefined implies Conservative. - Engine now receives enable_smart_files_context and smart_context_mode. - Chat shows a DyadTokenSavings card when the message contains token-savings?original-tokens=...&smart-context-tokens=..., with percent saved and a tooltip for exact tokens. - Token estimation uses extracted file contents for accuracy when Pro + Smart Context is on; otherwise falls back to formatted codebase output.	2025-08-20 14:16:07 -07:00
Will Chen	d535db6251	Upgrade to AI sdk with codemod (#1000)	2025-08-18 22:21:27 -07:00
Will Chen	ab757d2b96	Add GPT 5 support (#902)	2025-08-11 15:00:02 -07:00
Will Chen	2ea9500f73	Configurable thinking budget (default to medium) (#494)	2025-06-25 15:36:05 -07:00
Will Chen	5d678c2ead	Smart auto (#476)	2025-06-23 23:08:29 -07:00
Will Chen	c879a4e01d	always use engine if pro modes are enabled (#449)	2025-06-19 12:04:24 -07:00
Will Chen	9fbd7031d9	build ask mode (#444)	2025-06-19 10:42:51 -07:00
Will Chen	897d2e522c	enable engine for all models (#434)	2025-06-18 17:17:29 -07:00
Will Chen	e326f14779	Revert "Revert "Update gemini 2.5 models to GA variants (#425)" (#429)" (#437) This reverts commit `ff4e93d747`.	2025-06-18 17:11:17 -07:00
Will Chen	30b5c0d0ef	Replace thinking with native Gemini thinking summaries (#400) This uses Gemini's native [thinking summaries](https://cloud.google.com/vertex-ai/generative-ai/docs/thinking#thought-summaries) which were recently added to the API. Why? The grafted thinking would sometimes cause weird issues where the model, especially Gemini 2.5 Flash, got confused and put dyad tags like `<dyad-write>` inside the `<think>` tags. This also improves the UX because you can see the native thoughts rather than having the Gemini response load for a while without any feedback. I tried adding Anthropic extended thinking, however it requires temp to be set at 1, which isn't ideal for Dyad's use case where we need precise syntax following.	2025-06-16 17:29:32 -07:00
Will Chen	67dc9f4c42	Print engine/gateway URL more clearly (#396)	2025-06-13 10:05:12 -07:00
Will Chen	fa80014e16	Remove budget saver mode (#378) This code was quite complex and hairy and resulted in very opaque errors (for both free and pro users). There's not much benefit to budget saver because Google removed 2.5 Pro free quota a while ago (after it graduated the model from experimental to preview). Dyad Pro users can still use 2.5 Flash free quota by disabling Dyad Pro by clicking on the Dyad Pro button at the top.	2025-06-10 13:54:27 -07:00
Will Chen	8a743ca4f5	LM studio e2e test (#297)	2025-05-31 23:04:28 -07:00
Will Chen	af7d6fa9f8	Create ollama e2e test (#296)	2025-05-31 22:01:48 -07:00
Will Chen	8cfd476ea9	Fix engine enabled (#255)	2025-05-27 00:10:49 -07:00
Will Chen	0f6f069e43	Update Flash to 05-20 (#226)	2025-05-22 10:37:26 -07:00
Will Chen	f9f33596bd	Smart files context (#184)	2025-05-16 22:21:45 -07:00
Will Chen	7bcb68e87d	Fix model client gateway prefix check (openAI erroneously not using dyad gateway for dyad pro) (#174)	2025-05-15 16:31:52 -07:00
Will Chen	09fc028f94	Mark which models are eligible for turbo edits (#172)	2025-05-15 16:02:42 -07:00
Will Chen	bbf4bb765c	Allow overriding gateway URL (#169)	2025-05-15 15:09:45 -07:00
Will Chen	35b459d82d	Support turbo edits (pro) (#166)	2025-05-14 23:35:50 -07:00
Will Chen	d545babb63	Quick fix for Google models (#160)	2025-05-13 22:21:27 -07:00
Will Chen	069c221292	Implement saver mode (#154)	2025-05-13 15:34:41 -07:00
Will Chen	f628c81f4c	Refactor constants/models and inline (#143)	2025-05-12 22:20:16 -07:00
Will Chen	877c8f7f4f	Simplify provider logic and migrate getContextWindow (#142)	2025-05-12 22:18:49 -07:00
Will Chen	cd7eaa8ece	Prep for custom models: support reading custom providers (#131)	2025-05-12 14:52:48 -07:00
Will Chen	2537fbb342	lint using oxlint (#106)	2025-05-08 17:21:35 -07:00
Will Chen	0d56651220	Run prettier on everything (#104)	2025-05-06 23:02:28 -07:00
Will Chen	744ea68ac8	fix auto model logic so that dyad pro key doesn't error	2025-05-06 22:12:59 -07:00
Piotr Wilkin (ilintar)	5fc49231ee	Add LM Studio support (#22)	2025-05-02 14:51:32 -07:00
Will Chen	e65b80bcfa	Set explicit max output tokens to avoid truncated responses (#31)	2025-04-28 13:43:34 -07:00
Will Chen	04b9f81647	Do not use free variant for dyad pro gateway	2025-04-26 11:12:20 -07:00
Will Chen	4848b2f085	Add toggle for disabling Dyad Pro (#24) * Add toggle for disabling Dyad Pro * Entering key for Dyad pro enables dyad pro	2025-04-26 10:13:35 -07:00
Will Chen	2ad10ba039	Support LLM gateway with Dyad API key (#23) * Do not make API key input (password) - hurts usability * Support LLM gateway (and add GPT 4.1 mini model) * Show Dyad Pro button * Fix to use auto (not dyad) for detecting dyad pro * Fix description of gpt 4.1-mini	2025-04-26 08:52:08 -07:00
Will Chen	b616598bab	Add ollama support (#7)	2025-04-23 14:48:57 -07:00
Will Chen	658d4e0bde	proper secret encryption	2025-04-14 23:15:58 -07:00
Will Chen	6ca060d207	Fix env var handling for macOS	2025-04-11 10:33:10 -07:00
Will Chen	43f67e0739	Initial open-source release	2025-04-11 09:38:16 -07:00

46 Commits
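
The Vertex AI commit (4db6d63b72) describes its model resolution only in prose: built-in IDs map to `publishers/google/models/<id>`, full publisher IDs pass through unchanged, and the base path is `/projects/{project}/locations/{location}`. The sketch below illustrates that rule under stated assumptions. The function names `resolveVertexModelPath` and `vertexBasePath` are hypothetical and are not taken from the repository, and treating any ID that starts with `publishers/` as a full publisher ID is also an assumption.

```typescript
// Hypothetical helper (name and signature are illustrative, not from the repo).
// Assumption: a "full publisher ID" is any ID that already starts with "publishers/".
function resolveVertexModelPath(modelId: string): string {
  // Partner MaaS models given as full publisher IDs are passed through unchanged.
  if (modelId.startsWith("publishers/")) {
    return modelId;
  }
  // Built-in IDs such as "gemini-2.5-flash" are placed under the Google publisher.
  return `publishers/google/models/${modelId}`;
}

// Hypothetical helper that builds the base path quoted in the commit message.
function vertexBasePath(project: string, location: string): string {
  return `/projects/${project}/locations/${location}`;
}

// Usage with placeholder project and location values.
console.log(vertexBasePath("my-project", "us-central1"));
console.log(resolveVertexModelPath("gemini-2.5-flash"));
```

Keeping the mapping in one small pure function would make the "built-in versus partner" decision easy to unit test, independent of any provider client.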