feat: publish 3 articles from today's news digest

1. ChatGPT เลือกแหล่งข้อมูลยังไง? (Source Selection - GEO) 2. ChatGPT Instant vs Thinking Mode 75% ต่างกัน (Reasoning Modes) 3. DeepSeek DSpark + Devin Fusion: ลดต้นทุน AI 85% (Cost Revolution) Categories: SEO (2), AI & Marketing (1) Images: 9 (3 featured + 6 inline) Source: news digest 2026-07-01
2026-07-01 14:25:11 +07:00
parent abafd4015f
commit 28d2be03f0
12 changed files with 574 additions and 0 deletions
--- a/public/images/blog/2026-07-01-chatgpt-reasoning-modes-overlap-featured.png
+++ b/public/images/blog/2026-07-01-chatgpt-reasoning-modes-overlap-featured.png
--- a/public/images/blog/2026-07-01-chatgpt-reasoning-modes-overlap-source-stacks-comparison.png
+++ b/public/images/blog/2026-07-01-chatgpt-reasoning-modes-overlap-source-stacks-comparison.png
--- a/public/images/blog/2026-07-01-chatgpt-reasoning-modes-overlap-venn-overlap.png
+++ b/public/images/blog/2026-07-01-chatgpt-reasoning-modes-overlap-venn-overlap.png
--- a/public/images/blog/2026-07-01-chatgpt-source-selection-fan-out-queries.png
+++ b/public/images/blog/2026-07-01-chatgpt-source-selection-fan-out-queries.png
--- a/public/images/blog/2026-07-01-chatgpt-source-selection-featured.png
+++ b/public/images/blog/2026-07-01-chatgpt-source-selection-featured.png
--- a/public/images/blog/2026-07-01-chatgpt-source-selection-source-pipelines.png
+++ b/public/images/blog/2026-07-01-chatgpt-source-selection-source-pipelines.png
--- a/public/images/blog/2026-07-01-deepseek-dspark-devin-fusion-featured.png
+++ b/public/images/blog/2026-07-01-deepseek-dspark-devin-fusion-featured.png
--- a/public/images/blog/2026-07-01-deepseek-dspark-devin-fusion-speculative-decoding-flow.png
+++ b/public/images/blog/2026-07-01-deepseek-dspark-devin-fusion-speculative-decoding-flow.png
--- a/public/images/blog/2026-07-01-deepseek-dspark-devin-fusion-two-agent-architecture.png
+++ b/public/images/blog/2026-07-01-deepseek-dspark-devin-fusion-two-agent-architecture.png
--- a/src/content/blog/2026-07-01-chatgpt-reasoning-modes-overlap.md
+++ b/src/content/blog/2026-07-01-chatgpt-reasoning-modes-overlap.md
@@ -0,0 +1,170 @@
 ---
 title: "ChatGPT Instant vs Thinking Mode: 75% ของ Sources ที่ใช้ ไม่เหมือนกัน"
 description: "Semrush study เผย ChatGPT Instant mode กับ Thinking mode cite sources ต่างกัน 75% — 99 domains โผล่เฉพาะใน Thinking mode ที่ไม่เคยมีใน Instant mode"
 pubDate: 2026-07-01
 category: "SEO"
 heroImage: "/images/blog/2026-07-01-chatgpt-reasoning-modes-overlap-featured.png"
 tags: ["ChatGPT", "Reasoning-Modes", "Semrush-Study", "AI-Visibility", "GEO"]
 draft: false
 ---
 # ChatGPT Instant vs Thinking Mode: 75% ของ Sources ที่ใช้ ไม่เหมือนกัน
 > ถ้าคุณ optimize content ให้ติดแค่ mode เดียว — คุณหายไป 75% แบบไม่รู้ตัว
 ถ้าคุณคิดว่า ChatGPT คือ search engine ตัวเดียวที่ทำงานเหมือนเดิมทุกครั้ง — คุณคิดผิด
 Kevin Indig (Growth Memo) และทีม Semrush ร่วมกันวิเคราะห์ข้อมูลจาก AI Visibility Toolkit ผ่าน 100 prompts ใน 4 หมวดหมู่ธุรกิจ (B2B SaaS, Finance, Consumer Tech, Health & Lifestyle) — แบ่งเป็น 20 buyer journeys 5 ขั้นตอน (Problem → Exploration → Comparison → Validation → Selection)
 และพวกเขาพบว่า **ChatGPT Instant mode กับ Thinking mode cite sources ต่างกัน 75%**
 ## 📚 สารบัญ
 - Only 25.6% ซ้ำ — สอง mode = สอง search engine
 - Citation rate + citations ต่อ response เพิ่มขึ้นเท่าตัว
 - 4.6x fan-out queries ใน Thinking mode
 - Source types: Reddit หาย, Government กับ Academic มา
 - Finance เปลี่ยนมากที่สุด, Consumer Tech เปลี่ยนน้อย
 - Full-funnel persistence — 4 ใน 20
 - Actionable Takeaways สำหรับ Content Strategists
 ---
 ## Only 25.6% ซ้ำ — นี่คือ "Different Search Engine"
 ![Venn diagram showing only 25% overlap between Instant and Thinking modes](/images/blog/2026-07-01-chatgpt-reasoning-modes-overlap-venn-overlap.png)
 ตัวเลขที่ช็อคที่สุดในงานวิจัยนี้:
 - **Only 25.6%** ของ cited domains ซ้ำกันระหว่าง Instant mode กับ Thinking mode
 - **เกือบ 3 ใน 4** ของ sources ที่ cited ต่างกันโดยสิ้นเชิง
 - Thinking mode ดึงข้อมูลจาก **173 unique domains** vs Instant mode **127 domains**
 - **99 domains** ปรากฏเฉพาะใน Thinking mode — ไม่เคยมีใน Instant mode เลย
 นี่ไม่ใช่แค่ "ปรับเพิ่มอีกนิด" — มันคือ **คนละ machine**
 | Metric | Instant Mode | Thinking Mode |
 |:---|:---:|:---:|
 | Unique domains cited | 127 | 173 |
 | Overlap | — | 25.6% |
 | Domains exclusive to this mode | 74 | 99 |
 ---
 ## Citation Rate + Citations ต่อ Response เพิ่มขึ้นเท่าตัว
 ![Source stacks comparison: Instant Mode (2.6 citations) vs Thinking Mode (4.5 citations)](/images/blog/2026-07-01-chatgpt-reasoning-modes-overlap-source-stacks-comparison.png)
 เมื่อเปิด Thinking mode:
 - **Citation rate**: 50% → 68% (+18 pp) — อัตราการตอบที่มี external sources เพิ่มขึ้น
 - **Average citations**: 2.6 → 4.5 — เกือบเท่าตัว
 - **Fan-out queries**: 4.6x — ยิง sub-queries เพื่อค้นหามากกว่าเดิม 4.6 เท่า
 แปลว่า Thinking mode **ค้นหาจริงจังกว่า** — มันไม่ shortcut ตอบจากความจำ แต่ออกไปเสิร์ช หา sources เพิ่ม อ่านเยอะขึ้น และ cite มากขึ้น
 ### ภาพรวมของ Query Pattern
 ใน Instant mode — ChatGPT จะยิง search query 1-2 ครั้ง รวบรัดได้ใจความ ใน Thinking mode — 1 คำถามของคุณกลายเป็น 15-40 การค้นหาย่อย ตามที่ Suganthan พบในงานวิจัยของ SEJ
 ---
 ## Source Types Shift: Reddit หาย, Government มา
 เมื่อเปิด reasoning — ลักษณะของ sources ที่ cited เปลี่ยนไปอย่างมีนัยยะ:
 | Source Type | Instant Mode | Thinking Mode | การเปลี่ยนแปลง |
 |:---|:---:|:---:|:---:|
 | **Reddit & UGC** | สัดส่วนสูง (cited มากที่สุด) | **ลดลง ~50%** | เสีย share ครึ่งนึง |
 | **Government (.gov)** | น้อย | เพิ่มขึ้นมาก | ได้ credibility boost |
 | **Academic (.edu, .org)** | ปานกลาง | เพิ่มขึ้นมาก | กลายเป็น primary sources |
 | **Official documentation** | ปานกลาง | เพิ่มขึ้นมาก | API docs, product docs |
 เหตุผล logical มาก: เมื่อ ChatGPT อยู่ในโหมด "คิดหนัก" — มันอยากได้ข้อมูลที่ **เชื่อถือได้** มากกว่า Reddit opinion ดังนั้นมันจึง shift ไปหา government sites, academic papers, และ official documentation แทน
 Reddit ยังคงถูก cited อยู่ — แต่สัดส่วนลดลง ขณะที่ Wikipedia ยังคงเป็น domain ที่ถูก cited มากที่สุดใน 11 จาก 12 ภาษาตามงานวิจัยของ arXiv
 ---
 ## Finance เปลี่ยนมากที่สุด, Consumer Tech แทบไม่เปลี่ยน
 Industry matters — แต่ละอุตสาหกรรมได้รับผลกระทบจาก reasoning mode ไม่เท่ากัน:
 | Industry | Instant Citation Rate | Thinking Citation Rate | Change |
 |:---|:---:|:---:|:---:|
 | **Finance** | ~45% | ~73% | **+28 pp** ⬆️ มากที่สุด |
 | B2B SaaS | ~52% | ~70% | +18 pp |
 | Health & Lifestyle | ~48% | ~65% | +17 pp |
 | Consumer Tech | ~55% | ~62% | +7 pp (น้อยที่สุด) |
 **ทำไม Finance ถึงเปลี่ยนมาก?** เพราะคำถามการเงินต้องการข้อมูลปัจจุบัน อัตราดอกเบี้ย, ราคาหุ้น, กฎหมายภาษี — Instant mode ที่ตอบจาก training data ไม่พอ ต้อง search จริงใน Thinking mode
 **ทำไม Consumer Tech แทบไม่เปลี่ยน?** เพราะคนถาม "iPhone รุ่นไหนดี" — ความคิดเห็นจาก Reddit และ review sites ก็พอแล้ว ไม่ต้องไปหา government sources
 ---
 ## Full-Funnel Persistence — 4 ใน 20
 นี่คือสิ่งที่อาจเปลี่ยนวิธีคิดเรื่อง content marketing ของคุณ:
 ใน **4 จาก 20 buyer journeys** (20%) — เมื่อใช้ Thinking mode — **แบรนด์เดียวกันปรากฏตั้งแต่ Problem → Exploration → Comparison → Validation → Selection**
 Instant mode แทบไม่เกิด pattern นี้
 **นัยยะ:** ถ้าคุณทำ content ที่ติดอันดับ Top-of-Funnel (ToFu) ใน Thinking mode — มีโอกาสที่แบรนด์คุณจะถูก cite ต่อเนื่องไปจนถึง Bottom-of-Funnel (BoFu) โดยที่คุณไม่ต้องทำ content BoFu ด้วยซ้ำ
 แต่นี่ใช้ได้กับ Thinking mode เท่านั้น — Instant mode ไม่มี persistence pattern นี้
 ---
 ## Actionable Takeaways สำหรับ Content Strategists
 ### 1. ต้อง optimize ทั้งสอง mode
 ไม่ใช่แค่เขียน content ดีๆ — แต่ต้องเข้าใจว่า **Instant mode กับ Thinking mode เลือก sources ต่างกัน**
 | Content Type | Instant Mode | Thinking Mode |
 |:---|:---|:---|
 | Reddit / UGC | 🟢 มีโอกาส cited สูง | 🟡 โอกาสลดลง ~50% |
 | Brand-owned / Blog | 🟡 ต้อง text-based | 🟡 ต้อง text-based |
 | Official docs / Whitepapers | 🔴 โอกาสต่ำ | 🟢 โอกาสสูงมาก |
 | Government / Academic | 🔴 โอกาสต่ำ | 🟢 โอกาสสูงมาก |
 ### 2. สร้าง multi-mode citation strategy
 คำถามที่คุณควรถามตัวเอง:
 - **คำถามในอุตสาหกรรมคุณถูกถามใน mode ไหนบ่อย?** — Finance → Thinking mode แทบ guarantee
 - **เนื้อหาแบบไหนที่แต่ละ mode ชอบ?** — Instant → Reddit, listicles | Thinking → Academic, official sources
 - **คุณติดทั้งสอง mode หรือแค่ mode เดียว?** — ใช้เครื่องมืออย่าง Semrush AI Visibility Toolkit เช็ค
 ### 3. ลงทุนกับ Top-of-Funnel content
 **— โดยเฉพาะถ้าคุณอยู่ในอุตสาหกรรมที่ Thinking mode มีบทบาทมาก (Finance, B2B)** — เพราะ 20% ของ journeys ใน Thinking mode แสดง persistence จาก ToFu → BoFu
 ### 4. เปิดใช้งาน Instant และ Thinking mode ในการ test
 อย่า test แค่ default mode — รัน prompt เดียวกันทั้งสอง mode แล้วดูว่า:
 - sources ที่ cited ต่างกันยังไง?
 - แบรนด์คุณโผล่ใน mode ไหน? หายไปจาก mode ไหน?
 - competitors ใครที่โผล่ใน mode ที่คุณไม่ติด?
 ### 5. Official documentation ≠ เนื้อหาการตลาด
 สำหรับ Thinking mode — การมี API documentation ที่ดี, technical whitepapers, และ product specs ที่อ่านได้ (plain HTML) — มีค่าเทียบเท่าการมี blog post ที่ viral
 ---
 ## สรุปภาพใหญ่
 **ChatGPT ไม่ใช่ระบบเดียว** — มันคือสอง search engines ในร่างเดียว:
 - **Instant mode**: เร็ว, ใช้ UGC/Reddit, citation rate 50%
 - **Thinking mode**: ช้า, ลึก, ชอบ official sources, citation rate 68%, 4.6x fan-out queries
 แค่ 25.6% ของ sources ซ้ำกัน — ถ้าคุณ optimize แค่ mode เดียว คุณกำลังหายไป 75%
 และนี่คือ GPT-5.2 — รุ่นถัดไปอาจแตกต่างกว่าเดิมอีก
 ---
 **ที่มา:**
 - Margarita Loktionova, "Only 25% of cited sources overlap between ChatGPT's different reasoning modes [Study]" — Semrush Blog, June 30, 2026
 - Kevin Indig, Growth Memo — data partnership with Semrush AI Visibility Toolkit
 - Suganthan Mohanadasan, "How ChatGPT Actually Picks Sources" — SEJ, June 30, 2026
--- a/src/content/blog/2026-07-01-chatgpt-source-selection.md
+++ b/src/content/blog/2026-07-01-chatgpt-source-selection.md
@@ -0,0 +1,187 @@
 ---
 title: "ChatGPT เลือกแหล่งข้อมูลยังไง? เปิด Network Traffic ดูของจริง"
 description: "เบื้องหลังการทำงานของ ChatGPT ในการเลือก sources — จาก network traffic จริง พบ result_source 4 แบบ, turn_use_case 6 ประเภท, และ 75% ของ citations ไม่ได้มาจากเว็บที่คุณคิด"
 pubDate: 2026-07-01
 category: "SEO"
 heroImage: "/images/blog/2026-07-01-chatgpt-source-selection-featured.png"
 tags: ["ChatGPT", "GEO", "AI-Search", "Source-Selection"]
 draft: false
 ---
 # ChatGPT เลือกแหล่งข้อมูลยังไง? เปิด Network Traffic ดูของจริง
 > การตลาดยุคใหม่ต้องเข้าใจว่า AI "เห็น" เรายังไง — ไม่ใช่แค่ Google Rank เท่านั้น
 ถ้าคุณเป็นคนทำ SEO หรือ Content Marketing คุณคงเคยได้ยินคำถามนี้:
 **"ทำยังไงให้แบรนด์เราโผล่ใน ChatGPT?"**
 คำตอบที่เราได้ยินซ้ำๆ คือ "เขียนคอนเทนต์ดีๆ", "ทำลิสต์icle", "ไปเม้น Reddit" — แต่ไม่มีใครเคยบอกว่า **เราไปอ่าน network traffic ของ ChatGPT จริงๆ ดูไหมว่า mechanism มันทำงานยังไง?**
 Suganthan Mohanadasan (Co-founder Snippet Digital) ทำแบบนั้นแหละ เขานั่งอ่าน JSON ที่ ChatGPT ส่งไปมายังบราวเซอร์ของเขา — ไม่ใช่แค่อ่าน output ที่เราเห็น แต่ดู layer ลึกลงไปว่า ChatGPT **ตัดสินใจดึงข้อมูลจากที่ไหน** และใช้เกณฑ์อะไร
 นี่คือสิ่งที่เขาค้นพบ
 ## 📚 สารบัญ
 - 4 ชั้นของผลลัพธ์ที่ ChatGPT ใช้
 - 6 ประเภทคำถาม — บางประเภทไม่ Search เลย
 - Fan-out Queries: 1 คำถาม → 40 การค้นหา
 - Fetched ≠ Cited ≠ Mentioned
 - สิ่งที่ ChatGPT มองไม่เห็น
 - ทำยังไงให้แบรนด์คุณโดนเลือก?
 - สรุป Actionable Takeaways
 ---
 ## 4 ชั้นของแหล่งข้อมูลใน ChatGPT
 ![ChatGPT source pipelines: SERP, Labrador, Bright Data, Oxylabs](/images/blog/2026-07-01-chatgpt-source-selection-source-pipelines.png)
 ใน network traffic ของ ChatGPT มี field หนึ่งชื่อ `result_source` — มันถูกติดไว้กับทุก web result ที่ ChatGPT ดึงมา โดยคุณไม่มีทางเห็น field นี้ใน answer ปกติ
 `result_source` มี 4 ค่า:
 | result_source | มันคืออะไร | ตัวอย่าง |
 |:---|:---|:---|
 | **serp** | Open web baseline — ส่วนใหญ่เป็นข่าว | Yahoo, StreetInsider |
 | **labrador** | Allowlist ของ publishers ที่มีดีลกับ OpenAI | Reuters, WSJ, FT, Wikipedia, arXiv |
 | **bright** | Bright Data — commercial web scraper | Reddit, Forbes, rtings |
 | **oxylabs** | Oxylabs — คู่แข่ง Bright Data | Gulf News, Khaleej Times, regional press |
 **labrador** คือ licensed tier — ถ้าคุณไม่ได้เป็นเจ้าของหนังสือพิมพ์ระดับประเทศ ก็เข้า tier นี้ไม่ได้
 **bright** กับ **oxylabs** คือคู่หูที่ ChatGPT จ้างมา crawl เว็บให้ โดย bright ทำหน้าที่หลักใน commercial, shopping, finance, weather ส่วน oxylabs เก็บบริเวณ regional และ local press
 ใน query เดียว — เช่นถามสภาพอากาศ — ChatGPT ดึงข้อมูลจากหลาย pipeline พร้อมกัน: `metoffice.gov.uk` ผ่าน **bright**, `khaleejtimes.com` ผ่าน **oxylabs**
 ### 📌 GEO Takeaway
 คุณแข่งขันใน **scraped tier** (bright + oxylabs) ดังนั้น:
 - **ทำให้เว็บคุณ cleanly scrapable** — ข้อมูล ตัวเลข ราคา ต้องอยู่ใน plain HTML text
 - อย่าซ่อนไว้ใน script, PDF, หรือ image — scrapers อ่านไม่ได้
 - สร้าง third-party coverage, PR, และ brand mentions — นี่คือทางเข้า scraper ที่จะเจอคุณ
 ---
 ## 6 ประเภทคำถาม — บางประเภทไม่ Search เลย
 ChatGPT มี field `turn_use_case` ที่จัดประเภทคำถามของคุณก่อนว่าจะ search หรือไม่:
 1. **instant search** — ค้นหาทั่วไป
 2. **shopping** — ค้นหาสินค้า
 3. **text** — **ไม่ search เลย**, ตอบจาก training data
 4. **local** — ค้นหารอบตัว
 5. **thinking** — reasoning mode, ค้นหาเชิงลึก
 6. **image generation** — สร้างภาพ
 **ประเภทที่ต้องสนใจมากที่สุดคือ `text`**
 เมื่อ ChatGPT จัดคำถามคุณเป็น `text` — มันจะไม่ search เลย มันตอบจากความจำใน training data คำถามประเภท how-to, definition, translation — แม้แต่ "latest treatment guidelines for type 2 diabetes" ก็ถูกจัดเป็น `text`!
 ใน 10 คำถามปัจจุบันที่เขาทดสอบ — 3 คำถามถูกจัดเป็น `text` และตอบจาก training data โดยไม่ search เลย
 **การตัดสินใจขึ้นอยู่กับ wording ไม่ใช่ topic** — "best coffee near me" → local pipeline, "best 4K TVs to buy" → shopping แต่ "best 4K TVs with reviews" → normal search
 ### 📌 GEO Takeaway
 **ก่อนลงทุนทำหน้า page หนึ่ง — เช็คก่อนว่าคำถามนี้ ChatGPT search จริงๆ หรือตอบจาก training data**
 ถ้ามันถูกจัดเป็น `text` — ไม่มี page ไหนในโลกเข้าไปได้ เพราะ ChatGPT ไม่ได้ search เลย ต้องใช้เวลาสร้าง authority และรอให้แบรนด์คุณถูกรวมใน training data รอบหน้า
 ---
 ## Fan-out Queries: 1 คำถาม → 40 การค้นหา
 ![Fan-out queries: 1 question splitting into 15-40 sub-queries](/images/blog/2026-07-01-chatgpt-source-selection-fan-out-queries.png)
 ChatGPT เปิดเผย search queries ที่มันรันผ่าน API — และใน Thinking mode **1 คำถามกลายเป็นการค้นหาย่อย 15-40 ครั้ง**
 ตัวอย่างของ query เปรียบเทียบ pricing เครื่องมือ AI visibility:
 ```
 "Profound AI search visibility pricing AI engines tracked 2026"
 "AthenaHQ pricing AI search visibility tool"
 "site:peec.ai/pricing Peec AI Starter Pro Advanced 50 prompts 150 prompts"
 "Peec AI pricing $95 $245 $495 official"  ← เดาราคาแล้ว search ยืนยัน
 "Scrunch AI pricing"  ← ไม่ได้อยู่ใน prompt แต่เจอระหว่างค้น
 ...รวม ~40 queries
 ```
 สามสิ่งที่โดดเด่น:
 1. ChatGPT ยิง `site:` probes ตรงไปที่ vendor pricing pages
 2. มัน **เดาราคา แล้ว search เพื่อยืนยัน**
 3. มัน widen ไปเรื่อยๆ — เจอ tool ที่คุณไม่ได้ถามถึง ก็ chase pricing ต่อ
 ### 📌 GEO Takeaway
 - **ตัวเลขสำคัญต้องอยู่ใน plain HTML text** — โดยเฉพาะ pricing เพราะ ChatGPT grep หา `$` และ `€`
 - JavaScript-based toggle และ dynamic data loading = ตาย เพราะ ChatGPT อ่านไม่เจอ
 - **เขียนให้ตรงกับ query ที่ ChatGPT รัน จริง — ไม่ใช่คำค้นของมนุษย์**
 - ถ้า pricing ของคุณซ่อนใน JS — ChatGPT จะยอมแพ้แล้วไป quote G2 แทน
 ---
 ## Fetched ≠ Cited ≠ Mentioned
 นี่คือความแตกต่างที่คนสับสนมากที่สุด — สามสิ่งนี้ไม่เหมือนกัน:
 1. **Fetched** — ChatGPT ดึงหน้าคุณเข้ามาใน context (user ไม่เห็น)
 2. **Cited** — ใส่ URL ของคุณเป็น source ข้างหลัง specific sentence
 3. **Mentioned** — ชื่อแบรนด์คุณโผล่ใน answer (อาจเป็น chip ลิงก์เว็บคุณ)
 **คุณสามารถชนะหรือแพ้แต่ละอย่างได้โดยอิสระ**
 ในการทดสอบของ Suganthan — Reddit ถูก fetch 278 ครั้ง แต่ cited แค่ 11 ครั้ง ส่วน YouTube ถูก fetch 201 ครั้ง แต่ **ไม่เคยถูก cited เลยสักครั้ง**
 สาเหตุ: citation ต้อง bind กับ text ที่ model อ่านได้จริง — YouTube page ให้แค่ metadata (title, description) ไม่ใช่ transcript ส่วน Reddit มี text ให้ quote เต็มๆ
 ### 📌 GEO Takeaway
 - **คุณ cite ตัวเองไม่ได้** — claim เกี่ยวกับคุณ ต้องมี third-party coverage
 - เน้น content ที่เป็น text (Reddit, blog, review sites) มากกว่า video
 - **1 หน้าที่แข็งแรง ดีกว่า 20 หน้าที่บาง** — ChatGPT dedup by domain
 - ราคาและ spec ของคุณควรอ่านได้จากหน้าของคุณเอง ไม่งั้นมือไป G2
 ---
 ## สิ่งที่ ChatGPT มองไม่เห็น — และทำไม G2 ถึงได้ cited
 ChatGPT เก็บ **chain of thought** ไว้ใน conversation — และมันอธิบายการเลือก sources ด้วยคำพูดของมันเอง:
 > "pricing page seems more current, so I should cite that"
 > "the pricing isn't showing up directly in the search result, possibly because it's loaded with JavaScript"
 > "I can quote third-party sources since the official page is hard to parse and doesn't show prices"
 > "use citations from G2 where appropriate"
 นี่คือเกมทั้งเกมใน trace เดียว — ChatGPT **อยาก cite official page** แต่ถ้า official page อ่านไม่ได้ (JS, image-based pricing) — มันจะ fallback ไปที่ G2 และ third-party sites
 **ข้อควรรู้เพิ่มเติม:**
 - ไม่มี ranking score หรือ domain authority ใน traffic ที่ browser เห็น — ใครขาย "ChatGPT ranking factors" คือขายน้ำมันงู
 - **Personalization มีจริง** — ChatGPT ดึง conversation history คุณมาใช้ใน answer (field: `personal_sources: ["convo_search", "gmail", "files"]`)
 - **Local results ถูก capped ที่ 2** — ถ้าถามร้านกาแฟใกล้ฉัน ได้แค่ 2 ที่ ไม่ใช่ top 10
 ---
 ## สรุป Actionable Takeaways
 | สิ่งที่ต้องทำ | เหตุผล |
 |:---|:---|
 | ✅ **ทำให้เว็บ cleanly scrapable** | ข้อมูล ราคา spec ต้องอยู่ใน plain HTML — scrapers อ่าน JS ไม่ได้ |
 | ✅ **สร้าง third-party coverage** | 85% ของ citations มาจาก third-party sources — PR, Reddit, review sites |
 | ✅ **เขียนเพื่อ query ที่ ChatGPT รันจริง** | ChatGPT ไม่ได้ใช้คำค้นมนุษย์ — มัน rephrase และยิง site: probes |
 | ✅ **เน้น text-based content** | YouTube ถูก cited = 0 — Reddit ถูก cited ตลอด |
 | ✅ **1 หน้าที่แข็งแรง ดีกว่า 20 หน้าบาง** | ChatGPT dedup by domain |
 | ❌ **อย่าซ่อน pricing ไว้ใน JS** | ChatGPT จะยอมแพ้และไป cite G2 แทน |
 | ❌ **อย่าลงทุนกับคำถามที่ ChatGPT ไม่ search** | เช็คก่อนว่าคำถามของคุณถูกจัดเป็น `text` หรือไม่ |
 **ทิ้งท้าย:** ChatGPT ไม่ใช่ search engine ดังนั้นหยุด optimize เหมือนทำ SEO บน Google มันอ่านหน้า **คุณเพื่อหาข้อเท็จจริง** และอ่าน **third-party เพื่อหาความคิดเห็น** — และมันจะทำอย่างนั้นก็ต่อเมื่อคำถามนั้นคุ้มค่าที่จะ search
 ---
 **ที่มา:**
 - Suganthan Mohanadasan, "How ChatGPT Actually Picks Sources (I Read The Network Traffic, Not The Outputs)" — Search Engine Journal, June 30, 2026
 - arXiv 2606.25787 — "How Large Language Models Source Brand Reputation Across Languages and Markets"
--- a/src/content/blog/2026-07-01-deepseek-dspark-devin-fusion.md
+++ b/src/content/blog/2026-07-01-deepseek-dspark-devin-fusion.md
@@ -0,0 +1,217 @@
 ---
 title: "DeepSeek DSpark + Devin Fusion: ถึงเวลา Multi-Model Architecture ที่ลดต้นทุน AI 85%"
 description: "สองเทรนด์ที่เปลี่ยนวิธีคิดเรื่อง AI inference: DeepSeek DSpark เร่งความเร็ว 85% ด้วย speculative decoding ส่วน Devin Fusion ลดต้นทุน 35% ด้วย multi-model harness"
 pubDate: 2026-07-01
 category: "AI & Marketing"
 heroImage: "/images/blog/2026-07-01-deepseek-dspark-devin-fusion-featured.png"
 tags: ["DeepSeek", "DSpark", "Devin", "Fusion", "Speculative-Decoding", "Multi-Model"]
 draft: false
 ---
 # DeepSeek DSpark + Devin Fusion: ถึงเวลา Multi-Model Architecture ที่ลดต้นทุน AI 85%
 > คุณไม่ขับ Lamborghini ไปซื้อของหน้าปากซอย — แล้วทำไมใช้ GPT-5.6 เพื่อตรวจ grammar?
 สองข่าวใหญ่ในสัปดาห์นี้ชี้ทางเดียวกัน: **ยุคของ single-model-for-everything กำลังจะจบลง**
 1. **DeepSeek เปิดซอร์ส DSpark** — เร่งความเร็ว LLM inference ได้ 60-85% โดยใช้ "scout model" เดา path ล่วงหน้า (MIT License)
 2. **Cognition เปิด Devin Fusion** — multi-model harness ลดต้นทุน 35% โดยใช้ two-agent architecture (main + sidekick)
 ทั้งสอง approach มีแนวคิดเดียวกัน: **ไม่ต้องใช้ flagship model สำหรับทุกงาน** — แค่ใช้มันในจังหวะที่จำเป็นเท่านั้น
 ## 📚 สารบัญ
 - DSpark: AI Scout ที่เดาทางก่อน
 - Devin Fusion: Two-Agent Architecture
 - เปรียบเทียบ: กลไกเดียวกัน, scale ต่างกัน
 - Sidekick Pattern: ทำไม Fable 5 ถึง performance ดีกว่าใน multi-model setup
 - Dynamic Mid-Session Routing
 - ผลกระทบต่อธุรกิจ
 - สรุป: อนาคตของ AI Cost Architecture
 ---
 ## DSpark: AI Scout ที่เดาทางก่อน
 ![Speculative decoding: scout model generates tokens ahead, main model verifies](/images/blog/2026-07-01-deepseek-dspark-devin-fusion-speculative-decoding-flow.png)
 DeepSeek เปิดซอร์ส DSpark ภายใต้ MIT License — พร้อม technical paper, model checkpoints, และ DeepSpec (codebase สำหรับ speculative decoding)
 ### หลักการทำงาน
 LLM เขียนข้อความทีละ token — แต่ละ token ต้องรอให้ token ก่อนหน้าเสร็จก่อน เหมือนคนข้ามแม่น้ำทีละก้อนหิน
 DSpark เพิ่ม **scout model** ที่วิ่งล่วงหน้า 2-3 ก้าว:
 1. **Scout** เดา path ที่น่าจะเป็น
 2. **Main model** ตรวจสอบความถูกต้อง
 3. ถ้าเดาถูก — ข้ามไปได้เลย (เร็วกว่า)
 4. ถ้าเดาผิด — เสียเวลาแค่ตรวจสอบ path สั้นๆ
 | เปรียบเทียบ | Before (MTP-1) | After (DSpark) |
 |:---|:---:|:---:|
 | V4-Flash per-user speed | baseline | +60% ถึง **85%** 🚀 |
 | V4-Pro per-user speed | baseline | +57% ถึง **78%** |
 | V4-Flash throughput (80 tokens/s/user) | baseline | +51% |
 | V4-Pro throughput (35 tokens/s/user) | baseline | +52% |
 | V4-Flash ที่ 120 tokens/s/user | near collapse | +661% throughput |
 | V4-Pro ที่ 50 tokens/s/user | near collapse | +406% throughput |
 **ตัวเลข 661% และ 406%** ฟังดูเวอร์ แต่ต้องเข้าใจบริบท: มันคือ **increased throughput under strict speed targets** — ถ้า baseline รับ concurrent users ได้ 10 คนที่ speed 120 t/s, DSpark รับได้ ~70 คน — ไม่ใช่ "token generation เร็วขึ้น 6 เท่า"
 ที่สำคัญ — DSpark **ไม่จำกัดแค่ DeepSeek-V4** — มันรองรับ Qwen และ Gemma ด้วย (มี checkpoints ให้)
 ### Speculative Decoding คืออะไร?
 ปัญหาพื้นฐานของ LLM: มันสร้าง output ได้แค่ **1 token ต่อครั้ง** — เทียบกับมนุษย์ที่คิดทั้งประโยคก่อนพูด
 Speculative decoding = ให้ "draft model" (เล็กกว่า เร็วกว่า) เดา output ล่วงหน้า แล้วให้ "target model" (ใหญ่กว่า) ตรวจสอบและแก้ไข
 ถ้า draft model เดาถูก — target model verify เป็น batch (แทนทีละ token) → เร็วกว่า **โดยไม่เปลี่ยน output**
 ---
 ## Devin Fusion: Two-Agent Architecture
 ![Two-agent architecture: frontier model delegates to sidekick model](/images/blog/2026-07-01-deepseek-dspark-devin-fusion-two-agent-architecture.png)
 Cognition ประกาศ Devin Fusion — ไม่ใช่แค่ "โมเดลผสม" แต่เป็น **multi-model harness ที่ซับซ้อนกว่านั้นมาก**
 ### หลักการทำงาน
 Devin Fusion ใช้ **2 agents ที่ทำงานพร้อมกัน**:
 1. **Main Agent** (frontier model) — วางแผน, ตัดสินใจ, ตรวจสอบ
 2. **Sidekick Agent** (cost-effective model) — execute งานย่อย, รันเทส, เขียน boilerplate
 **ทั้งสอง maintain context cache ของตัวเอง** — ต่างจาก architecture อื่นที่ต้อง serialize context เมื่อเปลี่ยน model
 ผลลัพธ์บน FrontierCode benchmark:
 | Config | Score | Cost/Task |
 |:---|:---:|:---:|
 | Fusion + Fable 5 | **57.6** | **$3.00** |
 | Fable 5 (medium) alone | 57.0 | $5.12 |
 | Fusion alone | 47.9 | $2.38 |
 | Opus 4.8 (high) | 48.8 | $3.24 |
 | GPT-5.5 (high) | 44.8 | $2.70 |
 **น่าสนใจ:** Fusion + Fable 5 = 57.6 (คะแนนสูงสุด) + $3.00 (ต้นทุนถูกกว่า Fable 5 alone 41%)
 Fusion ที่ไม่มี Fable 5 = 47.9 (คะแนนต่ำกว่า Opus 4.8 เล็กน้อย แต่ต้นทุนต่ำกว่า 26%)
 ### Sidekick in Action
 Cognition ยกตัวอย่างจริง:
 1. **Modernize search.js → ES6** — เขียน diff เสร็จ ส่ง sidekick รัน test suite ที่ช้า → cost -62%, score +2 (98→100) ✅
 2. **Rip out OpenTracing integration** — mechanical removal หลายไฟล์ → cost -32%, score -1 (98→97) ✅
 3. **Add team selector with React/Redux** — hard feature, judgment-driven → cost -28% **แต่ score -27 (54→27)** ❌
 **บทเรียน:** เมื่องานต้องการ judgment — การ delegate coding ให้ sidekick ทำให้คุณภาพตก Hard tasks จะแพ้ทาง sidekick
 4. **Integrate LangChain4j WebSocket MCP** — hard task แต่กลไกเป็น mechanical reuse → cost -25%, **score +12 (69→81)** 🏆
 ---
 ## เปรียบเทียบ: DSpark vs Devin Fusion
 | มิติ | DSpark | Devin Fusion |
 |:---|:---|:---|
 | ปัญหาที่แก้ | Inference speed | Cost-quality tradeoff |
 | กลไก | Speculative decoding (draft + target) | Multi-agent routing (main + sidekick) |
 | Scale | Model-level (token generation) | Task-level (user request) |
 | จุดเด่น | 85% per-user speedup | 35-41% cost reduction |
 | Implementation | ควบคุม weights + serving stack | API-level, ไม่ต้องควบคุม weights |
 | Licensing | MIT License | Proprietary |
 | ใครใช้ได้ | Enterprise ที่ deploy own models | ผู้ใช้ Devin (cloud service) |
 **ทั้งสอง approach มาจากรากเดียวกัน:** Don't use the big model for everything. ใช้ big model เฉพาะจุดที่ต้องใช้ real intelligence ส่วนงานที่ routine / predictable — ให้ sidekick (DSpark: draft model → Devin Fusion: cost-effective model) จัดการ
 ---
 ## Sidekick Pattern: ทำไมยิ่งใช้ ยิ่งได้ผล
 Cognition พบ pattern สำคัญ: **Sidekick scales better as models get smarter**
 Fable 5 (Anthropic's latest frontier) ทำงานใน Devin Fusion ได้ performance ดีกว่า Opus 4.8 หรือ GPT-5.5 ใน role ของ main agent — เพราะ Fable 5:
 - **Delegate งานได้ฉลาดกว่า** — รู้ว่างานไหนควรส่ง sidekick
 - **Request context ได้มีประสิทธิภาพกว่า** — ไม่ขอข้อมูลซ้ำซ้อน
 - **Plan ได้แม่นยำกว่า** — ลดความจำเป็นต้องเปลี่ยนแผนระหว่างทาง
 | Model | Fusion Cost Reduction |
 |:---|:---:|
 | Fable 5 | **41%** 🏆 |
 | Opus 4.8 | ~35% |
 | GPT-5.5 | ~35% |
 > "The sidekick pattern is one that will become more useful as base models get better." — Cognition Team
 ---
 ## Dynamic Mid-Session Routing
 หนึ่งในความท้าทายของการใช้ multi-model: **เลือก model ผิดตั้งแต่ต้น? หรือ task ยากกว่าที่คิด?**
 Devin Fusion แก้ด้วย **Dynamic Mid-Session Routing**:
 - ระหว่างทำงาน จะมี lightweight classifiers ส่งสัญญาณว่า "ถึงเวลาต้องเปลี่ยน model"
 - การเปลี่ยน model เกิดขึ้นตอน **context compaction** — ตอนที่ architecture ต้องเคลียร์ cache อยู่แล้ว → ไม่มี extra cost
 - Route ได้ทั้งจาก sidekick → main, หรือ main → sidekick
 **นั่นหมายความว่า:** แม้ sidekick จะเจองานที่ยากเกินไป — system สามารถ escalate กลับไปหา main agent โดยไม่เสีย cache penalty
 ---
 ## ผลกระทบต่อธุรกิจ
 ### 1. AI Cost Reduction ถึง 85%
 รวมผลจากสองเทรนด์:
 - DSpark: 85% per-user speedup → **ลด compute cost ต่อ request**
 - Devin Fusion: 35-41% cost reduction → **ลด API cost ต่อ task**
 - รวม: enterprise ที่ deploy own models + ใช้ multi-model routing = **ลดต้นทุน AI ได้มากถึง 50-70%**
 ### 2. ยุคของ Agentic Architecture
 AI ไม่ใช่แค่ chatbot อีกต่อไป — แต่มันคือ **system of agents** ที่ทำงานร่วมกัน:
 - Planner agent (main, frontier)
 - Executor agent (sidekick, cost-effective)
 - Reviewer agent (verify quality)
 - Router agent (decide which model for which task)
 ### 3. Open Source vs Proprietary
 DSpark (MIT) = enterprise ที่ deploy own weights สามารถใช้ได้ทันที Devin Fusion (proprietary) = ต้องใช้ผ่าน Devin cloud — สำหรับทีมที่ไม่ต้องการจัดการ infrastructure เอง
 ### 4. Fable 5 กับ Geopolitics
 Fable 5 ถูก US government สั่งระงับ access วันที่ 12 มิถุนายน 2026 — Cognition จึงรายงานผล Fable 5 จาก measurement ก่อน suspension
 นี่เพิ่มความไม่แน่นอนให้ enterprise ที่พึ่งพา frontier model เจียวเดียว — ข้อดีของ multi-model architecture คือ resilience ต่อ supply shock
 ---
 ## สรุป: อนาคตของ AI Cost Architecture
 ```
 ก่อน: 1 Model → 1 Task (expensive for everything)
     GPT-5.6 → เขียน email, คิดโค้ด, ตรวจ grammar, ทุกอย่าง
 หลัง: Multi-Model → Smart Routing
     Frontier Model → เฉพาะ complex reasoning + judgment
     Cost-effective Model → routine tasks, boilerplate, test
     Open-source Speculative Decoding → faster token gen
 ```
 **DeepSeek DSpark** แก้ที่ layer ของ **token generation** — ทำให้ inference ถูกและเร็วขึ้นโดยไม่เปลี่ยน output
 **Devin Fusion** แก้ที่ layer ของ **task orchestration** — ทำให้ task ถูก execute ด้วย model ที่เหมาะสมที่สุด
 ทั้งสอง = อนาคตที่ AI deployment ไม่ bottleneck ที่ cost อีกต่อไป
 > "The age of using one model for all of your work is coming to an end." — Cognition Team
 ---
 **ที่มา:**
 - Carl Franzen, "DeepSeek open sources DSpark" — VentureBeat, June 29, 2026
 - Cognition Team, "Devin Fusion: Frontier Performance at 35% Lower Cost" — cognition.com, June 29, 2026
 - DeepSeek, "DSpark: Confidence-Scheduled Speculative Decoding" — GitHub/deepseek-ai/DeepSpec