Tarragon

搜尋文章標籤 Now RSS

"Model-Family"

2026-05-12 Reasoning Model 訓練成自然輸出長 reasoning trace 的 LLM 變體、o1 / DeepSeek-R1 / Claude thinking 為代表

Tarragon (CC BY 4.0) | 使用 hugo 製作