š§ Welcome to Sam-large-2: Dual-Mode Reasoning Demo
Our latest model features Chain-of-Thought (CoT) functionality and KV-Cache optimization for fast CPU inference!
š” Reasoning Mode (ON)
The model performs a CoT step first. The internal thought process is contained within <think>...</think> tags.
āŖ Direct Mode (OFF)
The model generates the final answer immediately, maximizing speed.