RTX 5060 Ti 16GB Local LLM Findings: 30B Still Wins, 35B UD Is Surprisingly Fast
Posted by Imaginary-Anywhere23 on r/LocalLLaMA
I bought an RTX 5060 Ti 16 GB and tried various models.
This is the short version of how I decided what to run on this card with `llama.cpp`, not a giant benchmark dump.
Machine:

* RTX 5060 Ti 16 GB
* DDR4 now at 32 GB
* llama-server `b8373` (`46dba9fce`)

Relevant launch settings:

* fast path: `fa=on`, `ngl=auto`, `threads=8`
* KV: `-ctk q8_0 -ctv q8_0`
* 30B coder path: `jinja`, `reasoning-budget 0`, `reasoning-format none`
* 35B UD path: `c=262144`, `n-cpu-moe=8`
* 35B `Q4_K_M` stable tune: `-ngl 26 -c 131072 --fit on --fit-ctx 131072 --fit-target 512M`

Short version:

* Best default coding model: `Unsloth Qwen3-Coder-30B UD-Q3_K_XL`
* Best higher-context coding option: the same `Unsloth 30B` model at `96k`
* Best fast 35B coding option: `Unsloth Qwen3.5-35B UD-Q2_K_XL`
* `Unsloth Qwen3.5-35B Q4_K_M` is interesting, but still not the right default on this card

What surprised me most is that the practical winners here were not just "smaller is...
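For anyone who wants to reproduce the launch settings listed above, here is a minimal Python sketch that turns them into `llama-server` command lines. Several things in it are assumptions and not from the post: the model file paths are hypothetical, the helper name `build_command` is made up for illustration, the shorthand (`fa=on`, `ngl=auto`, `c=262144`, and so on) is read as the corresponding `llama-server` flags, and the fast-path and KV settings are assumed to be shared by both profiles. Check the flag spellings against `llama-server --help` for your build before relying on them.

```python
import shlex

# Hypothetical model paths; the post does not give file names.
MODEL_30B = "models/qwen3-coder-30b-ud-q3_k_xl.gguf"
MODEL_35B_UD = "models/qwen3.5-35b-ud-q2_k_xl.gguf"

# Assumption: the "fast path" and KV cache settings apply to every profile.
BASE = [
    "-fa", "on",        # fa=on
    "-ngl", "auto",     # ngl=auto
    "--threads", "8",   # threads=8
    "-ctk", "q8_0",     # quantized K cache
    "-ctv", "q8_0",     # quantized V cache
]

# Per-model extras, copied from the post's "30B coder path" and "35B UD path".
PROFILES = {
    "30b-coder": ["--jinja", "--reasoning-budget", "0", "--reasoning-format", "none"],
    "35b-ud": ["-c", "262144", "--n-cpu-moe", "8"],
}


def build_command(profile: str, model_path: str) -> list[str]:
    """Return the llama-server argv for one profile (illustrative helper)."""
    return ["llama-server", "-m", model_path, *BASE, *PROFILES[profile]]


if __name__ == "__main__":
    # Print shell-ready command lines for both profiles.
    print(shlex.join(build_command("30b-coder", MODEL_30B)))
    print(shlex.join(build_command("35b-ud", MODEL_35B_UD)))
```

Keeping the shared flags in one list makes it easy to change a single setting, such as the KV cache type, and rerun both models under otherwise identical conditions.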