SIGNAL GRID v0.1

RTX 5060 Ti 16GB Local LLM Findings: 30B Still Wins, 35B UD Is Surprisingly Fast

1 source · 1 story · First seen 3/20/2026 · Score 26 · Mixed Progress
Single Source
Bigness: 26
Coverage: 13
Recency: 86
Engagement: 9
Velocity: 0
Confidence: 49
Clipability: 55
Polarization: 0
Claims: 4
Contradictions: 0
Breakthrough: 50

Sentiment Mix

Positive 100% · Neutral 0% · Negative 0%

Geography

North America

Expert Signals

Imaginary-Anywhere23 (author): 1 mention

r/LocalLLaMA (source): 1 mention

AI-Generated Claims

Generated from linked receipts; click sources for full context.

RTX 5060 Ti 16GB Local LLM Findings: 30B Still Wins, 35B UD Is Surprisingly Fast.

Supported by 1 story

Bought a 5060 Ti 16 GB and tried various models.

Supported by 1 story

This is the short version of how I decided what to run on this card with `llama.cpp`, not a giant benchmark dump.

Supported by 1 story

Machine:

* RTX 5060 Ti 16 GB
* DDR4 now at 32 GB
* llama-server `b8373` (`46dba9fce`)

Relevant launch settings:

* fast path: `fa=on`, `ngl=auto`, `threads=8`
* KV: `-ctk q8_0 -ctv q8_0`
* 30B coder path: `jinja`, `reasoning-budget 0`, `reasoning-format none`
* 35B UD path: `c=262144`, `n-cpu-moe=8`
* 35B `Q4_K_M` stable tune: `-ngl 26 -c 131072 --fit on --fit-ctx 131072 --fit-target 512M`

Short version:

* Best default coding model: `Unsloth Qwen3-Coder-30B UD-Q3_K_XL`
* Best higher-context coding option: the same `Unsloth 30B` model at `96k`
* Best fast 35B coding option: `Unsloth Qwen3.5-35B UD-Q2_K_XL`
* `Unsloth Qwen3.5-35B Q4_K_M` is interesting, but still not the right default on this card

What surprised me most is that the practical winners here were not just "smaller is...

Supported by 1 story
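The quoted settings for the 30B coder fast path can be assembled into a single `llama-server` invocation. A minimal sketch, assuming the long-form flag spellings of a recent llama.cpp build (exact support depends on the build, here `b8373`); the model filename and port are hypothetical placeholders, while the flag values themselves come from the post:

```shell
# Sketch of the 30B coder fast path described above.
# The .gguf filename and --port are assumptions, not from the post.
llama-server \
  --model ./Qwen3-Coder-30B-UD-Q3_K_XL.gguf \
  --port 8080 \
  --flash-attn on \                        # fa=on
  --n-gpu-layers auto \                    # ngl=auto
  --threads 8 \
  --cache-type-k q8_0 --cache-type-v q8_0 \  # KV cache quantized to q8_0
  --jinja \                                # use the model's chat template
  --reasoning-budget 0 \
  --reasoning-format none
```

The 35B variants would swap in the other quoted flags (e.g. `-c 262144 --n-cpu-moe 8` for the UD path); this is a launch-config fragment, not something runnable without the card and model files.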

Related Events

Timeline (1 story)

Receipts (1)

Bias Snapshot

Center
Left 0% · Center 100% · Right 0%
Social · i.redd.it · 3/20/2026