Efficient LLMs at Scale: My NeurIPS Week in KV Caches, Spec Decoding, and FP4
Notes on RNJ-1, K2-V2, Devstral 2, and GLM-4.6V
Eagle 3 Speculators: When To Use Them?
Mistral Large 3: Not a Reasoning Model
Quantizing Olmo 3: Most Efficient and Accurate Formats
Scaling RL and Self-Verifiable Reasoning: INTELLECT-3 and DeepSeekMath-V2
Accelerate Models with Quantization: Recipes for NVFP4, GPTQ, AWQ, SmoothQuant, AutoRound, and FP8
Olmo 3 Is Here!
Best GPUs Under $1,500 for AI: Should You Upgrade?
The Limits of GRPO-like Methods for Reinforcement Learning
Unsloth’s Quantization-Aware Training (QAT) vs Post-Training Quantization (PTQ) for Small Models
BF16 vs FP16 for Reinforcement Learning: Where Are We?
Advanced LoRA Fine-Tuning: How to Pick LoRA, QLoRA, DoRA, PiSSA, OLoRA, EVA, and LoftQ for LLMs
MiniMax M2 and Kimi-Linear: Why Full Attention Still Wins
Generate Better Synthetic Datasets with a "User" LLM
The Weekly Kaitchup #115
Qwen3-VL Fine-Tuning on Your Computer
DGX Spark: Use It for Fine-Tuning
Choosing a GGUF Model: K-Quants, I-Quants, and Legacy Formats
Tiny Recursive Models for Very Specific Problems
Why Increasing Batch Size Doesn’t Always Speed Up Training