I am training phi-4 (14B) using a single A6000. There’s some tricks you have to ...

		deepsquirrelnet on April 3, 2025 \| parent \| context \| favorite \| on: Search-R1: Training LLMs to Reason and Leverage Se... I am training phi-4 (14B) using a single A6000. There’s some tricks you have to use to keep VRAM consumption down - mainly LoRA and quantization. There’s a package called “unsloth” that integrates with huggingface’s TRL library that can help.