← All tags

#llmops

3 posts

Posts tagged llmops

llmopsinferenceself-hosting

Serving open models with vLLM

Hands-on guide to self-hosting open-weights LLMs with vLLM: install, serve an OpenAI-compatible API, quantize, benchmark, and manage VRAM.

6 min read
fine-tuningtrainingllmops

Fine-tuning an open model with QLoRA

A hands-on QLoRA fine-tuning walkthrough: dataset prep, 4-bit training with peft and trl, merging, and vLLM serving behind an OpenAI-compatible API.

7 min read